You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We add support (order by / sort) for DataFrameWriteOptions, but when a user try to query the table which the file already ordered, we can't get info from the table.
We need to find a way to communicate the ordering of a file back with the existing listing table implementation.
Describe the solution you'd like
It is also conceivable that DataFusion itself could write custom metadata in paquet and other formats that support that custom metadata with the ordering, but that seems like we can use iceberg and other table formats.
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered:
There is field for sort column, but it seems rowgroup level metadata, so when we set the sort column to parquet, it will applied to rowgroup level metadata.
@alamb This is a good idea for file level metadata storage. And i am wandering do we need to add sort column to parquet file metadata also besides the row group level metadata, so we can use it in datafusion?
Is your feature request related to a problem or challenge?
This is the follow-up for:
#13874 (review)
We add support (order by / sort) for DataFrameWriteOptions, but when a user try to query the table which the file already ordered, we can't get info from the table.
We need to find a way to communicate the ordering of a file back with the existing listing table implementation.
Describe the solution you'd like
It is also conceivable that DataFusion itself could write custom metadata in paquet and other formats that support that custom metadata with the ordering, but that seems like we can use iceberg and other table formats.
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: