-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement parquet_metadata
function in datafusion-cli
#8367
Comments
parquet_metadata
function in datafusion-cli
This would also be a great test of the user defined table function feature to see if we can build something slightly more complicated than |
I can help with this ticket as a following PR to #8306 |
After this ticker was finished, maybe we can have a list of internal table functions to implement, just like:
|
I am not sure about Instead of
It uses
|
Is your feature request related to a problem or challenge?
When exploring Parquet files using
datafusion-cli
I would often like to see how they are structured (how many row groups, if they have statistics, etc).Describe the solution you'd like
I would like to create new functions for exploring parquet metadata using the new User Defined Table Functions (🙇 to @Veeupup ) introduced in #8306
Ideally we could implement something like
parquet_metadata
: https://duckdb.org/docs/data/parquet/overview(I think parquet_schema is covered by
describe 'filename.parquet'
alreadyNote I think this should be done in datafusion-cli (not core DataFusion)
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: