Panic: 'collect(streaming=True)' on 'scan_parquet' Fails for Hive-Partitioned Parquet Files in Azure Storage #18779
Closed
2 tasks done
Labels
accepted
Ready for implementation
bug
Something isn't working
needs triage
Awaiting prioritization by a maintainer
python
Related to Python Polars
Checks
Reproducible example
File structure:
/external-data/
Log output
Issue description
Additional info based on testing:
Regarding tested polars versions:
thread 'polars-3' panicked at crates\polars-parquet\src\parquet\encoding\bitpacked\decode.rs:41:49:
called
Result::unwrap()
on anErr
value: OutOfSpec("Bitpacking requires num_bits > 0")The only somewhat similar issues I could find were: #12635 and #13162
This issue doesn't seem to be affected by the number of selected columns or threads as suggested in these.
Expected behavior
Collect dataframe without panick.
Installed versions
The text was updated successfully, but these errors were encountered: