[FEA] support reading decimal data stored as byte array from parquet files #6909
Labels
cuIO
cuIO issue
feature request
New feature or request
improvement
Improvement / enhancement to an existing function
libcudf
Affects libcudf (C++/CUDA) code.
Spark
Functionality that helps Spark RAPIDS
Currently, we can read decimal columns from parquet files, if their storage type are INT32 or INT64. But in real world applications, there are many parquet files containing decimal columns stored with FIXED_LENGTH_BYTE_ARRAY.
I think we had better to support reading them as fixed-point data type, just as integer based decimal columns. For data exceeding 8 bytes, perhaps we can perform rounding cast to fit them in DECIMAL64?
The text was updated successfully, but these errors were encountered: