-
Notifications
You must be signed in to change notification settings - Fork 232
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] mergeSchema on ORC reads does not work #135
Comments
This is actually rather complex now that I have dug into it and to fully make this work we are going to need to support schema evolution for orc. Which is rather hard. I filed rapidsai/cudf#5447 for this with CUDF. |
In the short term I am going to do what I can to fall back to the CPU in cases we know that will not work. |
Marking this for us to look at again because it is related to #5445 in parquet. |
Seems Personally the schema evolution is required to support the user specified schema. I can run into the type casting case even without
|
Filed a new issue #5895 to track the schema evolution feature. |
Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Describe the bug
This is a lot like #60 but for ORC files. If you try to use
mergeSchema
or provide your own reader schema that has more columns than the orc file does it results in an error.Steps/Code to reproduce bug
an integration test is being added for this.
The text was updated successfully, but these errors were encountered: