Disable write/read Parquet when Parquet field IDs are used #4882

Merged
merged 5 commits into from
Mar 7, 2022

Conversation

res-life
Collaborator

res-life commented Mar 1, 2022

Temporarily disable writing and reading Parquet on the GPU when the schema specifies a Parquet field ID.
The solution: fall back to the CPU when writing Parquet with a column that has a field ID, and fall back to the CPU when reading Parquet if "spark.sql.parquet.fieldId.read.enabled" is true.
Contributes to #4846

Signed-off-by: Chong Gao <res_life@163.com>
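
The fallback rule described above can be sketched roughly as follows. This is only an illustration in plain Python, not the plugin's actual code: the `Field` class and both helper functions are hypothetical, the metadata key `parquet.field.id` is assumed to be where Spark stores a field ID, recursing into nested fields is an assumption, and an unset read flag is assumed to mean false.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Assumption: Spark keeps a column's Parquet field ID in its metadata under this key.
FIELD_ID_METADATA_KEY = "parquet.field.id"
# Config name quoted in the PR description.
READ_CONF = "spark.sql.parquet.fieldId.read.enabled"


@dataclass
class Field:
    """Hypothetical stand-in for a schema field (not a Spark class)."""
    name: str
    metadata: Dict[str, object] = field(default_factory=dict)
    children: List["Field"] = field(default_factory=list)


def has_field_id(fields: List[Field]) -> bool:
    """True if any field, at any nesting level, carries a field ID."""
    return any(
        FIELD_ID_METADATA_KEY in f.metadata or has_field_id(f.children)
        for f in fields
    )


def write_falls_back_to_cpu(schema: List[Field]) -> bool:
    """Write path: fall back when the schema has a field ID column."""
    return has_field_id(schema)


def read_falls_back_to_cpu(conf: Dict[str, str]) -> bool:
    """Read path: fall back when the read flag is set to true."""
    # Assumption: an unset flag is treated as false.
    return str(conf.get(READ_CONF, "false")).lower() == "true"
```

Note the asymmetry in the description: the write decision depends on the schema, while the read decision depends only on the config flag.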

…rquet field ID

Signed-off-by: Chong Gao <res_life@163.com>
@jlowe jlowe changed the title Temporarily disable write/read parquet when schema has a specified Pa… Disable write/read Parquet when Parquet field IDs are used Mar 1, 2022
@jlowe jlowe added this to the Feb 28 - Mar 18 milestone Mar 1, 2022
@sameerz sameerz added the audit_3.3.0 Audit related tasks for 3.3.0 label Mar 1, 2022
@firestarman
Collaborator

build

1 similar comment
@sameerz
Collaborator

sameerz commented Mar 4, 2022

build

@res-life res-life merged commit d30adec into NVIDIA:branch-22.04 Mar 7, 2022
@res-life res-life deleted the tag-parquet-field-id branch March 13, 2022 05:07
Labels
audit_3.3.0 Audit related tasks for 3.3.0
5 participants