Update doc to note that single quoted json strings are not ok #2316
Signed-off-by: Sameer Raheja <sraheja@nvidia.com>
Do we need to mention unescaped control characters as well? I assume the cudf json parser does this differently. cc: @nvdbaranec
We don't do any explicit filtering for them at the moment, so they "should" just come through.
Worth mentioning that this behavior will be fixed in the future?
Updated to mention this will be addressed in the future.
build
Actually, to add to the confusion a bit: the JSONPath string (the query itself) allows single quotes. In fact, the spec for it requires them. It's the input JSON data itself that requires the double quotes. Might be worth a clarification.
Updated to note that the double quote requirement is for strings in JSON data.
build
…#2316)
* Update doc to note that single quoted json strings are not ok
* Correct capitalization of PySpark
* operator -> operation
* Mention the behavior will be updated in the future.
* Note that the double quote requirement is for strings in JSON data
Signed-off-by: Sameer Raheja <sraheja@nvidia.com>
Update doc to note that `get_json_object` on the GPU only supports double quoted strings in JSON per the http://json.org/ spec. Spark appears to support single quoted strings in JSON per https://github.com/apache/spark/blob/4e8701a77dff729c4e8e0ad39c16e2717c2c32fe/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/jsonExpressions.scala#L108 . Single quote support will be addressed in a future release.
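To illustrate the distinction between double quoted and single quoted JSON data, here is a minimal sketch using Python's standard `json` module as a stand-in for a strict json.org parser. It is not the GPU implementation and does not call `get_json_object`; the helper name `is_strict_json` and the sample strings are hypothetical. It only shows which inputs a strict parser accepts, which is the behavior the doc update describes for JSON data on the GPU.

```python
import json


def is_strict_json(text: str) -> bool:
    """Return True if `text` parses under the strict JSON grammar (hypothetical helper)."""
    try:
        json.loads(text)
        return True
    except json.JSONDecodeError:
        return False


# Sample JSON data (illustrative values, not taken from the PR).
double_quoted = '{"name": "spark"}'   # valid per the json.org spec
single_quoted = "{'name': 'spark'}"   # rejected by a strict parser

# The JSONPath query itself may use single quotes; only the JSON data
# is subject to the double quote requirement.
json_path = "$['name']"

print(is_strict_json(double_quoted))
print(is_strict_json(single_quoted))
```

Data that fails a check like this one is the kind of input where, per the description above, CPU Spark and the GPU may currently disagree.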