-
Notifications
You must be signed in to change notification settings - Fork 232
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[window] Disable GPU for COUNT(exp) queries #666
Conversation
GpuWindowExec currently counts null-rows when running COUNT(col) (or generally COUNT(expr)) window queries, owing to a bug in CUDF/Java. Left unchecked, this will produce incorrect results for said queries. This commit disables GPU acceleration for COUNT(expr) queries, while retaining support for COUNT(1) and COUNT(*). This may be reverted once we have a fix in CUDF/Java. Signed-off-by: Mithun RK <mythrocks@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we link the cudf bug issue to this PR? Approving as changes look good to me.
Linking the bug will close it when this PR is merged. (Please correct me if that's wrong.) I'd like to keep it open, and actually fix it in |
Oh, I didn't know that cudf issues would close automatically as well if a PR in this repo went in! Thought it was only the rapids plugin issues. I stand corrected. |
D'oh! Wait, maybe not. I'll raise the CUDF bug and leave a link here. |
rapidsai/cudf#6156 seems to be the cause. The |
For it to be closed you have to say something like fixes hash-bug-number or closes hash-bug-number |
build |
The text under "Linked issues" on the right pane seems to suggest that the issues listed there "may" be closed. In any case, I can't seem to manually link CUDF issues or indeed anything outside of |
Thanks for the reviews, all. |
GpuWindowExec currently counts null-rows when running COUNT(col) (or generally COUNT(expr)) window queries, owing to a bug in CUDF/Java. Left unchecked, this will produce incorrect results for said queries. This commit disables GPU acceleration for COUNT(expr) queries, while retaining support for COUNT(1) and COUNT(*). This may be reverted once we have a fix in CUDF/Java. Signed-off-by: Mithun RK <mythrocks@gmail.com>
GpuWindowExec currently counts null-rows when running COUNT(col) (or generally COUNT(expr)) window queries, owing to a bug in CUDF/Java. Left unchecked, this will produce incorrect results for said queries. This commit disables GPU acceleration for COUNT(expr) queries, while retaining support for COUNT(1) and COUNT(*). This may be reverted once we have a fix in CUDF/Java. Signed-off-by: Mithun RK <mythrocks@gmail.com>
…IDIA#666) Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com> Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Mitigates #218.
GpuWindowExec currently counts null-rows when running
COUNT(col)
(or generally
COUNT(expr)
) window queries, owing to a bug in CUDF/Java.Left unchecked, this will produce incorrect results for said queries.
This commit disables GPU acceleration for
COUNT(expr)
queries, whileretaining support for
COUNT(1)
andCOUNT(*)
.This may be reverted once we have a fix in CUDF/Java.
Please note that there is already an XFAIL test that checks this condition, in window_function_test.py
Signed-off-by: Mithun RK mythrocks@gmail.com