-
Notifications
You must be signed in to change notification settings - Fork 232
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove INCOMPAT for NormalizeNanAndZero, KnownFloatingPointNormalized #181
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is it worth considering removing the has nan check in hash aggregate at this time?
build |
I wish we had a way to flag if a test was marked as incompat, but didn't need it. The pyspark has aggregate tests also have incompat on several of them for this reason. |
I should try and remove |
(Removed incompat in Python integration test for HashAggregateExec, where obvious.)
I have removed the @incompat for the NaN/Zero test cases added in #160. I tried going through and removing the |
buil |
build |
@kuhushukla, I'm looking into this now. |
Removed obviated check for NaNs from GpuHashAggregateMeta
@kuhushukla, I have now removed the NaNs check from That's the only change in the last commit (0a241a8). Could you please confirm that I haven't missed anything? |
build |
I dont think any of the tests from before what u have just added test for the grouping key as Nan. I suspect we will hit some issues around that but since all of the legacy tests were written with has_nans=false in mind I would probably not rely on them too much and possibly look at adding more tests for it. This change then makes me think if it should be a separate PR at this point. We want coherency when we say this data has nans and now we can normalize them but we have hit issues with sorting nans and such before so maybe testing and identifying what does or does not work with it on different operators and turning it off at aggregate operator level would be better. I apologize if I have mislead you as far as this PR goes but we need to look at how to use has_nans properly here or as a follow on. |
Added more aggregation functions to groupby tests.
The
I'd be delighted to revert my change to I have added more aggregation functions to the NaN/Zero normalization tests. These tests succeed with and without the |
build |
build |
Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Removed
INCOMPAT
flag for FP Normalization functions.