Fixes up the overflowed fixed-point round on nullable column #10316

sperlingxx · 2022-02-17T04:10:39Z

Signed-off-by: sperlingxx <lovedreamf@gmail.com>

Fixes the overflow rounding path for nullable fixed-point columns. The previous implementation built the zero replacement column via cudf::detail::fill_in_place with a zero scalar. However, that API overwrites the null mask of the original column with the validity of the zero scalar, so null rows in the input came back as valid zeros, which is not the intended behavior.
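
To illustrate the failure mode described above, here is a minimal sketch in plain C++. It does not use libcudf: `ToyColumn`, `zero_fill_overwriting_mask`, and `zero_fill_keeping_mask` are hypothetical names invented for this illustration, and an empty `valid` vector is assumed to stand for "no null mask allocated".

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Toy stand-in for a nullable column (NOT the libcudf column type).
// Assumption: an empty `valid` vector models "no null mask allocated".
struct ToyColumn {
  std::vector<std::int64_t> data;
  std::vector<bool> valid;
};

// Models the behaviour reported in this PR: every row takes both the
// value (zero) and the validity of the fill scalar, so the input's
// null information is lost.
ToyColumn zero_fill_overwriting_mask(ToyColumn const& in, bool scalar_is_valid = true) {
  ToyColumn out;
  out.data.assign(in.data.size(), 0);
  out.valid.assign(in.data.size(), scalar_is_valid);
  return out;
}

// Models the intended behaviour: values become zero, while the validity
// of each row (or the absence of a mask) is carried over from the input.
ToyColumn zero_fill_keeping_mask(ToyColumn const& in) {
  ToyColumn out;
  out.data.assign(in.data.size(), 0);
  out.valid = in.valid;
  return out;
}
```

For an input with rows {123, null, 789}, the first function reports the middle row as a valid zero, while the second keeps it null. An input without a mask stays without one.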

codecov · 2022-02-17T05:41:53Z

Codecov Report

Merging #10316 (40a0763) into branch-22.04 (8b0737d) will decrease coverage by 0.04%.
The diff coverage is n/a.

@@               Coverage Diff                @@
##           branch-22.04   #10316      +/-   ##
================================================
- Coverage         10.67%   10.63%   -0.05%     
================================================
  Files               122      122              
  Lines             20874    20953      +79     
================================================
  Hits               2228     2228              
- Misses            18646    18725      +79

| Impacted Files | Coverage Δ |
|---|---|
| python/cudf/cudf/core/frame.py | `0.00% <0.00%> (ø)` |
| python/cudf/cudf/core/series.py | `0.00% <0.00%> (ø)` |
| python/cudf/cudf/utils/utils.py | `0.00% <0.00%> (ø)` |
| python/cudf/cudf/core/dataframe.py | `0.00% <0.00%> (ø)` |
| python/cudf/cudf/testing/_utils.py | `0.00% <0.00%> (ø)` |
| python/cudf/cudf/core/_base_index.py | `0.00% <0.00%> (ø)` |
| python/cudf/cudf/core/column/column.py | `0.00% <0.00%> (ø)` |
| python/cudf/cudf/core/column/string.py | `0.00% <0.00%> (ø)` |
| python/cudf/cudf/core/indexed_frame.py | `0.00% <0.00%> (ø)` |
| python/cudf/cudf/core/groupby/groupby.py | `0.00% <0.00%> (ø)` |

... and 1 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 8b0737d...40a0763.

ttnghia · 2022-02-18T22:16:14Z

cpp/src/round/round.cu

+                         stream);
+    } else {
+      detail::copy_range(thrust::make_constant_iterator(static_cast<Type>(0)),
+                         thrust::make_constant_iterator(false),


Wait, if the input does not have nulls, will the output be a column of all nulls?

Oh, you are right. It is unnecessary to set the null mask if the input doesn't have one.

If so, then we can just use thrust::uninitialized_fill to fill the output with zero values.

Done. I am quite curious, though, why we use thrust::uninitialized_fill instead of thrust::fill here. Is it because thrust::fill has the extra cost of initializing the data before assigning the value? @ttnghia

That's very interesting. I believe that for plain types they are the same. Otherwise, thrust::uninitialized_fill calls the copy constructor while thrust::fill calls the assignment operator.
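
The distinction above can be sketched on the host with the standard-library counterparts `std::uninitialized_fill` and `std::fill`, used here as stand-ins on the assumption that the Thrust algorithms follow the same contract. `Probe`, `Counts`, `count_uninitialized_fill`, and `count_fill` are hypothetical helpers written only for this illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <memory>
#include <new>
#include <vector>

// Global tallies, reset before each measurement.
static int g_copies  = 0;  // copy-constructor calls
static int g_assigns = 0;  // copy-assignment calls

// Hypothetical probe type that records how each element receives its value.
struct Probe {
  int value;
  Probe() : value(-1) {}
  explicit Probe(int v) : value(v) {}
  Probe(Probe const& other) : value(other.value) { ++g_copies; }
  Probe& operator=(Probe const& other)
  {
    value = other.value;
    ++g_assigns;
    return *this;
  }
};

struct Counts {
  int copies;
  int assigns;
};

// Fills raw, unconstructed storage: each element is copy-constructed in place.
Counts count_uninitialized_fill(int n)
{
  Probe const zero(0);
  void* storage = ::operator new(sizeof(Probe) * n);
  Probe* first  = static_cast<Probe*>(storage);
  g_copies = g_assigns = 0;
  std::uninitialized_fill(first, first + n, zero);
  Counts result = {g_copies, g_assigns};
  for (int i = 0; i < n; ++i) { first[i].~Probe(); }  // destroy before freeing
  ::operator delete(storage);
  return result;
}

// Fills already-constructed elements: each element is assigned to.
Counts count_fill(int n)
{
  Probe const zero(0);
  std::vector<Probe> elems(n);  // elements are default-constructed first
  g_copies = g_assigns = 0;
  std::fill(elems.begin(), elems.end(), zero);
  Counts result = {g_copies, g_assigns};
  return result;
}
```

For a non-trivial type such as `Probe`, filling four elements records four copy constructions and no assignments in the first case, and four assignments and no copy constructions in the second. For plain arithmetic types, such as the integer representation of a fixed-point value, there is no observable difference between the two.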

In addition, it would be nice if you can add a unit test for this case 😃

That's interesting!
The unit test for validity was added to one of the existing test blocks.

sperlingxx · 2022-02-22T02:29:11Z

@gpucibot merge

Signed-off-by: sperlingxx <lovedreamf@gmail.com>

Closes #3793. Pushes cuDF-related decimal utilities down to cuDF. This PR relies on the following cuDF changes: rapidsai/cudf#9809, rapidsai/cudf#9907 and rapidsai/cudf#10316.

fixes up the overflowed fixed-point round on nullable column

4ee7365

Signed-off-by: sperlingxx <lovedreamf@gmail.com>

sperlingxx added the bug, 3 - Ready for Review, 4 - Needs Review, and non-breaking labels Feb 17, 2022

sperlingxx requested a review from a team as a code owner February 17, 2022 04:10

sperlingxx requested review from devavret and hyperbolic2346 February 17, 2022 04:10

github-actions bot added the libcudf label Feb 17, 2022

update licence

27c5f50

This was referenced Feb 18, 2022

[FEA] Push decimal workarounds/cleanup back to cudf NVIDIA/spark-rapids#3793

Closed

Push decimal workarounds to cuDF NVIDIA/spark-rapids#4822

Merged

ttnghia reviewed Feb 18, 2022

sperlingxx requested a review from ttnghia February 21, 2022 01:47

sperlingxx added 2 commits February 21, 2022 15:59

update

306cca3

update

40a0763

ttnghia approved these changes Feb 21, 2022

rapids-bot bot merged commit 7a17f28 into rapidsai:branch-22.04 Feb 22, 2022

sperlingxx deleted the fix_fp_overflow_round branch February 22, 2022 02:29
