fix: Compute joint null mask before calling rolling corr/cov stats #18246

agossard · 2024-08-18T00:08:41Z

codecov · 2024-08-18T00:36:46Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 80.24%. Comparing base (a284174) to head (ff1a318).
Report is 36 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #18246      +/-   ##
==========================================
+ Coverage   80.23%   80.24%   +0.01%     
==========================================
  Files        1500     1500              
  Lines      198871   198897      +26     
  Branches     2837     2837              
==========================================
+ Hits       159556   159604      +48     
+ Misses      38788    38768      -20     
+ Partials      527      525       -2


agossard · 2024-08-18T01:20:30Z

I doubt the hypothesis test failure is caused by this PR. Maybe it's being addressed by #18245, @MarcoGorelli?

ritchie46

Not entirely happy about the implementation as it leaves a lot to CSE (common subexpression elimination), but it is fine for now. Let's first make it correct.

Thanks! Got one comment about the test.
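For readers following along, the idea named in the PR title can be illustrated in plain Python. This is only a sketch under stated assumptions, not the PR's actual implementation: the helper names `joint_null_mask`, `apply_mask`, and `cov` are hypothetical, and `None` stands in for a null. The point is that both inputs are masked to the rows where both are present before any statistic is computed, so the means and the covariance all use the same observations.

```python
from typing import List, Optional, Sequence

Value = Optional[float]


def joint_null_mask(a: Sequence[Value], b: Sequence[Value]) -> List[bool]:
    """True where both inputs are non-null (hypothetical helper)."""
    return [x is not None and y is not None for x, y in zip(a, b)]


def apply_mask(values: Sequence[Value], mask: Sequence[bool]) -> List[Value]:
    """Null out every entry that is not valid in both inputs."""
    return [v if keep else None for v, keep in zip(values, mask)]


def cov(a: Sequence[Value], b: Sequence[Value], ddof: int = 1) -> Value:
    """Sample covariance over the jointly non-null pairs."""
    pairs = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    n = len(pairs)
    if n - ddof <= 0:
        return None
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in pairs) / (n - ddof)


# Illustrative data (an assumption, not taken from the PR's tests).
a = [1.0, 2.0, None, 4.0, 5.0]
lag_a = [None, 1.0, 2.0, None, 4.0]  # `a` shifted by one position

mask = joint_null_mask(a, lag_a)
a_masked = apply_mask(a, mask)
lag_masked = apply_mask(lag_a, mask)
joint_cov = cov(a_masked, lag_masked, ddof=1)
```

Without the mask, the mean of `a` alone would be taken over four values while the mean of `lag_a` would be taken over three different rows, so the two columns would be centred on different observations.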

ritchie46 · 2024-08-18T07:20:11Z

py-polars/tests/unit/operations/rolling/test_rolling.py

+        pl.rolling_corr("a", "lag_a", window_size=10, min_periods=5, ddof=1).tail(1)
+    ).item()
+
+    assert val_1 == val_2


Can you also test the actual value here?

ritchie46 · 2024-08-18T07:20:17Z

py-polars/tests/unit/operations/rolling/test_rolling.py

+        pl.rolling_cov("a", "lag_a", window_size=10, min_periods=5, ddof=1).tail(1)
+    ).item()
+
+    assert val_1 == val_2


Can you also test the actual value here?

agossard · 2024-08-18

I changed the test as suggested and pushed. However, I also spent some time trying to put together a hypothesis test that would cross-check these corr and cov functions against NumPy. I could not get it to pass, and I have an example frame that yields a correlation > 1.0 :-/

import polars as pl

df = pl.DataFrame(
    {
        "a": [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0],
        "b": [101.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.000061, 0.0],
    }
)

df_corr = df.select(
    pl.rolling_corr("a", "b", window_size=7, min_periods=5, ddof=1)
)

I don't have time to push more on this right now (or even this week maybe). But I will log a separate issue.
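As a reference point for the frame above, here is a dependency-free sketch of a per-window, two-pass Pearson correlation. It is an assumption-laden illustration rather than the Polars algorithm: `rolling_corr_reference` is a hypothetical name, the window is taken as the trailing `window_size` rows, `min_periods` is interpreted as the minimum number of jointly non-null pairs, and a zero-variance window is reported as NaN. `ddof` is omitted because the same `n - ddof` factor appears in the covariance and in both variances, so it cancels in the ratio.

```python
import math
from typing import List, Optional, Sequence

Value = Optional[float]


def rolling_corr_reference(
    a: Sequence[Value],
    b: Sequence[Value],
    window_size: int,
    min_periods: int,
) -> List[Value]:
    """Two-pass Pearson correlation recomputed from scratch per window."""
    out: List[Value] = []
    for end in range(len(a)):
        start = max(0, end - window_size + 1)
        # Keep only rows where both inputs are present (joint null mask).
        pairs = [
            (x, y)
            for x, y in zip(a[start : end + 1], b[start : end + 1])
            if x is not None and y is not None
        ]
        if len(pairs) < min_periods:
            out.append(None)
            continue
        n = len(pairs)
        mean_x = sum(x for x, _ in pairs) / n
        mean_y = sum(y for _, y in pairs) / n
        sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
        sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
        syy = sum((y - mean_y) ** 2 for _, y in pairs)
        denom = math.sqrt(sxx * syy)
        # A constant column in the window has zero variance: undefined.
        out.append(sxy / denom if denom > 0 else math.nan)
    return out


a = [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0]
b = [101.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.000061, 0.0]
result = rolling_corr_reference(a, b, window_size=7, min_periods=5)
```

In the last two windows the only non-zero entries of "a" and "b" sit in the same row, so the two columns are exactly proportional there and the reference value is 1.0 up to floating-point rounding. Any result noticeably above 1.0 would therefore point at numerical error rather than at the data.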

Compute joint null mask before calling rolling corr/cov stats

bb4fbe9

agossard requested review from ritchie46, c-peters, alexander-beedie, MarcoGorelli, reswqa and orlp as code owners August 18, 2024 00:08

github-actions bot added the title needs formatting label Aug 18, 2024

agossard changed the title ~~Compute joint null mask before calling rolling corr/cov stats~~ fix: Compute joint null mask before calling rolling corr/cov stats Aug 18, 2024

github-actions bot added the fix, python and rust labels and removed the title needs formatting label Aug 18, 2024

ritchie46 reviewed Aug 18, 2024

explicitly test result frames

ff1a318

ritchie46 merged commit 56b1219 into pola-rs:main Aug 22, 2024
26 checks passed
