refactor(python): Very minor refactor of `DataFrame.to_numpy` code #16325

stinodego · 2024-05-20T00:36:41Z

Going to take a look at the dataframe code next, starting out with some small refactors.

codecov · 2024-05-20T00:56:38Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 81.37%. Comparing base (f5c32f2) to head (cdcd2ee).

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #16325      +/-   ##
==========================================
+ Coverage   81.35%   81.37%   +0.01%     
==========================================
  Files        1403     1403              
  Lines      183707   183691      -16     
  Branches     2954     2954              
==========================================
+ Hits       149460   149472      +12     
+ Misses      33736    33708      -28     
  Partials      511      511

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

ritchie46 · 2024-05-20T06:47:05Z

py-polars/polars/dataframe/frame.py

@@ -1602,12 +1602,12 @@ def raise_on_copy(msg: str) -> None:

        out = self._df.to_numpy(order)
        if out is None:
-            return np.vstack(
+            return np.column_stack(


I can recall I benchmarked this one. Why do we change from vstack to column_stack and what are the perf implications?

Let me revert this one for now and look into it more closely - I haven't looked into this too deeply but there are some correctness implications, e.g. currently Array types are not handled correctly. It should not have been included in this PR.

py-polars/src/to_numpy.rs

…ola-rs#16325)

stinodego requested review from ritchie46, c-peters, alexander-beedie, MarcoGorelli and reswqa as code owners May 20, 2024 00:36

github-actions bot added internal An internal refactor or improvement python Related to Python Polars labels May 20, 2024

ritchie46 reviewed May 20, 2024

View reviewed changes

stinodego added 3 commits May 20, 2024 11:15

Use macro

f2bd9fa

Use dtypes_to_supertype util

172ec86

Add some docs

cdcd2ee

stinodego force-pushed the df-to-np-fixes branch from 7d7864d to cdcd2ee Compare May 20, 2024 09:15

ritchie46 approved these changes May 20, 2024

View reviewed changes

ritchie46 merged commit ec904e6 into main May 20, 2024
17 checks passed

ritchie46 deleted the df-to-np-fixes branch May 20, 2024 09:34

c-peters added the accepted Ready for implementation label May 21, 2024

c-peters assigned stinodego May 21, 2024

Wouittone pushed a commit to Wouittone/polars that referenced this pull request Jun 22, 2024

refactor(python): Very minor refactor of DataFrame.to_numpy code (p…

9e03d52

…ola-rs#16325)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(python): Very minor refactor of `DataFrame.to_numpy` code #16325

refactor(python): Very minor refactor of `DataFrame.to_numpy` code #16325

stinodego commented May 20, 2024

codecov bot commented May 20, 2024 •

edited

Loading

ritchie46 May 20, 2024

stinodego May 20, 2024 •

edited

Loading

refactor(python): Very minor refactor of DataFrame.to_numpy code #16325

refactor(python): Very minor refactor of DataFrame.to_numpy code #16325

Conversation

stinodego commented May 20, 2024

codecov bot commented May 20, 2024 • edited Loading

Codecov Report

ritchie46 May 20, 2024

Choose a reason for hiding this comment

stinodego May 20, 2024 • edited Loading

Choose a reason for hiding this comment

refactor(python): Very minor refactor of `DataFrame.to_numpy` code #16325

refactor(python): Very minor refactor of `DataFrame.to_numpy` code #16325

codecov bot commented May 20, 2024 •

edited

Loading

stinodego May 20, 2024 •

edited

Loading