Releases · pola-rs/polars

02 Feb 15:26

github-actions

py-0.16.2

9c56208

Python Polars 0.16.2

🚀 Performance improvements

improve dynamic groupby performance on sorted keys (#6599)

✨ Enhancements

implement fill_null for list data (#6635)
expression functions should be nullable (#6629)
Implement unary plus operation on exprs and series (#6517)
add streamable udfs (#6614)
is_first for struct dtype (#6595)
Added from_str_radix method to StringNameSpace that allows parsing strings in any base to i32 (#6570)
Implement DataFrame Interchange Protocol through pyarrow (#6581)
improve predicate pushdown (#6579)
raise error on invalid binary cmp (#6564)
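
As an illustration of the from_str_radix entry above, the following plain-Python sketch shows what parsing strings in an arbitrary base into a 32-bit signed integer involves. It is not the Polars implementation or API: the helper name parse_radix_i32 is hypothetical, and returning None for unparseable or out-of-range input is an assumption made for this sketch only.

```python
# Bounds of a signed 32-bit integer (i32).
I32_MIN, I32_MAX = -2**31, 2**31 - 1


def parse_radix_i32(text, radix):
    """Hypothetical helper: parse `text` as a base-`radix` integer (2..36).

    Returns None when the text is not valid in that base or when the
    value does not fit in an i32 (an assumption of this sketch).
    """
    try:
        value = int(text, radix)
    except ValueError:
        return None
    if I32_MIN <= value <= I32_MAX:
        return value
    return None


print(parse_radix_i32("ff", 16))   # 255
print(parse_radix_i32("110", 2))   # 6
```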

🐞 Bug fixes

make string_repr private (#6636)
treat literal values consistently in select context, improve related typing (#6628)
Fix _repr_html_ double-height rows (#5645) (#6534)
fix(rust, python) cast to and from fixed offsets (#6602)
raise error on string numeric arithmetic (#6601)
don't convert "ns"-precision temporal types via pyarrow (#6592)
partially assert sortedness in groupby dynamic (#6593)
fix(rust, python) raise oob if a negative index is given to take (#6590)
fix predicate pushdown key check (#6577)
fix schema of apply with many inputs on empty df (#6571)
let lhs determine struct order in supertype (#6572)
ensure consistent handling of 1D numpy arrays with respect to other sequences (#6569)
fix(rust, python) validate utc, fmt, and tz-aware in strptime (#6550)
add strptime to filter boundary (#6560)

🛠️ Other improvements

make string_repr private (#6636)
add example of using is_between with string bounds, and extend test coverage for the same (#6627)
provide additional examples for diff methods (#6630)
Consistent handling of env vars (#6626)
make structify behaviour experimental, while also extending it to aliased expressions (#6615)
Disallow clippy borrow deref ref (#6605)
Update ruff version and some settings (#6588)
Add release flow info to contributing guide (#6480)
Use assert_series_equal instead of s.series_equal(...) (#6582)
cleanup last vestiges of experimental kwargs setting (#6568)
Use assert_frame_equal instead of assert df.frame_equal(...) (#6553)
Update to PyO3 to 0.18.0 (#6531)

Thank you to all our contributors for making this release possible!
@2-5, @MarcoGorelli, @abalkin, @alexander-beedie, @cojmeister, @dependabot, @dependabot[bot], @jjerphan, @plaflamme, @ritchie46 and @stinodego


29 Jan 16:54

github-actions

py-0.16.1

15e85e4

Python Polars 0.16.1

🏆 Highlights

Formalize list aggregation difference between groupbys, selection and window functions (#6487)
automagically upconvert with_columns kwarg expressions with multiple output names to struct; extend **named_kwargs support to select (#6497)

⚠️ Breaking changes

error on string <-> date cmp (#6498)
Formalize list aggregation difference between groupbys, selection and window functions (#6487)
show where error messages originated (#6482)
Remove deprecated paths from Series.__getitem__ (#6048)
change behaviour of named rows (#6302)
Remove deprecated read/write_json arguments (#5990)
make schema, schema_overrides, and orient consistent on all user-facing interfaces (#6387)
Groupby iteration now returns tuples of (name, data) (#6350)
Remove Groupby.pivot (#6016)
Remove deprecated argument aliases (#5993)
Change Series.shuffle default behaviour (#5991)
Change Expr.is_between default behaviour (#5985)
Restrict certain function parameters to be keyword-only (#6464)
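
The "Groupby iteration now returns tuples of (name, data)" entry above describes an iteration shape that can be sketched without Polars. The example below is a plain-Python analogy over a list of dicts, not the library's implementation; the function name iter_groups and the row layout are hypothetical, and group order here simply follows first appearance.

```python
from collections import defaultdict


def iter_groups(rows, key):
    """Hypothetical helper: yield (name, data) pairs, one per distinct key value."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    # Each iteration step is a 2-tuple: the group name and that group's rows.
    for name, data in groups.items():
        yield name, data


rows = [
    {"k": "a", "v": 1},
    {"k": "b", "v": 2},
    {"k": "a", "v": 3},
]

for name, data in iter_groups(rows, "k"):
    print(name, [row["v"] for row in data])
```

Code that previously iterated over the group data alone would, under this shape, need to unpack two values per step.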

✨ Enhancements

let cast_time_zone accept None (#6539)
automagically upconvert with_columns kwarg expressions with multiple output names to struct; extend **named_kwargs support to select (#6497)
add some missing type annotation in series dispatch methods (#6523)
better errors in get_ptr and a probability on a boolean… (#6522)
add utc parameter to strptime (#6496)
add meta 'has_multiple_outputs', 'is_regex_projec… (#6500)
error on string <-> date cmp (#6498)
~30% faster iter_rows(named=True) and to_dicts(), if pyarrow available (#6493)
show where error messages originated (#6482)
Remove deprecated paths from Series.__getitem__ (#6048)
change behaviour of named rows (#6302)
Remove deprecated read/write_json arguments (#5990)
Groupby iteration now returns tuples of (name, data) (#6350)
Remove Groupby.pivot (#6016)
Remove deprecated argument aliases (#5993)
Change Series.shuffle default behaviour (#5991)
Change Expr.is_between default behaviour (#5985)
Restrict certain function parameters to be keyword-only (#6464)

🐞 Bug fixes

implement ser/de for BinaryChunked (#6543)
on frame-init from generator, initial chunk_size cannot be smaller than infer_schema_length (#6541)
raise if tz_localize called on UTC-aware (#6526)
make concat_list group aware (#6527)
error on invalid expanding expression (#6521)
create from dicts directly as struct categorical (#6520)
fix oob in arr.get by expressions (#6519)
fix cse schema (#6518)
panic when max_len -1 is reached (#6494)
Formalize list aggregation difference between groupbys, selection and window functions (#6487)
fix(rust, python) validate tz in with_time_zone (#6417)

🛠️ Other improvements

Remove verify_series_and_expr_api util (#6524)
Disable some tests for Windows (#6532)
Remove unnecessary brackets in doc examples (#6332)
Enable some tests for Windows (#6511)
Fix test issue with tmp directory (#6508)
Fix some deprecation warnings (#6495)
added all missing examples for temporal expressions (#6488)
Utilize pytest-xdist for faster unittests (#6483)
test(python) I/O test improvements (#6475)
make schema, schema_overrides, and orient consistent on all user-facing interfaces (#6387)
improved error message from Expr on incorrect usage in boolean context (#6473)

Thank you to all our contributors for making this release possible!
@MarcoGorelli, @alexander-beedie, @gab23r, @papparapa, @ritchie46, @romanovacca, @stinodego and @zundertj


26 Jan 18:13

github-actions

py-0.15.18

292fe16

Python Polars 0.15.18

✨ Enhancements

More precise pipe type annotation (#6457)

🐞 Bug fixes

use consistent floor division for floats/ints (#6460)
split semi/anti join optimization (#6459)
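
The floor-division entry above does not say what was inconsistent. As background, the sketch below contrasts floor division (rounding toward negative infinity, as Python's // does) with truncating division (rounding toward zero). The two only differ when the quotient is negative. This is a plain-Python illustration under that assumption; the helper names are hypothetical and nothing here is taken from the Polars source.

```python
import math


def floor_div(a, b):
    """Round the quotient toward negative infinity."""
    return math.floor(a / b)


def trunc_div(a, b):
    """Round the quotient toward zero."""
    return math.trunc(a / b)


# The same rule should hold whether the operands are ints or floats.
for a, b in [(7, 2), (-7, 2), (-7.0, 2.0)]:
    print(a, b, floor_div(a, b), trunc_div(a, b), a // b)
```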

🛠️ Other improvements

Specify deltalake minimum version (#6363)
deprecate iterrows in favour of iter_rows, add new @redirect class decorator (#6461)
Improve IO test structure (#6453)

Thank you to all our contributors for making this release possible!
@alexander-beedie, @josh, @ritchie46 and @stinodego


26 Jan 05:17

github-actions

py-0.15.17

96c0e35

Python Polars 0.15.17

✨ Enhancements

allow expr in str.contains (#6443)
Deprecate with_column (#6128)
expose efficient iterator over DataFrame slices (#6414)
add float formatting option (#6432)
10% speedup for to_dicts method (#6415)
add datetime/duration dtype selector groups covering the different timeunits (#6425)
allow internal api to get pointer to values buffer (#6385)
infer ISO8601 datetimes (#6357)
minor improvement to auto-detection of ambiguous data orientation (#6376)
allow expressions as arguments in str.ends_with (#6361)
Make groupby rolling/dynamic iterable (#6372)
accept expr in str.starts_with (#6355)
Move explode to namespaces (#6351)
Rename Series.struct.to_frame to .struct.unnest (#6352)
auto-detect %+ as tz-aware (#6434)

🐞 Bug fixes

fix projection pushdown on double semi join (#6440)
ensure column-exclusion works with the new dtype groups, and improve some related typing (#6442)
ensure from_dicts and DataFrame init from list of dicts behave consistently, update/improve related docstrings (#6431)
cumulative_eval ensure output dtype is respected (#6435)
allow from pandas null structs (#6430)
fixed interaction of schema_overrides with frame-init from list of dicts (#6424)
only use float simd on specific alignment (#6427)
no early escape when window is equal to len in rolling_float (#6408)
is_between typing with time in start and end (#6393)
don't incorrectly infer Zulu time (#6378)
raise error on invalid sort_by argument (#6382)
take offset into account with str.explode (#6384)
Return empty batch for pl.read_csv_batched().next_… (#6381)
ensure pyarrow.compute module is loaded (#6353)
implement ser/de for StructChunked (#6359)
series of empty structs (#6347)

🛠️ Other improvements

add explicit note about use of Config as a context manager (#6439)
ensure from_dicts and DataFrame init from list of dicts behave consistently, update/improve related docstrings (#6431)
Fix docstring of series.interpolate (#6399)
Remove duplicate test (#6390)
deprecate columns param for DataFrame init; transitioning to schema (#6366)
Add docs and tests to Expr.flatten (#6370)
Example of filtering partitioned delta tables (#6365)
Uppercase project URL refs (#6362)

Thank you to all our contributors for making this release possible!
@ChayimFriedman2, @MarcoGorelli, @alexander-beedie, @c-peters, @flowlight0, @gab23r, @gam-phon, @ghuls, @jgmartin, @josh, @ritchie46, @romanovacca, @stinodego, @universalmind303 and @zundertj


21 Jan 13:40

github-actions

py-0.15.16

ad6d42e

Python Polars 0.15.16

🚀 Performance improvements

Improve rechunk check (#6268)
reuse allocated scratches in ipc writer (#6287)
use dedicated writer thread for sink_parquet (#6285)

✨ Enhancements

add strict parameter to decoding expressions (#6342)
allow unordered struct creating from anyvalues (#6321)
allow pass_name in aggregation apply (#6318)
parse abbrev month name (#6314)
Add warning for new behaviour of named rows (#6300)
add dt.combine for combining date and time components (#6121)
improvements to dtype-based column selection (#6295)
add sink_ipc (#6286)
additional schema_overrides param for more ergonomic DataFrame init (#6230)

🐞 Bug fixes

don't cast nulls before trying normal cast (#6339)
properly dispatch categorical string comparison (#6336)
expand all nested wildcards in functions (#6334)
fix groupby rolling by_key if groups are empty (#6333)
Fix some type hints and bugs for groupby (#6329)
Reject None input for head/tail (#6326)
parse abbrev month name (#6314)
default to pyarrow for writing parquet (#6313)
disallow alias in inline join expressions (#6312)
block projection pushdown and predicate pushdown on swapping rename (#6303)
convert nested dictionary with i64 keys (#6299)
fix(python) Print instantiated dtypes in glimpse (#6298)
infer y-m-d datetime even if single element (#6297)
fix panic dynamic_groupby on empty dataframe (#6294)
implement missing DataFrame __floordiv__ op (#6280)
Allow low and high in date_range to be str (#6275)
allow integer-compatible row indexes that are not strictly typed as int (#6266)
Parse negative dates with polars parser (#6256)

🛠️ Other improvements

run cse optimization only if joins and caches… (#6337)
Fix wrong description for variable_name argument in melt (#6331)
Fix random groupby test failure (#6327)
fixup test names, adjust test_struct (#6317)
simplify _from_pandas constructor (#6310)
Ignore hash doctests (#6304)
Fix docstring formatting for truncate (#6291)
Move package metadata to pyproject.toml (#6271)
Move io tests to the same folder (#6277)
Enable Dependabot (#5036)

Thank you to all our contributors for making this release possible!
@MarcoGorelli, @alexander-beedie, @c-peters, @dependabot, @dependabot[bot], @ghuls, @n8henrie, @ritchie46, @stinodego and @universalmind303


15 Jan 10:52

github-actions

py-0.15.15

dc51544

Python Polars 0.15.15

✨ Enhancements

ensure out-of-core sort works out-of-core with all-constant values (#6235)
The 1 billion row sort (#6156)
optionally treat missing UTF8 values as the empty string at CSV parse-time (#6203)
check file target is not an existing directory (#6187)
support negative indexing for DataFrame head and tail methods (#6173)
Implement DataFrame.unique(keep="none") (#6169)
support use of explicit Struct dtypes on DataFrame/Series init (#6145)
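
For the head and tail entry above, a rough sketch of what a negative argument could mean, assuming the convention that a negative n selects all rows except abs(n) of them (head drops from the end, tail drops from the start). The functions below operate on plain Python lists and are hypothetical stand-ins, not the DataFrame methods themselves.

```python
def head(rows, n):
    """First n rows; for negative n, everything except the last abs(n) rows."""
    # Python slicing already has this meaning for both signs of n.
    return rows[:n]


def tail(rows, n):
    """Last n rows; for negative n, everything except the first abs(n) rows."""
    if n < 0:
        return rows[-n:]
    # Avoid rows[-0:], which would return the whole list instead of nothing.
    return rows[max(len(rows) - n, 0):]


rows = [1, 2, 3, 4, 5]
print(head(rows, 2), head(rows, -2))  # [1, 2] [1, 2, 3]
print(tail(rows, 2), tail(rows, -2))  # [4, 5] [3, 4, 5]
```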

🐞 Bug fixes

Add list inner dtype when printing Series (#6233)
strptime now respects pl.Datetime's time_unit (#6231)
fix when then otherwise with arity and aggregation… (#6224)
collect now uses the storage_options given to scan_parquet (#6223)
set_sorted keep schema (#6222)
pass name to value counts in aggregation (#6221)
don't set fast_explode on list of structs (#6220)
address a frame init/construction error, and expose infer_schema_length to frame init (#6210)
explode of empty nullable list (#6190)
fix oob arr.take (#6189)
Make with_columns in with_columns_kwargs mode compatible with more data types (#6126)
Update docstring with_columns to reflect a new dataframe is being returned (#6122)
fix empty streaming joins (#6149)
fix streaming joins where the join order has been … (#6143)
write tz-aware datetimes to csv (#6135)
add null behavior for oob indices (#6133)

🛠️ Other improvements

Create DataFrame from schema (#6225)
don't set aggregated flag on null propagated aggregation. (#6191)
undo cargo.toml change (#6219)
Improve drop_nulls docstrings (#6127)
Clarify docstrings for closed argument (#6198)
minor docs and typing updates (plus additional test coverage for related areas) (#6182)
explain n_field_strategy (#6158)