Simple arithmetic operations on the "list" type columns #8006

mkleinbort-ic · 2023-04-05T12:34:40Z

Problem description

It'd be nice for this code to work:

df = pl.DataFrame({
    'x1': [[1,2],[3,4]],
    'x2': [10,20]
})

df.with_columns(scaled_x1 = pl.col('x1')/pl.col('x2'))

# Desired output: 

shape: (2, 3)
┌───────────┬─────┬─────────────┐
│ x1        ┆ x2  ┆ scaled_x1   │
│ ---       ┆ --- ┆ ---         │
│ list[i64] ┆ i64 ┆ list[f64]   │
╞═══════════╪═════╪═════════════╡
│ [1, 2]    ┆ 10  ┆ [0.1, 0.2]  │
│ [3, 4]    ┆ 20  ┆ [0.15, 0.2] │
└───────────┴─────┴─────────────┘

The idea here is that x1 is column of arrays, and I want to divide each element in by the value in pl.col('x2').

I'd be nice to support the basic arithmetic operations from numpy: +, -, *, /, %

The text was updated successfully, but these errors were encountered:

lunarspectrum · 2023-05-19T17:02:53Z

This is a much needed feature.

It seems that the expected syntax would be

df.with_columns(pl.col("x1").arr.eval(pl.element() / pl.col("x2")).alias("scaled_x1"))

A clunky way to get to the desired result can currently be accomplished by

pl.concat(
        [
            _df.with_columns(
                pl.col("x1").arr.eval(pl.element() / x2).alias("scaled_x1")
            )
            for x2, _df in df.groupby("x2")
        ]
    )

tim-x-y-z · 2023-07-18T10:36:20Z

If you are okay using numpy, this is another way to do it:

import numpy as np
df.with_columns(scaled_x1 = pl.struct(["x1", "x2"]).apply(lambda x: np.array(x["x1"]) / x["x2"]))

itamarst · 2024-09-18T19:15:22Z

I have implemented a working prototype (see branch 8006-list-arithmetic-part-2 in my fork), based on the work in #17823.

itamarst · 2024-09-19T14:53:48Z

itamarst · 2024-09-20T18:53:13Z

Have made decent progress on making this work.

itamarst · 2024-09-24T15:08:28Z

The PR will also close #14711.

cmdlineluser · 2024-10-14T11:53:13Z

This has been added #19162

df.with_columns(scaled_x1 = pl.col.x1 / pl.col.x2)
# shape: (2, 3)
# ┌───────────┬─────┬─────────────┐
# │ x1        ┆ x2  ┆ scaled_x1   │
# │ ---       ┆ --- ┆ ---         │
# │ list[i64] ┆ i64 ┆ list[f64]   │
# ╞═══════════╪═════╪═════════════╡
# │ [1, 2]    ┆ 10  ┆ [0.1, 0.2]  │
# │ [3, 4]    ┆ 20  ┆ [0.15, 0.2] │
# └───────────┴─────┴─────────────┘

mkleinbort-ic added the enhancement New feature or an improvement of an existing feature label Apr 5, 2023

ritchie46 self-assigned this May 19, 2023

ritchie46 removed their assignment Jul 18, 2023

ritchie46 mentioned this issue Jul 18, 2023

Multiply all element of an array by value of another column #9948

Closed

zbenmo mentioned this issue Apr 16, 2024

value is a list (of dates in this case). Wanted to subtract another column (scalar date) from all the entries. #15706

Closed

itamarst mentioned this issue Sep 16, 2024

Allow arithmetic operations for list and array type #9188

Closed

This was referenced Sep 23, 2024

pl.Array + pl.lit PanicException Cannot apply operation on arrays of different lengths #18831

Closed

feat: Support arithmetic between Series with dtype list #17823

Merged

cmdlineluser mentioned this issue Sep 23, 2024

Support comparison operations for list types #18873

Open

This was referenced Sep 24, 2024

Support datetime arithemetic within lists #18899

Open

Support arithmetic operations between numeric List Series and a scalar #18900

Open

feat(rust, python): allow arithmetic operations between numeric Series and list Series #18901

Closed

nameexhaustion closed this as completed Oct 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simple arithmetic operations on the "list" type columns #8006

Simple arithmetic operations on the "list" type columns #8006

mkleinbort-ic commented Apr 5, 2023

lunarspectrum commented May 19, 2023

tim-x-y-z commented Jul 18, 2023 •

edited

Loading

itamarst commented Sep 18, 2024

itamarst commented Sep 19, 2024 •

edited

Loading

itamarst commented Sep 20, 2024

itamarst commented Sep 24, 2024

cmdlineluser commented Oct 14, 2024

Simple arithmetic operations on the "list" type columns #8006

Simple arithmetic operations on the "list" type columns #8006

Comments

mkleinbort-ic commented Apr 5, 2023

Problem description

lunarspectrum commented May 19, 2023

tim-x-y-z commented Jul 18, 2023 • edited Loading

itamarst commented Sep 18, 2024

itamarst commented Sep 19, 2024 • edited Loading

itamarst commented Sep 20, 2024

itamarst commented Sep 24, 2024

cmdlineluser commented Oct 14, 2024

tim-x-y-z commented Jul 18, 2023 •

edited

Loading

itamarst commented Sep 19, 2024 •

edited

Loading