
extend GPU memory to run cuDF for medium and big data #97

Closed
jangorecki opened this issue Aug 21, 2019 · 9 comments · Fixed by #219

Comments

@jangorecki
Contributor

Even if it is possible to fix #94 without extending GPU memory, we still need more GPU memory to handle the 1e9 data (a 45 GB csv).
We need more GPU cards, better GPU cards, or a better machine in general.
For now we have cuDF results only for the 1e7 data.

@datametrician

I would recommend 2x RTX 8000s. In addition, dask-cuDF would allow you to use both of them, versus just the single 1080 Ti you are using now.

@jangorecki
Contributor Author

Before trying to move to new hardware I would like to resolve #94, so I can be sure that the present hardware, and later the new hardware, is properly utilized.

@jangorecki
Contributor Author

jangorecki commented Dec 11, 2019

Assuming required memory scales linearly with data size (and it looks like it does), 2x RTX 8000s will not allow us to compute the 1e9 groupby task, as we would need around 220 GB for that. But even a single RTX 8000 would allow us to compute the 1e8 groupby, so we wouldn't need dask-cudf for that. As of now, using dask-cudf might not even help to resolve 1e8 on the current 2 GPUs, as explained in #94 (comment)

The join task is another thing we should not forget about; it is more memory-demanding, so eventually 2x RTX 8000s might be useful to compute the 1e8 join.
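For the record, the back-of-the-envelope extrapolation above can be sketched in a few lines. The 2.2 GB peak-memory figure for 1e7 rows is an assumption for illustration (chosen so that 1e9 rows extrapolates to the ~220 GB mentioned above); the 48 GB per-card figure is the published memory size of a Quadro RTX 8000.

```python
# Back-of-the-envelope check of the linear-scaling assumption.
GB_PER_1E7_ROWS = 2.2  # hypothetical measured peak GPU memory at 1e7 rows
RTX_8000_GB = 48       # a single Quadro RTX 8000 has 48 GB of GPU memory

def required_memory_gb(rows: float) -> float:
    """Extrapolate peak memory linearly from the 1e7-row measurement."""
    return GB_PER_1E7_ROWS * rows / 1e7

for rows in (1e7, 1e8, 1e9):
    need = required_memory_gb(rows)
    print(f"{rows:.0e} rows: ~{need:.0f} GB needed, "
          f"fits on 2x RTX 8000: {need <= 2 * RTX_8000_GB}")
```

Under this assumption the 1e8 case (~22 GB) fits on a single RTX 8000, while the 1e9 case (~220 GB) exceeds even two cards combined, which is why spilling beyond GPU memory is needed.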

@datametrician

I highly recommend moving to RTX 8000 regardless, but Dask-cuDF (as I said in the other issue) allows spilling to system memory.

@jangorecki
Contributor Author

Running the medium data size was resolved by spilling data from GPU memory to main memory. Yet that was not enough for the big data case (50 GB), so I filed a new FR for spilling data from main memory to disk: rapidsai/cudf#3740
Ultimately we should upgrade the GPU cards, so I am leaving this issue open.
Additionally, moving to dask-cudf is still on the roadmap, for now postponed until the cudf documentation is improved; the status of that can be tracked in #116
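For reference, a minimal sketch of how device-to-host spilling can be enabled through dask-cuda (this requires a CUDA-capable machine with the dask-cuda, dask and cudf packages installed; the memory limits and the csv file name are illustrative, not the exact values used in this benchmark):

```python
from dask.distributed import Client
from dask_cuda import LocalCUDACluster
import dask_cudf

# device_memory_limit is the per-GPU threshold at which dask-cuda starts
# spilling device buffers to host (main) memory; memory_limit bounds the
# host memory per worker. Both values here are illustrative.
cluster = LocalCUDACluster(
    device_memory_limit="10GB",  # spill GPU -> host above this
    memory_limit="64GB",         # host memory per worker
)
client = Client(cluster)

# Reading the csv as a partitioned dask_cudf DataFrame lets a groupby
# proceed even when the working set exceeds GPU memory, at the cost of
# host<->device transfers. The file name below is a hypothetical example.
df = dask_cudf.read_csv("G1_1e8_1e2_0_0.csv")
res = df.groupby("id1").agg({"v1": "sum"}).compute()
```

Spilling trades speed for capacity: the computation completes, but PCIe transfers dominate once the data no longer fits on the device, which is why a disk-spilling tier is the natural next request for the 50 GB case.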

@jangorecki
Contributor Author

Unfortunately we need to fall back to running only the 1e7 data size until rapidsai/cudf#2277 is resolved. This is because of the GPU memory corruption problem described in #129, which currently prevents us from running the cudf benchmarks. As a result, the cuDF timings are already 1.5 months old.

@jangorecki
Contributor Author

I re-requested spilling to disk, this time using dask_cudf, in rapidsai/cudf#3740

@jangorecki
Contributor Author

rapidsai/dask-cuda#37

@jangorecki
Contributor Author

Resolved by #219.
