Try CuPy integration w/ Dask to see what, if any, operations benefit from GPU acceleration #6
I was checking in on Vaex recently and saw https://www.kaggle.com/jovanveljanoski/vaex-on-kaggle-gpu-performance-test, where they use CuPy. We probably want to work with Dask arrays and CuPy directly rather than via Vaex, but I thought I'd point it out as an easy way to try CuPy.
I tried swapping out NumPy for CuPy arrays as Dask chunks in qc_call_rate_benchmarking_cuda.ipynb, but the results were not great. What takes about 30 seconds in the original notebook as a parallel CPU implementation takes more like a minute with CuPy-backed Dask arrays. The time varies quite a bit with chunk size, but about 100% slower was the fastest I could get it. On the other hand, using Numba's cuda.jit to write kernels that aren't even possible with CuPy looks to be a win for LD prune (#26), so it's looking like for equal dollars spent on GPUs and CPUs, GPUs will only make sense for pairwise algorithms (or ones with worse complexity). These are pretty rough benchmarks though, so it's definitely worth testing simpler things with CuPy more as the example workflows pile up.
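For reference, a minimal sketch of the chunk-backend swap described above. The array shape, chunking, and the `call_rate` helper are hypothetical stand-ins rather than code from the notebook; only the `map_blocks(cp.asarray)` pattern is the actual mechanism for backing a Dask array with CuPy chunks:

```python
import numpy as np
import dask.array as da

try:
    import cupy as cp  # requires a CUDA-capable GPU
except ImportError:
    cp = None

# Hypothetical genotype call matrix: rows are variants, columns are
# samples, and -1 marks a missing call.
rng = np.random.default_rng(0)
calls = rng.integers(-1, 3, size=(100_000, 1_000), dtype=np.int8)

# NumPy-backed Dask array: the parallel CPU baseline.
x_cpu = da.from_array(calls, chunks=(10_000, 1_000))

def call_rate(x):
    # Fraction of non-missing calls per variant; plain array code
    # that works with either backend.
    return (x >= 0).mean(axis=1)

rate_cpu = call_rate(x_cpu).compute()

if cp is not None:
    # Swap each chunk's backend to CuPy; Dask then dispatches chunk
    # operations to the CuPy implementations.
    x_gpu = x_cpu.map_blocks(cp.asarray)
    rate_gpu = call_rate(x_gpu).compute()  # chunks are cupy.ndarray
```

And a sketch of the kind of custom pairwise kernel that cuda.jit allows but whole-array CuPy operations don't express directly. The kernel and shapes are illustrative only, not the LD prune code from #26:

```python
from numba import cuda
import numpy as np

@cuda.jit
def pairwise_dot(x, out):
    # One thread per (i, j) pair of rows; per-pair inner loops like
    # this are awkward to write as CuPy array operations.
    i, j = cuda.grid(2)
    n, m = x.shape
    if i < n and j < n:
        s = 0.0
        for k in range(m):
            s += x[i, k] * x[j, k]
        out[i, j] = s

x = np.random.random((256, 64)).astype(np.float32)
out = np.zeros((256, 256), dtype=np.float32)
threads = (16, 16)
blocks = ((x.shape[0] + 15) // 16, (x.shape[0] + 15) // 16)
pairwise_dot[blocks, threads](x, out)  # Numba copies host arrays to/from device
```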
That's an interesting finding, that some workloads are best on CPU and some are best on GPU. It makes transparent dispatch to different backends more valuable.
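As one concrete illustration of that kind of dispatch, NumPy's `__array_function__` protocol (NEP 18) already lets the same NumPy-style code route to whichever backend owns the array. The `standardize` helper here is a made-up example:

```python
import numpy as np

try:
    import cupy as cp
except ImportError:
    cp = None

def standardize(x):
    # Plain NumPy calls; via __array_function__ they dispatch to the
    # implementation belonging to x's backend.
    return (x - np.mean(x, axis=0)) / np.std(x, axis=0)

a = np.random.random((4, 3))
print(type(standardize(a)))      # <class 'numpy.ndarray'>

if cp is not None:
    b = cp.asarray(a)
    print(type(standardize(b)))  # <class 'cupy.ndarray'>, same code path
```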