[FEA] Improve performance of benchmark input generation #9857

robertmaynard · 2021-12-07T18:36:59Z

Is your feature request related to a problem? Please describe.
As identified by #5773 (comment) a significant portion of the runtime for some benchmarks is data generation instead of micro-benchmarking.

Specifically the issue is that the benchmark fixtures spend significant time in Setup/TearDown initializing state.

Describe the solution you'd like

Cache input state when possible across Setup/TearDown
Perform as much of generate_benchmark_input.hpp on the benchmarked GPU as possiblem

The text was updated successfully, but these errors were encountered:

To speedup generate benchmark input generation, move all data generation to device. To address #5773 (comment) This PR moves the random input generation to device. Rest all of the original work in this PR was split to multiple PRs and merged. #10277 #10278 #10279 #10280 #10281 #10300 With all of these changes, single iteration of all benchmark runs in <1000 seconds. (from 3067s to 964s). Running more iterations would see higher benefit too because the benchmark is restarted several times during run which again calls benchmark input generation code. closes #9857 Authors: - Karthikeyan (https://github.com/karthikeyann) Approvers: - Vyas Ramasubramani (https://github.com/vyasr) - Vukasin Milovanovic (https://github.com/vuule) - David Wendt (https://github.com/davidwendt) URL: #10109

robertmaynard added feature request New feature or request 0 - Backlog In queue waiting for assignment libcudf Affects libcudf (C++/CUDA) code. Performance Performance related issue labels Dec 7, 2021

robertmaynard added this to the C++ Benchmark Runtime Improvements milestone Dec 7, 2021

karthikeyann mentioned this issue Dec 8, 2021

move benchmark input generation to GPU for CONTIGUOUS_SPLIT_BENCH #9871

Closed

karthikeyann mentioned this issue Mar 7, 2022

generate benchmark input in device #10109

Merged

rapids-bot bot closed this as completed in #10109 Mar 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Improve performance of benchmark input generation #9857

[FEA] Improve performance of benchmark input generation #9857

robertmaynard commented Dec 7, 2021 •

edited

Loading

[FEA] Improve performance of benchmark input generation #9857

[FEA] Improve performance of benchmark input generation #9857

Comments

robertmaynard commented Dec 7, 2021 • edited Loading

robertmaynard commented Dec 7, 2021 •

edited

Loading