Benchmark guide update for command-line interface / spark-submit #924

Merged
andygrove merged 6 commits into NVIDIA:branch-0.3 on Oct 13, 2020

andygrove
Contributor

This PR fixes a couple of small errors in the guide and adds instructions on using spark-submit to run benchmarks.

@andygrove andygrove added the benchmark Benchmarking, benchmarking tools label Oct 9, 2020
@andygrove andygrove added this to the Sep 28 - Oct 9 milestone Oct 9, 2020
@andygrove andygrove self-assigned this Oct 9, 2020
Signed-off-by: Andy Grove <andygrove@nvidia.com>
@andygrove
Contributor Author

build

jlowe
jlowe previously approved these changes Oct 9, 2020
docs/benchmarks.md (outdated, resolved)
@sameerz
Collaborator

sameerz commented Oct 10, 2020

build

@sameerz sameerz added the documentation Improvements or additions to documentation label Oct 10, 2020
@sameerz
Collaborator

sameerz commented Oct 11, 2020

build

jlowe
jlowe previously approved these changes Oct 12, 2020

  The benchmarks can be executed in two modes currently:

  - Execute the query and collect the results to the driver
- - Execute the query and write the results to disk (in Parquet or CSV format)
+ - Execute the query and write the results to disk (in Parquet or ORC format)
Contributor Author

So, this wasn't an error after all. The benchmarks currently support writing to Parquet and CSV only. I will add ORC support as part of this PR.
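
For readers unfamiliar with the two execution modes, here is a minimal Python sketch of how a benchmark runner might dispatch between collecting results and writing them in one of the three formats (Parquet, CSV, and ORC once this PR adds it). This is not the code in this PR: the names `Collect`, `Write`, `run_query`, and `SUPPORTED_FORMATS` are hypothetical, and `df` is assumed to behave like a PySpark DataFrame (`collect()` and `write.mode(...).format(...).save(...)`).

```python
from dataclasses import dataclass

# Formats named in this thread; the tuple name is hypothetical.
SUPPORTED_FORMATS = ("parquet", "csv", "orc")


@dataclass(frozen=True)
class Collect:
    """Mode 1: run the query and collect the results to the driver."""


@dataclass(frozen=True)
class Write:
    """Mode 2: run the query and write the results to disk."""
    fmt: str   # one of SUPPORTED_FORMATS (case-insensitive)
    path: str  # output location


def run_query(df, mode):
    """Execute `df` in the requested mode (hypothetical helper).

    `df` is assumed to expose the PySpark DataFrame API.
    Returns the collected rows for Collect, and None for Write.
    """
    if isinstance(mode, Collect):
        return df.collect()
    if isinstance(mode, Write):
        fmt = mode.fmt.lower()
        if fmt not in SUPPORTED_FORMATS:
            raise ValueError(f"unsupported output format: {mode.fmt}")
        # Overwrite is an assumption so that repeated runs do not fail
        # on an existing output directory.
        df.write.mode("overwrite").format(fmt).save(mode.path)
        return None
    raise TypeError(f"unknown mode: {mode!r}")
```

Rejecting unknown formats up front mirrors the point of this review thread: the documentation should list only the formats the dispatch actually handles.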

Signed-off-by: Andy Grove <andygrove@nvidia.com>
@andygrove
Contributor Author

build

Signed-off-by: Andy Grove <andygrove@nvidia.com>
Signed-off-by: Andy Grove <andygrove@nvidia.com>
@andygrove
Contributor Author

build

@andygrove
Contributor Author

build

@andygrove andygrove merged commit 8bf4653 into NVIDIA:branch-0.3 Oct 13, 2020
@andygrove andygrove deleted the benchmark-guide-updates branch October 13, 2020 13:55
sperlingxx pushed a commit to sperlingxx/spark-rapids that referenced this pull request Nov 20, 2020
…DIA#924)

* Add spark-submit usage

Signed-off-by: Andy Grove <andygrove@nvidia.com>

* Update docs/benchmarks.md

* Add option to write to ORC

Signed-off-by: Andy Grove <andygrove@nvidia.com>

* Add match for WriteOrc

Signed-off-by: Andy Grove <andygrove@nvidia.com>

* Add match for Orc when writing results summary file

Signed-off-by: Andy Grove <andygrove@nvidia.com>

Co-authored-by: Sameer Raheja <sameerz@users.noreply.github.com>
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
tgravescs pushed a commit to tgravescs/spark-rapids that referenced this pull request Nov 30, 2023
[auto-merge] bot-auto-merge-branch-23.02 to branch-23.04 [skip ci] [bot]