Branch 0.5 doc remove numpartitions #1951

Merged

Conversation

sameerz
Collaborator

@sameerz sameerz commented Mar 16, 2021

Fixes #1853

The default for spark.sql.shuffle.partitions is 200. We should let users modify it if they need to, rather than put a setting in the docs. We have information about tuning this in https://nvidia.github.io/spark-rapids/docs/tuning-guide.html .

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>
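
To illustrate the idea in the description above, here is a minimal sketch of a launcher that leaves spark.sql.shuffle.partitions at Spark's default unless the user explicitly asks for an override. The helper name `build_submit_args` and its parameters are hypothetical and are not part of spark-rapids or of this PR; only the config key and the `--conf` flag come from the docs being edited.

```python
from typing import List, Optional


def build_submit_args(app: str, shuffle_partitions: Optional[int] = None) -> List[str]:
    """Build a spark-submit argument list (hypothetical helper, for illustration only).

    When shuffle_partitions is None, no setting is emitted, so Spark falls
    back to its own default (200, per the PR description).
    """
    args = ["spark-submit"]
    if shuffle_partitions is not None:
        if shuffle_partitions <= 0:
            raise ValueError("shuffle_partitions must be a positive integer")
        # Only pass the setting when the user has chosen a value.
        args += ["--conf", f"spark.sql.shuffle.partitions={shuffle_partitions}"]
    args.append(app)
    return args


if __name__ == "__main__":
    print(build_submit_args("app.py"))       # no override, default applies
    print(build_submit_args("app.py", 400))  # explicit user override
```

The value 400 is an arbitrary example, not a recommendation; the tuning guide linked above is the place to look for guidance on choosing one.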
@sameerz sameerz added the documentation Improvements or additions to documentation label Mar 16, 2021
@sameerz sameerz added this to the Mar 15 - March 26 milestone Mar 16, 2021
@sameerz sameerz self-assigned this Mar 16, 2021
Member

@jlowe jlowe left a comment

Note that we also have a shuffle partition setting in the AWS EMR guide which should probably be removed as well.

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>
@sameerz
Collaborator Author

sameerz commented Mar 17, 2021

Removed from AWS EMR getting started guide as well.

Member

@jlowe jlowe left a comment

This will need to be upmerged to branch-0.5 after #1889 is merged to get the premerge CI to pass. Not worth performing a premerge CI build until that happens. Assuming the merge of branch-0.5 goes in clean, this approval should "stick."

@sameerz
Collaborator Author

sameerz commented Mar 18, 2021

build

@jlowe
Member

jlowe commented Mar 18, 2021

This will likely need to upmerge with branch-0.5 to get premerge CI to pass.

@sameerz
Collaborator Author

sameerz commented Mar 18, 2021

build

@sameerz
Collaborator Author

sameerz commented Mar 19, 2021

build

@sameerz sameerz merged commit a50ddfb into NVIDIA:branch-0.5 Mar 19, 2021
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
* Remove --conf spark.sql.shuffle.partitions=10 from docs

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>

* Remove spark.sql.shuffle.partitions=40 from get-started/yarn-gpu.md

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>

* Remove spark.sql.shuffle.partitions from AWS EMR getting started gui

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>
@sameerz sameerz deleted the branch-0.5-doc-remove-numpartitions branch June 11, 2021 17:19
Labels
documentation Improvements or additions to documentation
Development

Successfully merging this pull request may close these issues.

[DOC] Getting started docs sets numpartitions
2 participants