Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Additional tests for broadcast hash join #313

Merged
merged 5 commits into from
Jul 15, 2020

Conversation

nartal1
Copy link
Collaborator

@nartal1 nartal1 commented Jul 1, 2020

This is part of audit work #237 . Took a look at Spark tests that could be ported over to spark-rapids plugin. Most of them are not required. Have added tests that seemed relevant. Metrics related tests can be added in #198. Please take a look and comment if these tests could be added.

@nartal1 nartal1 added the test Only impacts tests label Jul 1, 2020
@nartal1 nartal1 added this to the Jun 22 - Jul 2 milestone Jul 1, 2020
@nartal1 nartal1 self-assigned this Jul 1, 2020
@nartal1
Copy link
Collaborator Author

nartal1 commented Jul 1, 2020

build


test("broadcast hint isn't propagated after a join") {
import spark.sqlContext.implicits._
spark.conf.set("spark.sql.autoBroadcastJoinThreshold", "-1")
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Won't setting configs on the shared spark session be problematic with the potential of config settings from one test leaking into other tests? That's why Spark's tests use something like withSQLConf to only temporary change configurations for a specific test.

@nartal1 nartal1 modified the milestones: Jun 22 - Jul 2, Jul 6 - Jul 17 Jul 2, 2020
@nartal1
Copy link
Collaborator Author

nartal1 commented Jul 14, 2020

build

@nartal1
Copy link
Collaborator Author

nartal1 commented Jul 14, 2020

Addressed review comments. Ready for second round of review.

@@ -97,4 +99,45 @@ class JoinsSuite extends SparkQueryCompareTestSuite {
mixedDfWithNulls, mixedDfWithNulls, sortBeforeRepart = true) {
(A, B) => A.join(B, A("longs") === B("longs"), "LeftAnti")
}

test("broadcast hint isn't propagated after a join") {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How are these integration tests? They don't fit with that model, and look much more like "unit" tests. Could you please move them to the tests directory instead of integration_tests

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @revans2 for reviewing. My bad, since all the join tests were in this file, I added these here as well not realizing these do not qualify here. I have moved these tests to a new file under tests directory. Please take a look and let me know if it's okay to keep there or should be put in any existing file.

@nartal1
Copy link
Collaborator Author

nartal1 commented Jul 15, 2020

build

@revans2
Copy link
Collaborator

revans2 commented Jul 15, 2020

The concerns for @jlowe have been addressed so I think we can merge this in now instead of waiting for him.

@revans2 revans2 merged commit 10668cd into NVIDIA:branch-0.2 Jul 15, 2020
nartal1 added a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
* Additional tests for broadcast hash join

* addressed review comments

* Moved these tests under tests/ directory

Co-authored-by: Niranjan Artal <nartal>
nartal1 added a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
* Additional tests for broadcast hash join

* addressed review comments

* Moved these tests under tests/ directory

Co-authored-by: Niranjan Artal <nartal>
tgravescs pushed a commit to tgravescs/spark-rapids that referenced this pull request Nov 30, 2023
Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
test Only impacts tests
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants