Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Profile/qualification tool error handling improvements and support spark < 3.1.1 #2604

Merged
merged 47 commits into from
Jun 6, 2021

Conversation

tgravescs
Copy link
Collaborator

I found a few issues with bad event logs, added more error handling and tests.

I also added support for running with Spark versions < 3.1.1, this was due to the ResourceProfile event only in 3.1.

Renamed Rank to be Score based on feedback.

@tgravescs tgravescs added the task Work required that improves the product but is not user facing label Jun 5, 2021
@tgravescs tgravescs added this to the June 7 - June 18 milestone Jun 5, 2021
@tgravescs tgravescs self-assigned this Jun 5, 2021
@tgravescs
Copy link
Collaborator Author

build

@tgravescs
Copy link
Collaborator Author

build

@tgravescs
Copy link
Collaborator Author

build

Signed-off-by: Thomas Graves <tgraves@nvidia.com>
@tgravescs
Copy link
Collaborator Author

build

nartal1
nartal1 previously approved these changes Jun 6, 2021
@tgravescs
Copy link
Collaborator Author

build

@tgravescs
Copy link
Collaborator Author

missed test files on upmerge

@tgravescs
Copy link
Collaborator Author

build

@tgravescs tgravescs merged commit ca8104c into NVIDIA:branch-21.06 Jun 6, 2021
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
…ark < 3.1.1 (NVIDIA#2604)

* Qualification tool

Signed-off-by: Thomas Graves <tgraves@apache.org>

* remove unused func

* Add missing files

* Add checks for format option

* cast columsn to string to write to text

* Revert "Add checks for format option"

This reverts commit 6f5271c.

* cleanup

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* update output dir

* formating

* Update help messages

* update app name

* cleanup

* put test functions back

* fix typo

* add printSQLPlanMetrics and printRapidsJar

* use opt

* Add Analysis

* format output

* more tests

Signed-off-by: Thomas Graves <tgraves@apache.org>

* tests working

* test rearrange utils

* move test file

* move test file right location

* add Analysis Suite

* update test analysis

* add
rapids-4-spark-tools/src/test/resources/ProfilingExpectations/rapids_join_eventlog_jobandstagemetrics_expectation.csv

* add more tests

* more tests

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* remove unneeded expectation file

* Add more analysis tests

* comment

* cleanup

* Start handling ResourceProfile with reflection

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* allow spark 3.0 and 3.1.1 to parse logs and fix bug with missing table

* Add more test files

* Add more tests and error handling

* improve error handling to skip bad event logs

* update readme

* remove unneeded temp dir

* Revert "remove unneeded temp dir"

This reverts commit 828b302.

* close file writer on exception

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* move test files

* fix move of test files
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
…ark < 3.1.1 (NVIDIA#2604)

* Qualification tool

Signed-off-by: Thomas Graves <tgraves@apache.org>

* remove unused func

* Add missing files

* Add checks for format option

* cast columsn to string to write to text

* Revert "Add checks for format option"

This reverts commit 6f5271c.

* cleanup

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* update output dir

* formating

* Update help messages

* update app name

* cleanup

* put test functions back

* fix typo

* add printSQLPlanMetrics and printRapidsJar

* use opt

* Add Analysis

* format output

* more tests

Signed-off-by: Thomas Graves <tgraves@apache.org>

* tests working

* test rearrange utils

* move test file

* move test file right location

* add Analysis Suite

* update test analysis

* add
rapids-4-spark-tools/src/test/resources/ProfilingExpectations/rapids_join_eventlog_jobandstagemetrics_expectation.csv

* add more tests

* more tests

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* remove unneeded expectation file

* Add more analysis tests

* comment

* cleanup

* Start handling ResourceProfile with reflection

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* allow spark 3.0 and 3.1.1 to parse logs and fix bug with missing table

* Add more test files

* Add more tests and error handling

* improve error handling to skip bad event logs

* update readme

* remove unneeded temp dir

* Revert "remove unneeded temp dir"

This reverts commit 828b302.

* close file writer on exception

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* move test files

* fix move of test files
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
task Work required that improves the product but is not user facing
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants