Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix Databricks shim layer for GpuFileSourceScanExec and GpuBroadcastExchangeExec #571

Merged
merged 9 commits into from
Aug 18, 2020

Conversation

tgravescs
Copy link
Collaborator

Databricks has a bunch of changes in the FileSourceScanExec and DataSourceScanExec so we have to do extra stuff there. In addition they use a different version of FileStatus. So we had to shim some of the functions around using the FileStatus. I had to pull in FileScanRDD into our shim layer. I verified the metrics work on databricks for the GpuFileSourceScanExec.

Also there were changes from GpuBroadcastExchangeExec missing from databricks shim.

tgravescs and others added 9 commits August 14, 2020 15:04
Signed-off-by: Thomas Graves <tgraves@nvidia.com>
Signed-off-by: Thomas Graves <tgraves@nvidia.com>
Signed-off-by: Thomas Graves <tgraves@apache.org>
Signed-off-by: Thomas Graves <tgraves@apache.org>
Signed-off-by: Thomas Graves <tgraves@apache.org>
Signed-off-by: Thomas Graves <tgraves@nvidia.com>
Signed-off-by: Thomas Graves <tgraves@nvidia.com>
Signed-off-by: Thomas Graves <tgraves@apache.org>
@tgravescs
Copy link
Collaborator Author

build

@jlowe jlowe added the build Related to CI / CD or cleanly building label Aug 17, 2020
@tgravescs tgravescs merged commit e26e961 into NVIDIA:branch-0.2 Aug 18, 2020
@tgravescs tgravescs deleted the dbFileSourceFix branch August 18, 2020 00:43
@sameerz sameerz added this to the Aug 17 - Aug 28 milestone Aug 18, 2020
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
…xchangeExec (NVIDIA#571)

* Make GpuFileSourceScanExec work with Databricks

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* Add in GpuFileScanRDD

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* cleanup

Signed-off-by: Thomas Graves <tgraves@apache.org>

* Rework to get PartitionedFiles only

Signed-off-by: Thomas Graves <tgraves@apache.org>

* remove commented out code

Signed-off-by: Thomas Graves <tgraves@apache.org>

* Fix spacing in pom

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* Add gpu broadcast get function and fix names

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* remove unused imports

Signed-off-by: Thomas Graves <tgraves@apache.org>

Co-authored-by: Thomas Graves <tgraves@nvidia.com>
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
…xchangeExec (NVIDIA#571)

* Make GpuFileSourceScanExec work with Databricks

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* Add in GpuFileScanRDD

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* cleanup

Signed-off-by: Thomas Graves <tgraves@apache.org>

* Rework to get PartitionedFiles only

Signed-off-by: Thomas Graves <tgraves@apache.org>

* remove commented out code

Signed-off-by: Thomas Graves <tgraves@apache.org>

* Fix spacing in pom

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* Add gpu broadcast get function and fix names

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* remove unused imports

Signed-off-by: Thomas Graves <tgraves@apache.org>

Co-authored-by: Thomas Graves <tgraves@nvidia.com>
tgravescs pushed a commit to tgravescs/spark-rapids that referenced this pull request Nov 30, 2023
…IDIA#571)

Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>

Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
build Related to CI / CD or cleanly building
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants