Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Save Databricks init scripts in the workspace [databricks] #8961

Merged
merged 5 commits into from
Aug 25, 2023

Conversation

NvTimLiu
Copy link
Collaborator

@NvTimLiu NvTimLiu commented Aug 9, 2023

To close: #8916

Init scripts in DBFS will be deprecated sept 1 2023.

Move Databricks init scripts in the workspace

Signed-off-by: Tim Liu timl@nvidia.com

To close: NVIDIA#8916

Init scripts in DBFS will be deprecated sept 1 2023.

Move Databricks init scripts in the workspace

Signed-off-by: Tim Liu <timl@nvidia.com>
Signed-off-by: Tim Liu <timl@nvidia.com>
@NvTimLiu
Copy link
Collaborator Author

NvTimLiu commented Aug 9, 2023

build

Signed-off-by: Tim Liu <timl@nvidia.com>
@NvTimLiu
Copy link
Collaborator Author

NvTimLiu commented Aug 9, 2023

build

@NvTimLiu
Copy link
Collaborator Author

Oh, I'd like to move DB workspace create and clean up into the common functions, let me try it.

@@ -129,7 +129,7 @@ String getDbType() {
void databricksBuild() {
def CLUSTER_ID = ''
def SPARK_MAJOR = BASE_SPARK_VERSION_TO_INSTALL_DATABRICKS_JARS.replace('.', '')
def dbfs_path = "$INIT_SCRIPTS_DIR-$DB_TYPE"
def ws_path = "$INIT_SCRIPTS_DIR-$DB_TYPE"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

another way of doing this is to add spark-rapids github repo under /Repos. Then we can reference an initscript path /Repos/user@email/spark-rapids/...

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If we create a mirror spark-rapids GitHub repo on Databricks /Repos/user@email/spark-rapids dir, it is inflexible to point the init_script files at the PR for the pre-merge pipeline, or pointing to someone's private GitHub repo for testing pipelines?

Signed-off-by: Tim Liu <timl@nvidia.com>
@sameerz sameerz added build Related to CI / CD or cleanly building documentation Improvements or additions to documentation labels Aug 14, 2023
… common functions

Signed-off-by: Tim Liu <timl@nvidia.com>
@NvTimLiu
Copy link
Collaborator Author

build

@NvTimLiu NvTimLiu marked this pull request as ready for review August 24, 2023 01:28
Copy link
Collaborator

@pxLi pxLi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM to unblock usage after Sep. 1

If any follow-ups, lets handle in new PRs, thanks~

@pxLi pxLi merged commit 69ac83c into NVIDIA:branch-23.10 Aug 25, 2023
27 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
build Related to CI / CD or cleanly building documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] Databricks - move init scripts off DBFS
6 participants