
Split DB scripts to make them common for the build and IT pipeline #1933

Merged · 4 commits merged into NVIDIA:branch-0.5 on Mar 16, 2021

Conversation

NvTimLiu (Collaborator):

Split DB scripts to make them common for the build and IT pipeline.

Issue: #1568

Related PR #1829
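
For orientation, the pieces touched by this PR fit together roughly as follows (a sketch reconstructed from the diff hunks and comments below; the directory name is an assumption):

    jenkins/databricks/
        build.sh        # build-only steps (mvn clean package)
        test.sh         # integration-test steps, shared by the build & IT pipelines
        params.py       # common parameter parsing for run-build.py / run-test.py
        create.py       # cluster creation, common to AWS and Azure Databricks
        run-build.py
        run-test.py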

@NvTimLiu (Collaborator Author):

build

@@ -103,36 +103,5 @@ mvn -B install:install-file \

mvn -B -P${BUILD_PROFILES} clean package -DskipTests

# Copy so we pick up new built jar and latest CuDF jar. Note that the jar names have to be
@NvTimLiu (Collaborator Author):

Move the integration test scripts into 'test.sh', to make them common to the DB build & IT pipelines.

@@ -0,0 +1,71 @@
# Copyright (c) 2021, NVIDIA CORPORATION.
@NvTimLiu (Collaborator Author):

Make the parameter parsing common to both 'run-build.py' and 'run-test.py'.



def main():
workspace = 'https://dbc-9ff9942e-a9c4.cloud.databricks.com'
@NvTimLiu (Collaborator Author) · Mar 15, 2021:

Make the parameter parsing common to both 'run-build.py' and 'run-test.py' by moving the parsing code into params.py.
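
As a rough illustration, a shared params.py can define the defaults and the getopt loop once, and both entry points just import the parsed values. The getopt approach and the jar_path option come from this PR's commits; the exact option letters and variable set here are assumptions, not the PR's actual code:

    # params.py -- illustrative sketch only
    import getopt
    import sys

    workspace = 'https://dbc-9ff9942e-a9c4.cloud.databricks.com'
    token = ''
    clusterid = ''
    jar_path = ''

    usage = 'params.py -w <workspace> -t <token> -c <clusterid> -j <jarpath>'
    try:
        opts, args = getopt.getopt(sys.argv[1:], 'hw:t:c:j:',
                                   ['workspace=', 'token=', 'clusterid=', 'jarpath='])
    except getopt.GetoptError:
        print(usage)
        sys.exit(2)

    for opt, arg in opts:
        if opt == '-h':
            print(usage)
            sys.exit()
        elif opt in ('-w', '--workspace'):
            workspace = arg
        elif opt in ('-t', '--token'):
            token = arg
        elif opt in ('-c', '--clusterid'):
            clusterid = arg
        elif opt in ('-j', '--jarpath'):
            jar_path = arg

run-build.py and run-test.py could then share this with a plain 'import params' and read params.workspace, params.jar_path, and so on.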

LOCAL_JAR_PATH=$1

# tests
export PATH=/databricks/conda/envs/databricks-ml-gpu/bin:/databricks/conda/condabin:$PATH
@NvTimLiu (Collaborator Author):

Move the integration test scripts into 'test.sh', to make them common to the DB build & IT pipelines.


set -e

LOCAL_JAR_PATH=$1
@NvTimLiu (Collaborator Author):

The build pipeline builds the jars from source, so LOCAL_JAR_PATH is null.

The IT pipeline sets LOCAL_JAR_PATH to the local directory the jars have been downloaded into.
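
A hypothetical sketch of that contract; the ssh invocation, remote user, and helper name are illustrative assumptions rather than the PR's actual code:

    import subprocess

    def run_test_sh(master_addr, ssh_key_file, local_jar_path=''):
        # Build pipeline: local_jar_path is left empty, so test.sh runs
        # against the jars just built from source on the cluster.
        # IT pipeline: local_jar_path names the directory the pre-built
        # jars were downloaded into, and test.sh picks those up instead.
        subprocess.check_call([
            'ssh', '-i', ssh_key_file, 'ubuntu@' + master_addr,
            'bash test.sh ' + local_jar_path])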

@@ -23,21 +23,23 @@ class ClusterUtils(object):

@staticmethod
def generate_create_templ(sshKey, cluster_name, runtime, idle_timeout,
num_workers, driver_node_type, worker_node_type,
num_workers, driver_node_type, worker_node_type, cluster_type,
@NvTimLiu (Collaborator Author):

Add a 'cluster_type' parameter to make create.py common to AWS and Azure Databricks.

Collaborator:

Don't we need whatever the equivalent is for Azure, or are those just not specified?

Collaborator:

Also, perhaps a better name for this would be something like cloud_provider.

@NvTimLiu (Collaborator Author):

> Don't we need whatever the equivalent is for Azure, or are those just not specified?

I think we don't need to specify an equivalent for Azure, since the default (unspecified) value of cluster_type is 'aws'.

Also, cluster_type only gates the AWS-specific cluster-creation configs shown below; any other value means those configs are simply omitted. So we only need to make 'aws' the default (unspecified) value of cluster_type in create.py.

    if cluster_type == 'aws':
        templ['aws_attributes'] = {
            "zone_id": "us-west-2a",
            "first_on_demand": 1,
            "availability": "SPOT_WITH_FALLBACK",
            "spot_bid_price_percent": 100,
            "ebs_volume_count": 0
        }

@NvTimLiu (Collaborator Author):

> Also, perhaps a better name for this would be something like cloud_provider.

Sounds good, let me change it.
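
For reference, a minimal sketch of the rename: the guard and the aws_attributes block are taken from the snippet above, the helper name is hypothetical, and only the parameter name changes from cluster_type to cloud_provider:

    # Only AWS Databricks needs these attributes; any other
    # cloud_provider value (e.g. 'azure') leaves the template untouched.
    def add_cloud_attributes(templ, cloud_provider='aws'):
        if cloud_provider == 'aws':
            templ['aws_attributes'] = {
                "zone_id": "us-west-2a",
                "first_on_demand": 1,
                "availability": "SPOT_WITH_FALLBACK",
                "spot_bid_price_percent": 100,
                "ebs_volume_count": 0
            }
        return templ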

@@ -33,20 +33,21 @@ def main():
num_workers = 1
worker_type = 'g4dn.xlarge'
driver_type = 'g4dn.xlarge'
cluster_type = 'aws'
@NvTimLiu (Collaborator Author):

Add a 'cluster_type' parameter to make create.py common to AWS and Azure Databricks.

NvTimLiu marked this pull request as a draft on March 15, 2021 09:56

Add 'cluster_type' parameter to make create.py common for aws and azure Databricks

Signed-off-by: Tim Liu <timl@nvidia.com>

NvTimLiu marked this pull request as ready for review on March 15, 2021 11:43
@NvTimLiu (Collaborator Author):

build

@tgravescs (Collaborator):

Changes overall look fine, a few minor nits. One thing I want to mention: these scripts are also used by devs to build and test on Databricks manually, so I'd like any changes to keep that easy to do. It looks like that is still the case with these changes; they just split it into separate build and test steps.

change the var name from cluster_type to cloud_provider.

Signed-off-by: Tim Liu <timl@nvidia.com>
@NvTimLiu (Collaborator Author) commented Mar 15, 2021:

> Changes overall look fine, a few minor nits. One thing I want to mention: these scripts are also used by devs to build and test on Databricks manually, so I'd like any changes to keep that easy to do. It looks like that is still the case with these changes; they just split it into separate build and test steps.

Got it. I'll try to change the scripts as little as possible.

sameerz added the 'build' label (Related to CI / CD or cleanly building) on Mar 16, 2021
sameerz added this to the Mar 15 - March 26 milestone on Mar 16, 2021
@NvTimLiu (Collaborator Author):

build

NvTimLiu merged commit 97ccad7 into NVIDIA:branch-0.5 on Mar 16, 2021
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
Split DB scripts to make them common for the build and IT pipeline (NVIDIA#1933)

* Split DB scripts to make them common for the build and IT pipeline

Signed-off-by: Tim Liu <timl@nvidia.com>

* update getopt for jar_path

* Add 'cluster_type' parameter to make create.py common for aws and azure Databricks

Signed-off-by: Tim Liu <timl@nvidia.com>

* Change to use a more readable var name

change the var name from cluster_type to cloud_provider.

Signed-off-by: Tim Liu <timl@nvidia.com>