add some modification to support torchbench-cpu well #795
Conversation
export OMP_NUM_THREADS=1 GOMP_CPU_AFFINITY=4
else
# eg: bash test_torch_bench.sh cpu tiny 32 "0-63:2"
export OMP_NUM_THREADS=$3 GOMP_CPU_AFFINITY=$4
Limiting the OMP threads alone is not enough, since the framework (e.g. torch/TF) may have its own thread pool as well. Maybe use taskset instead?
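The suggestion can be sketched as follows (core 0 and the `echo` command are illustrative, not taken from the PR):

```shell
# taskset pins every thread of the process, including any non-OpenMP
# thread pools, while the OMP_* variables only steer OpenMP itself.
export OMP_NUM_THREADS=1 GOMP_CPU_AFFINITY=0
taskset -c 0 echo "pinned"
```

In the real script the `echo` would be the benchmark command, so framework-internal pools inherit the same CPU mask.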
fixed
.github/workflows/torchbench.yml
Outdated
secrets: inherit
TorchBenchCpuFull8Threads:
Better to encode the CPU type into the name as well.
Right now cpu means x86 only; aarch64 has its own name.
results=(eval-cpu-fp32)
else
results=(eval-cuda-fp32 eval-cuda-fp16)
fi
python3 torchbenchmark/onnxrt_helper.py
Note that the latest ORT may use its own thread pool instead of OMP, so we may need extra thread-related settings for it. This can be left for a future improvement.
config_file=blade_$1_$2.yaml
bench_target=$2
binding_cores=1
Using 0 as the default is better in case only 1 core is used for testing.
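One way to express such defaults (a sketch, assuming the script's positional args: $3 = thread count, $4 = core list) is parameter expansion instead of an if/else branch:

```shell
# Fall back to 1 thread on core 0 when the optional args are not given.
export OMP_NUM_THREADS="${3:-1}"
binding_cores="${4:-0}"
echo "threads=${OMP_NUM_THREADS} cores=${binding_cores}"
```

Run without arguments this prints `threads=1 cores=0`.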
fixed
export GOMP_CPU_AFFINITY="2-5"
if [ ! -n "$3" ]
then
export OMP_NUM_THREADS=1 GOMP_CPU_AFFINITY=4
Why use different default values for GOMP_CPU_AFFINITY and taskset?
The original code used it this way. Now I use the same value.
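A sketch of how the two settings can be kept from drifting apart again (the core list `0` is illustrative; CI might use something like `0-7`):

```shell
# Keep one core-list variable and feed it to both mechanisms.
cores="0"
export GOMP_CPU_AFFINITY="$cores"
taskset -c "$cores" echo "pinned to $cores"
```

With a single source of truth, changing the core set in one place updates both the OpenMP affinity and the process mask.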
if [ $1 == "cpu" ]
then
# 4 cores
export GOMP_CPU_AFFINITY="2-5"
if [ ! -n "$3" ]
nit: ! -n is identical to -z
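The equivalence the nit points out can be seen directly:

```shell
# Both forms test for the empty string and always agree.
x=""
[ ! -n "$x" ] && echo "! -n says empty"
[ -z "$x" ] && echo "-z says empty"
```

`-z` is simply the more idiomatic spelling of the same test.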
results=(eval-cpu-fp32)
taskset -c $bingding_cores python3 torchbenchmark/.github/scripts/run-config.py \
Misspelled: bingding_cores should be binding_cores.
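This class of typo can be caught mechanically; a small sketch using `set -u`:

```shell
#!/bin/bash
# With `set -u`, a reference to an undefined variable (such as the
# misspelled $bingding_cores) aborts the script instead of silently
# expanding to an empty string.
set -u
binding_cores="0"
echo "cores=${binding_cores}"
# echo "${bingding_cores}"   # would abort here: "unbound variable"
```

Without `set -u`, the taskset line would have run with an empty core list.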
Manually trigger this workflow in https://github.com/alibaba/BladeDISC/actions/workflows/torchbench.yml?
.github/workflows/torchbench.yml
Outdated
base_image: bladedisc/bladedisc:latest-runtime-torch1.12.0-cpu
device: cpu_benchmark
dockerfile: docker/cronjobs/Dockerfile.torch.bench
extra_envs: -e RELATED_DIFF_PERCENT=3
The diff threshold has changed to 5%.
fixed
.github/workflows/torchbench.yml
Outdated
dockerfile: docker/cronjobs/Dockerfile.torch.bench
extra_envs: -e RELATED_DIFF_PERCENT=3
exec_command: bash ./pytorch_blade/benchmark/TorchBench/test_torch_bench.sh cpu full 8 "0-7"
Maybe we can loop over the different configs in one job instead of creating a job for each config.
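A sketch of that suggestion, iterating over the configs inside one job (the argument tuples echo the exec_command examples visible in this PR):

```shell
# One job, one loop over the benchmark configurations.
for cfg in "cpu full 8 0-7" "cpu tiny 32 0-63:2"; do
  echo "run: test_torch_bench.sh $cfg"
done
```

Each iteration would invoke the benchmark script with one configuration tuple.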
.github/workflows/torchbench.yml
Outdated
name: torch-offcial-benchmark
base_image: bladedisc/bladedisc:latest-runtime-torch1.12.0-cpu-aarch64
device: cpu_benchmark
Since we no longer use an extra machine for the benchmark, we can directly use aarch64 / cpu for the aarch64 / cpu benchmarks now.
@@ -21,6 +24,9 @@ fi
# for CI git-lfs permission problems
pushd $benchmark_repo_dir
# cache venv in benchmark dir
if [ $1 == "aarch64" ]; then
rm -rf ./venv && cp -r /opt/venv_disc ./venv
Maybe don't remove this folder, so the venv can be cached.
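A sketch of the caching idea, with temp dirs standing in for the real /opt/venv_disc and ./venv paths: seed the venv only when no cached copy exists, instead of always removing it first.

```shell
# src stands in for the prebuilt venv, work/venv for the cache location.
src=$(mktemp -d); work=$(mktemp -d)
touch "$src/marker"
if [ ! -d "$work/venv" ]; then
  cp -r "$src" "$work/venv"    # first run: copy the prebuilt venv
fi
[ -f "$work/venv/marker" ] && echo "venv ready"
```

On later runs the `cp` is skipped and the cached venv (with any installed packages) survives.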
LGTM!
No description provided.