
GPU build broken with CUDA SDK 12.0 #13932

Closed
tufei opened this issue Dec 10, 2022 · 35 comments
Labels
ep:CUDA issues related to the CUDA execution provider

Comments

tufei commented Dec 10, 2022

Describe the issue

It seems that ONNX Runtime has a hard dependency on CUDA SDK 11.x?

[dnn_onnxruntime @ 0x3f7ccc0] SessionOptionsAppendExecutionProvider_CUDA(): /onnxruntime_src/onnxruntime/core/session/provider_bridge_ort.cc:1069 onnxruntime::Provider& onnxruntime::ProviderLibrary::Get() [ONNXRuntimeError] : 1 : FAIL : Failed to load library libonnxruntime_providers_cuda.so with error: libcublasLt.so.11: cannot open shared object file: No such file or directory

To reproduce

On Fedora 36, update to the latest CUDA SDK 12.0, then try some examples.

Urgency

No response

Platform

Linux

OS Version

Fedora 36

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

1.13.1

ONNX Runtime API

C

Architecture

X64

Execution Provider

CUDA

Execution Provider Library Version

CUDA 12.0

@github-actions github-actions bot added the ep:CUDA issues related to the CUDA execution provider label Dec 10, 2022
snnn (Member) commented Dec 12, 2022

If your system doesn't have libcublasLt.so.11, then the answer is: "yes".

tufei (Author) commented Dec 13, 2022

Thanks. What I meant to ask: is it possible that future releases link only against libcublasLt.so rather than libcublasLt.so.11, and then check versions at runtime using the proper API calls?

Regards,

snnn (Member) commented Dec 13, 2022

@tufei, would you mind showing me an example? We didn't explicitly put the name "libcublasLt.so.11" in the link command. We put "-lcublasLt" there and the linker resolved it to "libcublasLt.so.11". Most Linux shared libraries work this way. I don't know how to change it.

smartnet-club commented:

terminate called after throwing an instance of 'Ort::Exception'
what(): /onnxruntime_src/onnxruntime/core/session/provider_bridge_ort.cc:1069 onnxruntime::Provider& onnxruntime::ProviderLibrary::Get() [ONNXRuntimeError] : 1 : FAIL : Failed to load library libonnxruntime_providers_cuda.so with error: libcublasLt.so.11: cannot open shared object file: No such file or directory

ldd libonnxruntime_providers_cuda.so
libcublasLt.so.11 => not found
libcublas.so.11 => not found

borisfom (Contributor) commented:

@tufei: You should be able to install the CUDA 11 libraries alongside your CUDA 12 as a workaround.
@snnn: it would be nice to have a separate onnxruntime-gpu wheel built with CUDA 12 available. Is that in your near-term plans?

dzhao commented Feb 11, 2023

May I ask when ONNX Runtime will support CUDA 12, or even support building with CUDA 12?

zeruniverse commented:

I think #14659 needs to be merged into the latest release to fix the CUDA 12 build.

chriskyndrid commented:

+1

For the time being, I resolved this as follows:

1. Add the fc35 CUDA repo:

sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/fedora35/x86_64/cuda-fedora35.repo
sudo dnf clean all

If you're using the RPM Fusion repositories for your display drivers:

sudo dnf module disable nvidia-driver

2. Install CUDA 11.8:

sudo dnf install cuda-11-8

3. Install cuDNN. You may/will need to make sure you have the appropriate libcudnn8 installed for the version of CUDA you are using:

sudo dnf install https://developer.download.nvidia.com/compute/machine-learning/repos/rhel8/x86_64/nvidia-machine-learning-repo-rhel8-1.0.0-1.x86_64.rpm

and, e.g.:

sudo dnf install libcudnn8 libcudnn8-devel libnccl libnccl-devel

You can browse the packages from the rhel8 repo here.

After the above I can successfully run inference. I'm using the ort crate for Rust with models I converted (mostly) from PyTorch to ONNX.

snnn (Member) commented Apr 5, 2023

@snnn : it would be nice to have a separate onnxruntime-gpu wheel built with CUDA 12 available. Is that in your nearest plans ?

Right now, each of our packages only works with a specific CUDA minor version. For example, the last one only works with CUDA 11.6 and the next one will only work with CUDA 11.8. At some point that will become CUDA 12-point-something. If you have more questions about the project's future plans, you can ask @pranavsharma.

AkshayUpadhye commented:

I also had a similar issue; building onnxruntime from source helped!

snnn (Member) commented Aug 20, 2023

The latest code should work fine on Windows with CUDA 12.2. I am adding a build pipeline for it: #17231

wongwenxin commented:

I had a similar problem when using CLion to compile the onnxruntime C++ example in YOLOv8!

[DCSP_ONNX]:/onnxruntime_src/onnxruntime/core/session/provider_bridge_ort.cc:1131 onnxruntime::Provider& onnxruntime::ProviderLibrary::Get() [ONNXRuntimeError] : 1 : FAIL : Failed to load library libonnxruntime_providers_cuda.so with error: libcublasLt.so.11: cannot open shared object file: No such file or directory

But I can find libcublasLt.so.11 under /usr/local/cuda11.8/lib64

Please help!!!

snnn (Member) commented Sep 7, 2023

@wongwenxin See https://man7.org/linux/man-pages/man8/ld.so.8.html for how Linux finds dynamic libraries. You might need to run ldconfig to add the directory to the operating system's loader cache, or set the LD_LIBRARY_PATH environment variable.
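As a sketch of those two options (the CUDA install path below is illustrative; adjust it to wherever libcublasLt.so.11 actually lives on your system):

```shell
# Option 1: per-session, extend the dynamic loader's search path.
export LD_LIBRARY_PATH=/usr/local/cuda-11.8/lib64${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}

# Option 2: system-wide, register the directory and refresh the loader cache
# (needs root, hence shown commented out):
#   echo /usr/local/cuda-11.8/lib64 | sudo tee /etc/ld.so.conf.d/cuda-11-8.conf
#   sudo ldconfig

# Check what the loader cache can currently resolve (empty if not registered).
ldconfig -p | grep libcublasLt || true
```

Note that LD_LIBRARY_PATH only affects processes started from that shell, while the ld.so.conf.d route affects the whole system.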

I will close this issue now because it is as designed. All our prebuilt packages were built with CUDA 11.x. They are not compatible with CUDA 12.x. However, you can build ONNX Runtime from source with CUDA 12.x if you need to use that version of CUDA.

Feel free to open a new issue if you hit any build error with that.

@snnn snnn closed this as completed Sep 7, 2023
jrabek commented Dec 12, 2023

Has anyone built the onnx runtime with cuda 12.x successfully? What would be the best instructions to use to do so?

snnn (Member) commented Dec 12, 2023

The latest code should work fine with CUDA 12.2. And we have a nightly package for it. https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/ort-cuda-12-nightly/PyPI/ort-nightly-gpu

jrabek commented Dec 12, 2023

Thank you for the comment and link @snnn! Much appreciated 🙏

FrancescoSaverioZuppichini commented:

The latest code should work fine with CUDA 12.2. And we have a nightly package for it. https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/ort-cuda-12-nightly/PyPI/ort-nightly-gpu

having this error

[screenshot of the error]

thank you so much

FrancescoSaverioZuppichini commented:

Tried with


pip install ort-nightly-gpu

working!

vladoossss commented Dec 25, 2023

The latest code should work fine with CUDA 12.2. And we have a nightly package for it. https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/ort-cuda-12-nightly/PyPI/ort-nightly-gpu

Tried this:

pip install ort-nightly-gpu==1.17.0.dev20231205004 --index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ort-cuda-12-nightly/pypi/simple/

But saw this error:

ERROR: Could not find a version that satisfies the requirement ort-nightly-gpu==1.17.0.dev20231205004 (from versions: none)
ERROR: No matching distribution found for ort-nightly-gpu==1.17.0.dev20231205004

FrancescoSaverioZuppichini commented:

The latest code should work fine with CUDA 12.2. And we have a nightly package for it. https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/ort-cuda-12-nightly/PyPI/ort-nightly-gpu

Tried this:

pip install ort-nightly-gpu==1.17.0.dev20231205004 --index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ort-cuda-12-nightly/pypi/simple/

But saw this error:

ERROR: Could not find a version that satisfies the requirement ort-nightly-gpu==1.17.0.dev20231205004 (from versions: none) ERROR: No matching distribution found for ort-nightly-gpu==1.17.0.dev20231205004

look at my message above, try with

pip install ort-nightly-gpu

vladoossss commented:

The latest code should work fine with CUDA 12.2. And we have a nightly package for it. https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/ort-cuda-12-nightly/PyPI/ort-nightly-gpu

Tried this:

pip install ort-nightly-gpu==1.17.0.dev20231205004 --index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ort-cuda-12-nightly/pypi/simple/

But saw this error:
ERROR: Could not find a version that satisfies the requirement ort-nightly-gpu==1.17.0.dev20231205004 (from versions: none) ERROR: No matching distribution found for ort-nightly-gpu==1.17.0.dev20231205004

look at my message above, try with

pip install ort-nightly-gpu

With this command you installed ort-nightly-gpu==1.15.dev
But this version will not work with CUDA 12.

FrancescoSaverioZuppichini commented:

The latest code should work fine with CUDA 12.2. And we have a nightly package for it. https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/ort-cuda-12-nightly/PyPI/ort-nightly-gpu

Tried this:

pip install ort-nightly-gpu==1.17.0.dev20231205004 --index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ort-cuda-12-nightly/pypi/simple/

But saw this error:
ERROR: Could not find a version that satisfies the requirement ort-nightly-gpu==1.17.0.dev20231205004 (from versions: none) ERROR: No matching distribution found for ort-nightly-gpu==1.17.0.dev20231205004

look at my message above, try with

pip install ort-nightly-gpu

With this command you installed ort-nightly-gpu==1.15.dev But this version will not work with CUDA 12.

I have CUDA 12 and it works 🤔

RoM4iK commented Dec 27, 2023

For people who come after me: you can download the wheel here:
https://dev.azure.com/onnxruntime/onnxruntime/_artifacts/feed/onnxruntime-cuda-12/PyPI/onnxruntime-gpu/overview/1.17.0

arcayi commented Jan 17, 2024

The latest code should work fine with CUDA 12.2. And we have a nightly package for it. https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/ort-cuda-12-nightly/PyPI/ort-nightly-gpu

Tried this:

pip install ort-nightly-gpu==1.17.0.dev20231205004 --index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ort-cuda-12-nightly/pypi/simple/

But saw this error:
ERROR: Could not find a version that satisfies the requirement ort-nightly-gpu==1.17.0.dev20231205004 (from versions: none) ERROR: No matching distribution found for ort-nightly-gpu==1.17.0.dev20231205004

look at my message above, try with

pip install ort-nightly-gpu

With this command you installed ort-nightly-gpu==1.15.dev But this version will not work with CUDA 12.

I have CUDA 12 and it works 🤔

this works. thanks

ZhangHangjianMA commented Apr 23, 2024

The latest code should work fine with CUDA 12.2. And we have a nightly package for it. https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/ort-cuda-12-nightly/PyPI/ort-nightly-gpu

Tried this:

pip install ort-nightly-gpu==1.17.0.dev20231205004 --index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ort-cuda-12-nightly/pypi/simple/

But saw this error:
ERROR: Could not find a version that satisfies the requirement ort-nightly-gpu==1.17.0.dev20231205004 (from versions: none) ERROR: No matching distribution found for ort-nightly-gpu==1.17.0.dev20231205004

look at my message above, try with

pip install ort-nightly-gpu

With this command you installed ort-nightly-gpu==1.15.dev But this version will not work with CUDA 12.

I have CUDA 12 and it works 🤔

this works. thanks

I followed the solution in Fannovel16/comfyui_controlnet_aux#75 (comment):

pip install ort-nightly-gpu --index-url=https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ort-cuda-12-nightly/pypi/simple/
pip install onnxruntime-gpu==1.17.0 --index-url=https://pkgs.dev.azure.com/onnxruntime/onnxruntime/_packaging/onnxruntime-cuda-12/pypi/simple/

It works for me, but with a warning:
onnx2trt_utils.cpp:374: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.

xenova commented May 24, 2024

Has anyone got this working with onnxruntime-gpu==1.18.0? (https://pypi.org/project/onnxruntime-gpu/1.18.0/)

meikuam commented Jun 18, 2024

Has anyone got this working with onnxruntime-gpu==1.18.0? (https://pypi.org/project/onnxruntime-gpu/1.18.0/)

I'm facing the same issue too. It seems it's not fixed in the newer version of the package.

geraldstanje commented Jun 20, 2024

I see the same error:

root@fdd2e200ddd1:/workspace# find / -name "libcublasLt.so.11"
root@fdd2e200ddd1:/workspace# find / -name "libcublasLt.so"
/usr/local/cuda-12.3/targets/x86_64-linux/lib/stubs/libcublasLt.so
/usr/local/cuda-12.3/targets/x86_64-linux/lib/libcublasLt.so

find / -name "libonnxruntime_providers_cuda.so"
/usr/local/lib/python3.10/dist-packages/onnxruntime/capi/libonnxruntime_providers_cuda.so
/opt/tritonserver/backends/onnxruntime/libonnxruntime_providers_cuda.so

2024-06-20 16:01:03.906843992 [E:onnxruntime:Default, provider_bridge_ort.cc:1744 TryGetProviderInfo_CUDA] 
/onnxruntime_src/onnxruntime/core/session/provider_bridge_ort.cc:1426 onnxruntime::Provider& 
onnxruntime::ProviderLibrary::Get() [ONNXRuntimeError] : 1 : FAIL : Failed to load library 
libonnxruntime_providers_cuda.so with error: libcublasLt.so.11: cannot open shared object file: No such file or directory

More info:
onnx/sklearn-onnx#1111

cc @pranavsharma - can we open a new issue, or is there a solution to this?

snnn (Member) commented Jun 20, 2024

Did you get the package from https://pkgs.dev.azure.com/onnxruntime/onnxruntime/_packaging/onnxruntime-cuda-12/pypi/simple/ ?

geraldstanje commented Jun 20, 2024

@snnn thanks for the quick reply! No, from: https://pypi.org/project/onnxruntime-gpu/
I could try the following; then it should be fixed?

pip install onnxruntime-gpu==1.18.0 --index-url=https://pkgs.dev.azure.com/onnxruntime/onnxruntime/_packaging/onnxruntime-cuda-12/pypi/simple/

Also, what is ort-nightly-gpu?

@snnn there is no 1.18.0 version?

snnn (Member) commented Jun 20, 2024

Sorry I gave you the wrong URL. The URL should be

https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/
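Combining that feed URL with the package name from earlier in the thread, the install command would look like the following (whether a particular version is published on that feed is an assumption to verify, so no version pin is given here):

```shell
# Install the CUDA 12 build of onnxruntime-gpu from the feed above
# (network access to the feed is required).
pip install onnxruntime-gpu --index-url=https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/
```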

geraldstanje commented:

Sorry I gave you the wrong URL. The URL should be

https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/

What's the difference between the two URLs? Does the new URL have onnxruntime-gpu==1.18.0?

Also, what is ort-nightly-gpu?

meikuam commented Jun 26, 2024

Sorry I gave you the wrong URL. The URL should be

https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/

Thanks, that helped fix this problem for now.

snnn (Member) commented Jun 29, 2024

https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/onnxruntime-cuda-12 is where we host our CUDA 12 Python, NuGet, and Maven packages. You can click the "Connect to feed" button to see instructions.

https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/ort-cuda-12-nightly is similar, but it only hosts nightly packages, and it currently doesn't have nightly packages for Python. (We are working on it.)
