HUB- local agent: CUDA not detected #45

Closed
1 task done
barney2074 opened this issue Jun 7, 2022 · 7 comments
Labels
question A HUB question that does not involve a bug

Comments

@barney2074

Search before asking

Question

Hello,

When using the HUB local agent on Windows, how can I get it to utilise the graphics card?
I get a 'CUDA not detected' error message:
[screenshot of the error]

  • I have tried Colab (I have a Pro subscription), but, as is my usual experience with Colab, it gets 90% finished, then disconnects and won't let me back on with a GPU. Very annoying!
  • I also have a Linux machine with an RTX 3060, which I can try once it has finished another job.

thank you

Andrew

@glenn-jocher
Member

@barney2074 your PyTorch installation is not recognizing any CUDA devices. See https://pytorch.org/get-started/locally/ for installation instructions.

You can check that PyTorch is installed with GPU support by running `import torch; print(torch.cuda.is_available())` in Python. It prints True when a CUDA device is usable.
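
A slightly fuller version of that one-line check can help narrow down why CUDA is not detected. The sketch below is only an illustration: `cuda_report` is a hypothetical helper name, not part of HUB or PyTorch, and it relies only on standard PyTorch attributes (`torch.__version__`, `torch.version.cuda`, `torch.cuda.is_available()`, `torch.cuda.device_count()`, `torch.cuda.get_device_name()`). It also runs when PyTorch cannot be imported, in which case it simply reports that.

```python
def cuda_report():
    """Collect basic facts about the local PyTorch/CUDA setup as a dict."""
    report = {
        "torch_installed": False,
        "torch_version": None,
        "cuda_build": None,       # CUDA version PyTorch was built against
        "cuda_available": False,
        "devices": [],
    }
    try:
        import torch
    except (ImportError, OSError):
        # PyTorch is missing or failed to load; nothing more to inspect.
        return report

    report["torch_installed"] = True
    report["torch_version"] = torch.__version__
    report["cuda_build"] = torch.version.cuda  # None on CPU-only builds
    report["cuda_available"] = torch.cuda.is_available()
    if report["cuda_available"]:
        report["devices"] = [
            torch.cuda.get_device_name(i)
            for i in range(torch.cuda.device_count())
        ]
    return report


if __name__ == "__main__":
    for key, value in cuda_report().items():
        print(f"{key}: {value}")
```

If `cuda_build` comes back as None, the installed PyTorch is a CPU-only build, and no driver change will make `torch.cuda.is_available()` return True until a CUDA-enabled build is installed.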

@barney2074
Author

thanks @glenn-jocher

I played around with it for ages, trying different graphics driver and CUDA versions, but couldn't get PyTorch to recognise the GPU.

In the end I gave up on the Windows machine and used the Linux one instead.
Model training and deployment to an iOS device worked perfectly.

thanks,

Andrew
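
When changing driver and CUDA versions makes no difference, as described in the comment above, it can help to separate a CPU-only PyTorch build from a driver-side problem before trying anything else. The following is a rough sketch, not a diagnosis of this particular machine: `diagnose` and its return labels are hypothetical, and treating `nvidia-smi` on the PATH as evidence of an installed driver is an assumption that gives only a hint.

```python
import shutil


def diagnose(cuda_build, cuda_available, driver_tool_found):
    """Classify a 'CUDA not detected' situation from three observations.

    cuda_build        -- value of torch.version.cuda (None on CPU-only builds)
    cuda_available    -- value of torch.cuda.is_available()
    driver_tool_found -- True if the nvidia-smi tool was found on the PATH
    """
    if cuda_available:
        return "ok"
    if cuda_build is None:
        # PyTorch itself has no CUDA support; drivers are irrelevant here.
        return "cpu_only_build"
    if not driver_tool_found:
        # Hint only: the driver may be missing or just not on the PATH.
        return "driver_not_found"
    # CUDA build and driver tool both present, yet no usable device.
    return "device_not_visible"


if __name__ == "__main__":
    # With PyTorch installed, the real inputs would be torch.version.cuda
    # and torch.cuda.is_available(); literals keep this demo standalone.
    found = shutil.which("nvidia-smi") is not None
    print(diagnose(None, False, found))  # prints "cpu_only_build"
```

The order of the checks is deliberate: a CPU-only build is tested first because in that case no driver or CUDA toolkit change can help.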

@glenn-jocher
Member

@barney2074 @kalenmike thanks for the info! From your screenshots it does seem like it's a PyTorch install issue.

In our investigations we also identified a new failure mode with AutoBatch on consumer GPUs, but this appears unrelated to your original issue. In any case we should be pushing an AutoBatch update soon which will resolve that.

@glenn-jocher
Member

@barney2074 closing as environment issue. Please let us know if you run into any other issues or think of features you'd like to see!

@barney2074
Author

thanks @glenn-jocher

I was wondering whether Docker would be an easier option for the local HUB agent, though I suppose not everyone has Docker, and you need NVIDIA Docker as well.

@kalenmike
Contributor

@barney2074 That's a good suggestion and something I think we will look into implementing. We want to make using local agents as easy as possible.

@glenn-jocher
Member

@barney2074 yes, offering this as an additional local option isn't a bad idea. Then, rather than needing to debug local Python environments, the main requirement would be to have Docker installed correctly, along with all NVIDIA/CUDA drivers.
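
The two requirements named above could be checked up front with something like the following. This is a minimal sketch under stated assumptions, not an existing HUB feature: `docker_gpu_preflight` is a hypothetical name, the check only looks for the `docker` and `nvidia-smi` executables on the PATH, and it does not verify that the NVIDIA container runtime mentioned earlier in the thread is set up.

```python
import shutil


def docker_gpu_preflight(which=shutil.which):
    """Report whether the basic tools for a GPU container are on the PATH.

    `which` is injectable so the check can be exercised without touching
    the real system; by default it is shutil.which.
    """
    checks = {
        "docker": which("docker") is not None,
        "nvidia_driver": which("nvidia-smi") is not None,
    }
    # Ready only if every individual check passed.
    checks["ready"] = all(checks.values())
    return checks


if __name__ == "__main__":
    for name, ok in docker_gpu_preflight().items():
        print(f"{name}: {'found' if ok else 'missing'}")
```

Finding both executables is a necessary condition rather than a sufficient one, since a container can still fail to see the GPU if the NVIDIA container support is not installed.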
