fixed the GPU indexing error when running in colab #1229

mhwahdan · 2022-12-09T11:11:36Z

python train.py --workers 8 --device 0 --batch-size 16 --data data.yaml --img 640 640 --cfg cfg/training/yolov7.yaml --weights yolov7x.pt --name yolov7 --hyp data/hyp.scratch.p5.yaml

I got this error

RuntimeError: indices should be either on cpu or on the same device as the indexed tensor (cpu)

I modified the loss.py file to automatically get the index of the default GPU selected using torch.device('cuda') function

fixes #1224 #1045 #1101 #1225

when i used the command python train.py --workers 8 --device 0 --batch-size 16 --data data.yaml --img 640 640 --cfg cfg/training/yolov7.yaml --weights yolov7x.pt --name yolov7 --hyp data/hyp.scratch.p5.yaml I got this error RuntimeError: indices should be either on cpu or on the same device as the indexed tensor (cpu) I modified the loss.py file to automatically get the index of the default GPU selected using torch.device('cuda') function fixes WongKinYiu#1225 WongKinYiu#1224 WongKinYiu#1101 WongKinYiu#1045

fixed the GPU indexing error when running in colab

mateusz-lichota · 2022-12-09T21:17:15Z

+1, this change really needs to get merged in order for gpu training to be a painless process

marekjg · 2022-12-11T15:52:20Z

utils/loss.py

@@ -682,8 +682,7 @@ def build_targets(self, p, targets, imgs):
                all_gj.append(gj)
                all_gi.append(gi)
                all_anch.append(anch[i][idx])
-                from_which_layer.append(torch.ones(size=(len(b),)) * i)
-
+                from_which_layer.append((torch.ones(size=(len(b),)) * i).to('cuda'))


I think it would be better to initialize torch.ones to targets (or some other nearby tensor) to make cpu case work as well

You are right

I have modified the code to detect the targets types and set the torch.ones according to them
That makes the CPU and GPU cases work

Solved my problem!

Great. I wonder how it was working before without a need for such modification :)

Let's hope it gets merged

@WongKinYiu
Your review would be much appreciated :)

WongKinYiu#1229

SkalskiP · 2022-12-27T16:54:37Z

@WongKinYiu / @AlexeyAB, are there any plans to merge that change? The training script does not work with the latest PyTorch, which makes your installation instructions not work.

magedhelmy1 · 2023-01-11T19:09:15Z

Fixes train_aux bug with masks

mhwahdan added 2 commits December 9, 2022 13:07

Merge pull request #1 from RobEn-AAST/error-when-running-in-colab-patch

9fd805f

fixed the GPU indexing error when running in colab

marekjg reviewed Dec 11, 2022

View reviewed changes

autochange the tensor types according to the target devices

f8e9fb7

mhwahdan requested a review from marekjg December 15, 2022 10:04

TimoLob added a commit to TimoLob/yolov7 that referenced this pull request Dec 27, 2022

Based on "fixed the GPU indexing error when running in colab"

d3645dd

WongKinYiu#1229

Merge branch 'main' into main

c03a67f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fixed the GPU indexing error when running in colab #1229

fixed the GPU indexing error when running in colab #1229

mhwahdan commented Dec 9, 2022

mateusz-lichota commented Dec 9, 2022

marekjg Dec 11, 2022

mhwahdan Dec 13, 2022

asanc199 Dec 13, 2022

mhwahdan Dec 14, 2022

Walid-Ahmed Dec 16, 2022

mhwahdan Dec 16, 2022

mhwahdan Dec 16, 2022

SkalskiP commented Dec 27, 2022

magedhelmy1 commented Jan 11, 2023

fixed the GPU indexing error when running in colab #1229

Are you sure you want to change the base?

fixed the GPU indexing error when running in colab #1229

Conversation

mhwahdan commented Dec 9, 2022

mateusz-lichota commented Dec 9, 2022

marekjg Dec 11, 2022

Choose a reason for hiding this comment

mhwahdan Dec 13, 2022

Choose a reason for hiding this comment

asanc199 Dec 13, 2022

Choose a reason for hiding this comment

mhwahdan Dec 14, 2022

Choose a reason for hiding this comment

Walid-Ahmed Dec 16, 2022

Choose a reason for hiding this comment

mhwahdan Dec 16, 2022

Choose a reason for hiding this comment

mhwahdan Dec 16, 2022

Choose a reason for hiding this comment

SkalskiP commented Dec 27, 2022

magedhelmy1 commented Jan 11, 2023