Simplified Inference #1045

glenn-jocher · 2020-09-25T22:46:09Z

This PR implements standalone, independent inference classes and methods for PyTorch Hub and a future pip package.

OpenCV Example:

import cv2
import torch

# Model
model = torch.hub.load('ultralytics/yolov5', 'yolov5s', pretrained=True).autoshape()

# Image
torch.hub.download_url_to_file('https://github.com/ultralytics/yolov5/blob/master/inference/images/zidane.jpg?raw=true', 'image.jpg')
img = cv2.imread('image.jpg')

# Inference
prediction = model(img[:, :, ::-1], shape=640)  # BGR to RGB

# Plot
for *box, conf, cls in prediction[0]:  # [xy1, xy2], confidence, class
    print('%s %.2f' % (model.names[int(cls)], conf))  # label
    pt1, pt2 = (int(box[0]), int(box[1])), (int(box[2]), int(box[3]))  # cv2 expects integer pixel coordinates
    cv2.rectangle(img, pt1=pt1, pt2=pt2, color=[255, 255, 255], thickness=2)  # plot
cv2.imwrite('results.jpg', img)  # save

Output:

100% 165k/165k [00:00<00:00, 5.11MB/s]

person 0.87
person 0.80
tie 0.78
tie 0.28
True

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

🌟 Summary

Updated dependencies, improved training stability, added NMS control, and streamlined dataset acquisition methods.

📊 Key Changes

📦 Updated NVIDIA PyTorch Docker image from 20.08 to 20.09.
🎛 Changed giou loss term in hyperparameter YAML files to box.
🔄 Modified COCO dataset download script to pack multiple commands and improved readability.
🔧 Simplified VOC dataset download script to be more concise and organized.
✂️ Removed unnecessary import in detect.py and adjusted confidence/threshold defaults.
💡 Moved NMS (Non-Maximum Suppression) layer control from models to separate class for modularity.
🔁 Refined autoShape class to handle cv2/np/PIL/torch inputs uniformly.
📚 Updated requirements.txt by commenting out a specific coremltools version requirement.
🛠 Replaced giou (Generalized Intersection over Union) with box loss throughout the codebase.

🎯 Purpose & Impact

🏗 Ensures the underlying Docker container has the latest stable libraries for PyTorch.
📐 Adjusts hyperparameters to enhance model training stability and performance.
🚀 Streamlines dataset scripts for ease of use and maintainability.
🧠 Introduces modularity to the NMS process, allowing for better future extensions.
✅ Facilitates easier input handling for more flexible model predictions.
🗂 Encourages best practices in dependency management by avoiding hard version locks where not necessary.
🧹 General cleanup and standardization of loss terminology for clearer understanding across the codebase.

These updates prepare the code for future enhancements, promote better practices in managing dependencies, and make it simpler for users to acquire datasets. Such improvements may positively affect the usability, efficiency, and reproducibility of the models for developers and end-users alike.

glenn-jocher · 2020-09-25T23:37:58Z

PIL Example:

import numpy as np
import torch
from PIL import Image, ImageDraw

# Model
model = torch.hub.load('ultralytics/yolov5', 'yolov5s', pretrained=True).autoshape()

# Image
torch.hub.download_url_to_file('https://github.com/ultralytics/yolov5/blob/master/inference/images/zidane.jpg?raw=true', 'image.jpg')
img = Image.open('image.jpg')

# Inference
prediction = model(img, shape=640)

# Plot
draw = ImageDraw.Draw(img)
for *box, conf, cls in prediction[0]:  # [xy1, xy2], confidence, class
    print('%s %.2f' % (model.names[int(cls)], conf))  # label
    draw.rectangle(box, width=3)  # plot
img.save('results.jpg')  # save

Output:

100% 165k/165k [00:00<00:00, 5.16MB/s]

person 0.87
person 0.80
tie 0.78
tie 0.28

NanoCode012 · 2020-09-26T05:23:34Z

Hi @glenn-jocher, would this support inference on multiple images?

For example, if img were an array of images, would prediction be an array of predictions?

Also, I noticed that you lowered the IoU and confidence thresholds. Is there a reason?

glenn-jocher · 2020-09-27T01:47:51Z

@NanoCode012 I've just updated it to support batched inference now, I know it's a popular request. It autocomputes the minimum inference size per the shape argument. For zidane.jpg and bus.jpg in a batch for example, it will use 640x640 to accommodate both vertical and horizontal rectangular images at 640. If it was just bus.jpg it would be 640x480 vertical, and if it was just zidane.jpg it would run at 384x640 horizontal, so it's optimally shaped under all conditions.

This should super-simplify inference for most custom use cases I think.

# Images
img1 = Image.open('inference/images/zidane.jpg')
img2 = Image.open('inference/images/bus.jpg')

# Batched inference
prediction = model([img1, img2], shape=640)

# Plot
for i, img in enumerate([img1, img2]):
    for *box, conf, cls in prediction[i]:  # [xy1, xy2], confidence, class
        print('%s %.2f' % (model.names[int(cls)], conf))  # label
        ImageDraw.Draw(img).rectangle(box, width=3)  # plot
    img.save('results%g.jpg' % i)  # save
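
The shape selection described in this comment can be sketched roughly as follows. This is a minimal illustration, not the PR's implementation: the helper name `batch_inference_shape`, the stride of 32, and the image dimensions (720x1280 for zidane.jpg and 1080x810 for bus.jpg, height x width) are assumptions made for the example.

```python
import math


def batch_inference_shape(shapes, size=640, stride=32):
    """Return one (height, width) that fits every image in the batch.

    shapes: list of (height, width) tuples for the original images.
    size:   target length of each image's longest side.
    stride: assumed model stride; output dimensions are rounded up to a multiple of it.
    """
    scaled = []
    for h, w in shapes:
        longest = max(h, w)
        # Scale so the longest side equals `size`, keeping the aspect ratio.
        scaled.append((h * size / longest, w * size / longest))
    # Take the largest scaled height and width over the batch.
    max_h = max(s[0] for s in scaled)
    max_w = max(s[1] for s in scaled)
    # Round each dimension up to the nearest stride multiple.
    return tuple(math.ceil(v / stride) * stride for v in (max_h, max_w))


zidane, bus = (720, 1280), (1080, 810)  # assumed (height, width)
print(batch_inference_shape([zidane]))       # (384, 640)
print(batch_inference_shape([bus]))          # (640, 480)
print(batch_inference_shape([zidane, bus]))  # (640, 640)
```

Under these assumptions the three results agree with the sizes quoted above: 384x640 for zidane.jpg alone, 640x480 for bus.jpg alone, and 640x640 for the pair.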

glenn-jocher · 2020-09-27T01:56:12Z

@NanoCode012 yes, I also lowered the IoU and confidence thresholds for this NMS module. After playing around with the sliders in iDetection I realized lower values seemed to produce visually better results (qualitatively speaking), so I released the iDetection v7.7 update with defaults of 0.4 IoU and 0.2 confidence (which can now be adjusted manually with the sliders).

I also separately saw that the official CoreML NMS defaults are 0.45 IoU and 0.25 confidence, so I decided to adopt those values here. I should probably update the detect.py defaults as well in this PR to match.
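
To make the effect of the two thresholds concrete, here is a small pure-Python sketch of greedy NMS. It is not the repository's NMS code (which, as far as I know, operates on tensors and handles classes separately); the function names `iou` and `simple_nms` and the sample boxes are made up for illustration.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def simple_nms(detections, conf_thres=0.25, iou_thres=0.45):
    """Greedy, class-agnostic NMS over (x1, y1, x2, y2, conf) tuples."""
    # Drop low-confidence candidates, then visit the rest from highest confidence down.
    candidates = sorted((d for d in detections if d[4] >= conf_thres),
                        key=lambda d: d[4], reverse=True)
    kept = []
    for det in candidates:
        # Keep a box only if it does not overlap an already kept box too much.
        if all(iou(det[:4], k[:4]) <= iou_thres for k in kept):
            kept.append(det)
    return kept


dets = [
    (0, 0, 100, 100, 0.90),      # strong detection
    (5, 5, 105, 105, 0.60),      # heavy overlap with the first box (IoU ~0.82)
    (200, 200, 300, 300, 0.30),  # separate object, moderate confidence
    (400, 400, 450, 450, 0.20),  # below the 0.25 confidence threshold
]
print(simple_nms(dets))
```

With the 0.25 / 0.45 defaults only the first and third boxes survive. Lowering the confidence threshold to 0.2 (the iDetection value mentioned above) also lets the fourth box through.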

NanoCode012 · 2020-09-27T16:52:28Z

Hello @glenn-jocher, thanks for the clarification on the IoU and confidence thresholds as well as the update on batch inference.

aniltolwani · 2020-09-29T08:04:22Z

Will these changes affect standard video inference (detect.py) at all? It would be great to be able to achieve similar inference times to test.py using batched predictions with video (right now I'm getting 0.014 s per frame).

glenn-jocher · 2020-10-04T17:43:11Z

/rebase

glenn-jocher · 2020-10-05T13:51:14Z

@aniltolwani yes, you can use this for batched video inference.

The model() here accepts a list of images, so you would construct the batch yourself: read perhaps 16 frames from a cv2 video capture object, place them in a list, pass the list for batched inference, and repeat for the duration of the video.
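
A rough sketch of that batching loop is below. The helper name `iter_frame_batches` and the stand-in reader are hypothetical; the only assumption about OpenCV is that `cv2.VideoCapture.read()` returns an `(ok, frame)` pair, which is why the helper takes any callable with that convention.

```python
def iter_frame_batches(read_frame, batch_size=16):
    """Yield lists of up to `batch_size` frames.

    read_frame: zero-argument callable returning (ok, frame), the same
                convention as cv2.VideoCapture.read().
    """
    batch = []
    while True:
        ok, frame = read_frame()
        if not ok:  # end of stream
            break
        batch.append(frame)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # final, possibly shorter batch
        yield batch


# Stand-in reader so the sketch runs without a video file: 40 dummy "frames".
_frames = iter(range(40))


def fake_read():
    frame = next(_frames, None)
    return (frame is not None), frame


print([len(b) for b in iter_frame_batches(fake_read, batch_size=16)])  # [16, 16, 8]
```

With a real video you would pass `cap.read` from a `cv2.VideoCapture` object instead of `fake_read`, convert each frame from BGR to RGB (for example with `frame[:, :, ::-1]`), and hand each yielded list to model().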

/rebase

glenn-jocher · 2020-10-06T13:35:02Z

I think I'm going to expand this concept to be a bit more ambitious. If I can make this autoshape wrapper handle the current input format as well, which is just BCHW torch data, then this would allow the model to accept nearly all commonly used input formats: cv2 image, numpy image, PIL image, list of images (for batched inference), and PyTorch input (already shaped/letterboxed). I think this would cover the great majority of use cases.

glenn-jocher · 2020-10-10T12:09:14Z

/rebase

glenn-jocher · 2020-10-10T14:12:10Z

PR is updated with torch input functionality now, so it can optionally behave identically to the current model. The autoShape wrapper comments now show all the input options as well, which I think cover the vast majority of pytorch inference use cases:

    def forward(self, x, shape=640, augment=False, profile=False):
        # supports inference from various sources. For height=720, width=1280, RGB images example inputs are:
        #   opencv:     x = cv2.imread('image.jpg')[:,:,::-1]  # HWC BGR to RGB x(720,1280,3)
        #   PIL:        x = Image.open('image.jpg')  # HWC x(720,1280,3)
        #   numpy:      x = np.zeros((720,1280,3))  # HWC
        #   torch:      x = torch.zeros(16,3,720,1280)  # BCHW
        #   multiple:   x = [Image.open('image1.jpg'), Image.open('image2.jpg'), ...]  # list of images

Test script:

import cv2
import numpy as np
from PIL import Image, ImageDraw

from models.experimental import attempt_load

# Model
# model = torch.hub.load('ultralytics/yolov5', 'yolov5s', pretrained=True)
model = attempt_load('yolov5s.pt')
model = model.autoshape()  # < ------------------ add autoshape() wrapper

# Image
img1 = Image.open('inference/images/zidane.jpg')  # PIL
img2 = cv2.imread('inference/images/zidane.jpg')[:, :, ::-1]  # opencv (BGR to RGB)
img3 = np.zeros((640, 1280, 3))  # numpy
imgs = [img1, img2, img3]

# Inference
prediction = model(imgs, shape=640)  # includes NMS; keyword matches the forward() signature shown above

# Plot
for i, img in enumerate(imgs):
    print('\nImage %g/%g: %s ' % (i + 1, len(imgs), img.shape), end='')
    img = Image.fromarray(img.astype(np.uint8)) if isinstance(img, np.ndarray) else img  # from np
    if prediction[i] is not None:  # is not None
        for *box, conf, cls in prediction[i]:  # [xy1, xy2], confidence, class
            print('%s %.2f, ' % (model.names[int(cls)], conf), end='')  # label
            ImageDraw.Draw(img).rectangle(box, width=3)  # plot
    img.save('results%g.jpg' % i)  # save

Test script output for batched inference at img size 640:

Fusing layers... 
Adding autoShape... 

Image 1/3: (720, 1280, 3) person 0.87, person 0.80, tie 0.78, tie 0.28, 
Image 2/3: (720, 1280, 3) person 0.87, person 0.80, tie 0.78, tie 0.28, 
Image 3/3: (640, 1280, 3)

img size checks are warnings rather than errors, so current implementation allows improperly formed model inputs.

glenn-jocher · 2020-10-14T21:04:10Z

/rebase

* fix/hyper * Hyp giou check to train.py * restore general.py * train.py overwrite fix * restore general.py and pep8 update Co-authored-by: Glenn Jocher <glenn.jocher@ultralytics.com>

glenn-jocher · 2020-10-15T13:07:11Z

/rebase

* comment * fix parsing * fix evolve * folder * tqdm * Update train.py * Update train.py * reinstate anchors into meta dict anchor evolution is working correctly now * reinstate logger prefer the single line readout for concise logging, which helps simplify notebook and tutorials etc. Co-authored-by: Glenn Jocher <glenn.jocher@ultralytics.com>

glenn-jocher · 2020-10-15T17:40:12Z

This PR is rebased, updates complete, CI passing, merging.

glenn-jocher · 2020-10-15T17:40:43Z

/rebase

glenn-jocher · 2020-10-15T18:04:42Z

Ok, somehow I managed to get this PR stuck in a rebase cycle. Closing and starting from scratch at #1153

initial commit

2be4a8c

glenn-jocher self-assigned this Sep 25, 2020

batch inference update

c9ef269

github-actions bot force-pushed the simple_inference branch from c9ef269 to e2bf1e1 Compare October 4, 2020 17:43

github-actions bot force-pushed the simple_inference branch from e2bf1e1 to 50974db Compare October 5, 2020 13:51

glenn-jocher added 2 commits October 10, 2020 12:09

initial commit

c599075

batch inference update

68c78a0

github-actions bot force-pushed the simple_inference branch from 50974db to 68c78a0 Compare October 10, 2020 12:09

glenn-jocher added 4 commits October 10, 2020 14:37

Merge remote-tracking branch 'origin/simple_inference' into simple_in…

c8141ba

…ference # Conflicts: # models/yolo.py

add torch capability

623c568

empty image bug fix

097aca2

comment update

9896ce0

glenn-jocher added 7 commits October 10, 2020 16:56

extract NMS to allow for augment

11ea358

update NMS thresholds to CoreML defaults

82e865a

fuse() bug fix

c2403d7

Update requirements.txt coremltools==4.0

d87cf7e

Rearrange export input after checks (#1118)

d45e349

img size checks are warnings rather than errors, so current implementation allows improperly formed model inputs.

FROM nvcr.io/nvidia/pytorch:20.09-py3

10c85bf

Generalized regression criterion renaming (#1120)

0ada058

Minor import and spelling updates (#1133)

4d3680c

fix compatibility for hyper config (#1146)

c67e722

* fix/hyper * Hyp giou check to train.py * restore general.py * train.py overwrite fix * restore general.py and pep8 update Co-authored-by: Glenn Jocher <glenn.jocher@ultralytics.com>

glenn-jocher and others added 11 commits October 15, 2020 18:48

fuse() bug fix

fe1d90a

Update requirements.txt coremltools==4.0

70432a5

Rearrange export input after checks (#1118)

e63bf4d

img size checks are warnings rather than errors, so current implementation allows improperly formed model inputs.

FROM nvcr.io/nvidia/pytorch:20.09-py3

bfa2f89

Generalized regression criterion renaming (#1120)

402095a

Dataset download bash script updates (#1132)

6088171

Minor import and spelling updates (#1133)

330bdfb

fix compatibility for hyper config (#1146)

d7e6f4d

* fix/hyper * Hyp giou check to train.py * restore general.py * train.py overwrite fix * restore general.py and pep8 update Co-authored-by: Glenn Jocher <glenn.jocher@ultralytics.com>

update copied attributes

34282b0

optimize imports

8668c2b

glenn-jocher added 9 commits October 15, 2020 19:50

initial commit

91a029a

batch inference update

a34b35b

initial commit

a9db87b

comment update

37a07a2

extract NMS to allow for augment

5159d10

update NMS thresholds to CoreML defaults

0be772e

update copied attributes

8144436

optimize imports

dc53110

Merge remote-tracking branch 'origin/simple_inference' into simple_in…

efa5e3f

…ference

glenn-jocher mentioned this pull request Oct 15, 2020

Simplified Inference #1153

Merged

glenn-jocher closed this Oct 15, 2020

glenn-jocher deleted the simple_inference branch October 15, 2020 18:11
