Segment-Any-Video
Introduction

  • The Segment Anything Model (SAM) proposed by Meta AI (Facebook) has had a great influence on computer vision, since segmentation is a fundamental step in many tasks such as edge detection, face recognition, and autonomous driving. However, SAM has some weaknesses: (1) it cannot return semantic information about the regions it segments, (2) in some cases a single instance (e.g. a car) may be split into several parts, and (3) the model cannot process video data.
  • In this repository, we implement a segmentation and tracking method using YOLOv8 and SAM that addresses these weaknesses. We name this method Segment Any Video (SAV).
  • In seg.py, segmentation is implemented by feeding the boxes from the YOLOv8 detector to SAM as prompts, so each returned mask is paired with the detector's class label; masks without semantic information are also returned. This is the biggest difference from SAM. In track.py, we modified the code from ultralytics/tracker/track.py, which supports ByteTrack and BoT-SORT, and then apply instance segmentation to all frames.
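The core idea in seg.py — run a detector, then prompt the segmenter with each detected box so every mask keeps its class label — can be sketched as below. This is a minimal numpy-only mock: fake_detect and fake_segment are hypothetical stand-ins for the real YOLOv8 and SAM predictor calls, not names from this repository.

```python
import numpy as np

def masks_with_labels(image, detect, segment):
    """Detect (box, label) pairs, then prompt the segmenter with each
    box and pair the resulting mask with the detector's label."""
    results = []
    for box, label in detect(image):
        mask = segment(image, box)  # one binary mask per box prompt
        results.append({"label": label, "box": box, "mask": mask})
    return results

# --- toy stand-ins for YOLOv8 and SAM (assumptions for this sketch) ---
def fake_detect(image):
    # pretend the detector found one car covering the left half
    h, w = image.shape[:2]
    return [((0, 0, w // 2, h), "car")]

def fake_segment(image, box):
    # pretend the segmenter returned a mask filling the prompt box
    x0, y0, x1, y1 = box
    mask = np.zeros(image.shape[:2], dtype=bool)
    mask[y0:y1, x0:x1] = True
    return mask

image = np.zeros((4, 8, 3), dtype=np.uint8)
out = masks_with_labels(image, fake_detect, fake_segment)
print(out[0]["label"], int(out[0]["mask"].sum()))  # car 16
```

In the real pipeline the detector and segmenter are the YOLOv8 model and SAM's box-prompted predictor; the structure of the loop is the same.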

Installation

pip install ultralytics
pip install git+https://github.com/facebookresearch/segment-anything.git

Model Checkpoints

Usage

python seg.py --img_path TestImages --save_dir SegOut --sam_checkpoint model/sam_vit_h_4b8939.pth --yolo_checkpoint model/yolov8x.pt

or

python track.py --video_path video.mp4 --save_path video_test.mp4 --sam_checkpoint model/sam_vit_h_4b8939.pth --yolo_checkpoint model/yolov8x.pt --imgsz 1920
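The flow behind the track.py command — per frame, track detections so boxes carry persistent IDs, then segment each tracked box — can be mocked as below. The Tracker logic here is a toy that assigns IDs by detection order; the real script delegates association to ByteTrack/BoT-SORT, and detect/segment are hypothetical stand-ins.

```python
def track_and_segment(frames, detect, segment):
    """Per frame: detect boxes, assign persistent IDs by detection
    order (toy stand-in for ByteTrack), then mask each tracked box."""
    next_id = 0
    id_by_slot = {}  # detection slot -> persistent track id (toy association)
    out = []
    for frame in frames:
        frame_out = []
        for slot, box in enumerate(detect(frame)):
            if slot not in id_by_slot:
                id_by_slot[slot] = next_id
                next_id += 1
            frame_out.append({"id": id_by_slot[slot], "box": box,
                              "mask": segment(frame, box)})
        out.append(frame_out)
    return out

# toy detector/segmenter: one static box; "mask" is just the box echoed back
frames = [0, 1, 2]
detect = lambda frame: [(0, 0, 2, 2)]
segment = lambda frame, box: box
result = track_and_segment(frames, detect, segment)
print([f[0]["id"] for f in result])  # [0, 0, 0]
```

The point of the sketch is the ID persistence across frames: the same object keeps the same track ID, so its mask can be followed through the whole video.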

Image Segment Results

[input / segmentation result images]
We can see from the above results that our method segments the bus, car, and train each as one intact, semantically labeled object, while SAM splits them into different parts.

Video Track Result

[four segment-and-track demo clips]

Demo

  • Our online demo is here.
  • Note: since video segmentation is time-consuming, we did not integrate the tracking method into the online demo. If you are interested, you can clone this repository and run it on your own GPU machine.

TODO

  • Train YOLOv8 models on the Objects365 dataset
    • YOLOv8m.pt extract code: 65ge
    • YOLOv8n.pt
    • YOLOv8s.pt
    • YOLOv8l.pt
    • YOLOv8x.pt

License

Contact
