Stars
Nanopore RNA-Seq data from the Singapore Nanopore-Expression Project
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Amazon EC2 instance comparison site
High-resolution models for human tasks.
Run Segment Anything Model 2 on a live video stream
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
This project demonstrates how to build a Python Flask API that connects to Google Calendar API. Users can connect their Google calendar with this service, store user tokens in the database, and ret…
Ridiculously simple landmark annotation tool
Refine high-quality datasets and visual AI models
A list of tools for annotating data, managing annotations, etc.
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of YOLO-NAS.
A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Chrome extension for free access to Studocu premium
A curated list of awesome computer vision resources
A curated list of recent monocular depth estimation papers
Cross-platform, customizable ML solutions for live and streaming media.
SignLanguage is a platform where users can learn American Sign Language hands-on using machine learning and access videos for over 20,000 ASL phrases.
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generator using MULTI LLMs, with no GPU needed. The user can ask a question and the system will make a mu…
pix2tex: Using a ViT to convert images of equations into LaTeX code.