Stars
⭐⭐⭐FightingCV Paper Reading, which helps you understand the most advanced research work in an easier way 🍀 🍀 🍀
High-resolution models for human tasks.
[NeurIPS 2023 Track Datasets and Benchmarks] OpenLane-V2: The First Perception and Reasoning Benchmark for Road Driving
A community-maintained Python framework for creating mathematical animations.
[AI Agent Application Development Framework] - 🚀 Build AI-agent-native applications with very little code 💬 Easily interact with AI agents in code using structured data and chained-call syntax 🧩 Enhance …
[IEEE T-PAMI] Awesome BEV perception research and cookbook for audiences of all levels in autonomous driving
The most comprehensive Stable Diffusion tutorial series on the web, from beginner to advanced, three months in the making
Open-Sora: Democratizing Efficient Video Production for All
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
A collection of papers on transformers for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
An extremely simple tool for separating vocals and background music, operated through a local web page with no internet connection required, using 2stems/4stems/5stems models
Streamlit — A faster way to build and share data apps.
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
AI-based tool for removing hard-coded subtitles and text watermarks from images and videos, producing output files at the original resolution. Runs locally; no third-party API required.
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
A curated list of awesome research papers, projects, code, datasets, workshops, etc. related to virtual try-on.
[ACM Multimedia 2023] Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow.
Two weeks of homemade scripting language notes and source code. If you think it's good, give me a star.
PytorchAutoDrive: Segmentation models (ERFNet, ENet, DeepLab, FCN...) and Lane detection models (SCNN, RESA, LSTR, LaneATT, BézierLaneNet...) based on PyTorch with fast training, visualization, ben…
A natural language interface for computers
DiffusionFastForward: a free course and experimental framework for diffusion-based generative models
A collection of resources and papers on Diffusion Models