Stars
The official gpt4free repository | various collection of powerful language models
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
High-Resolution Image Synthesis with Latent Diffusion Models
Faker is a Python package that generates fake data for you.
Python 开源项目之「自学编程之路」,保姆级教程:AI实验室、宝藏视频、数据结构、学习指南、机器学习实战、深度学习实战、网络爬虫、大厂面经、程序人生、资源分享。
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
Open-source cron job and background task monitoring service, written in Python & Django
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
OpenMMLab Pose Estimation Toolbox and Benchmark.
Count the MACs / FLOPs of your PyTorch model.
🚩 自动更新域名解析到本机IP(支持dnspod,阿里DNS,CloudFlare,华为云,DNSCOM...)
Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet…
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
Awesome work on hand pose estimation/tracking
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
Automatic architecture search and hyperparameter optimization for PyTorch
Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral
This is an official implementation for "Video Swin Transformers".
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation.
[CVPR 2021] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
ROLO is short for Recurrent YOLO, aimed at simultaneous object detection and tracking
[ICLR2022] official implementation of UniFormer
[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers
Reference implementation of a two-level RCN model
[CVPR'22 Oral] GMFlow: Learning Optical Flow via Global Matching