Starred repositories
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition of 80+ languages, provides data annotation and synthesis tools, supports training and…
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Practice on CIFAR-100 (ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext, ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet…
Convolutional recurrent network in PyTorch
Generate text images for training deep learning OCR models
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
A synthetic data generator for text recognition
Llama 3 implementation, one matrix multiplication at a time
Start building LLM-empowered multi-agent applications in an easier way.
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
A curated list of awesome research papers, projects, code, datasets, workshops, etc. related to virtual try-on.
[WACV'25] StreetTryOn: A Benchmark for In-the-Wild Virtual Try-On and Cross-Domain Virtual Try-On
FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.
A PyTorch implementation of EfficientNet
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Papers 'Transformer based Pluralistic Image Completion with Reduced Information Loss' (TPAMI 2024) and 'Reduce Information Loss in Transformers for Pluralistic Image Inpainting' (CVPR 2022)
A curated list of image inpainting and video inpainting papers and resources
Inpaint anything using Segment Anything and inpainting models.
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Deep Learning-based Image Fusion: A Survey
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS 2023 Spotlight, TPAMI 2024)
An out-of-the-box human parsing representation extractor.
PyTorch implementation of OpenPose, including hand and body pose estimation.