Shanghai Jiao Tong University
- Shanghai
Stars
Tensors and Dynamic neural networks in Python with strong GPU acceleration
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
Image augmentation for machine learning experiments.
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
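JD.com flash-sale purchase assistant: includes login, querying product stock/price, adding to/clearing the shopping cart, flash-purchasing products (placing orders), querying orders, and other functions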
📷 EasyPhoto | Your Smart AI Photo Generator.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
Official implementation of Character Region Awareness for Text Detection (CRAFT)
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
Nvidia Semantic Segmentation monorepo
Set of methods to ensemble boxes from different object detection models, including implementation of "Weighted boxes fusion (WBF)" method.
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
UPSNet: A Unified Panoptic Segmentation Network
A fast medical imaging analysis library in Python with algorithms for registration, segmentation, and more.
CharNet: Convolutional Character Networks
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
A PyTorch Implementation of Neural IMage Assessment
This repository contains datasets and baselines for benchmarking Chinese text recognition.
A PyTorch implementation of Mask TextSpotter
This repository is based on Ultralytics/yolov5, with adjustments to enable polygon prediction boxes.