-
Shanghai Jiao Tong University
- Shanghai
Highlights
- Pro
Stars
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
📷 EasyPhoto | Your Smart AI Photo Generator.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
This repository contains datasets and baselines for benchmarking Chinese text recognition.
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
Javascript/WebGL lightweight face tracking library designed for augmented reality webcam filters. Features : multiple faces detection, rotation, mouth opening. Various integration examples are prov…
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier
Collection of NSFW images URLs for the purposes of training an NSFW Image Classifier
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
This repository is based on Ultralytics/yolov5, with adjustments to enable polygon prediction boxes.
Nvidia Semantic Segmentation monorepo
Project for Digital Image Processing
DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
ncnn is a high-performance neural network inference framework optimized for the mobile platform
A PyTorch Implementation of Neural IMage Assessment
Automagically generate thumbnails, animated GIFs, and summaries from videos
Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks
Set of methods to ensemble boxes from different object detection models, including implementation of "Weighted boxes fusion (WBF)" method.
Image augmentation for machine learning experiments.
[ICCV 2019] Monocular depth estimation from a single image
京东抢购助手:包含登录,查询商品库存/价格,添加/清空购物车,抢购商品(下单),查询订单等功能