Stars
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
(2020-2022) The PyTorch version of SiamFC, SiamRPN, DaSiamRPN, UpdateNet, SiamDW, SiamRPN++, SiamMask, SiamFC++, SiamCAR, SiamBAN, Ocean, LightTrack, TrTr, NanoTrack; Visual object tracking based on…
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
A generative speech model for daily dialogue.
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Accessible large language models via k-bit quantization for PyTorch.
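An online image annotation tool (rectangles, polygons; continuously updated) that can be used to train deep learning instance segmentation models such as Mask R-CNN.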
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Rembg is a tool to remove image backgrounds.
Tesseract Open Source OCR Engine (main repository)
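Provides a practical interactive interface for GPT, GLM, and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…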
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…
"Hello Algo" (《Hello 算法》): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports code in Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in progress.
Share interesting, entry-level open source projects on GitHub.
Foundational Models for State-of-the-Art Speech and Text Translation
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
C++ and Python implementation of YOLOv9 using the TensorRT API
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
One minute of voice data can be enough to train a good TTS model! (few-shot voice cloning)
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image genera…