Stars
Typer, build great CLIs. Easy to code. Based on Python type hints.
Aerial Gym Simulator - Isaac Gym Simulator for Aerial Robots
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Convert JSON annotations into YOLO format.
Unreal Engine 5 Guide. Learn to develop games for Windows, Linux, macOS, iOS, Android, Xbox Series X|S, PlayStation 5, Nintendo Switch.
UnrealCV: Connecting Computer Vision to Unreal Engine
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
DSPy: The framework for programming—not prompting—foundation models
Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
ONNX-compatible Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
Code to easily try 30 (and growing) different image matching methods
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.
Real-time and accurate open-vocabulary end-to-end object detection
Code release for CVPR'24 submission 'OmniGlue'
Whitebox AES implementation in C++. Chow, Karroumi.
[CVPR 2023] BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
Unified framework for building enterprise RAG pipelines with small, specialized models
A multi-voice TTS system trained with an emphasis on quality
CLI11 is a command line parser for C++11 and beyond that provides a rich feature set with a simple and intuitive interface.
A super easy to use map tiles downloader built using Python
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
User-friendly WebUI for LLMs (Formerly Ollama WebUI)