Stars
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
Vim-fork focused on extensibility and usability
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
OpenAI Whisper ASR Webservice API
The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
ML-powered speech recognition directly in your browser
📜 A minimalist personal website embodying the purity of paper and freshness of snow.
AirLLM 70B inference with single 4GB GPU
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
A unified interface for multiple Text-to-Speech (TTS) providers.
The free and privacy-friendly screen recorder with no limits 🎥
fatwang2 / dify2openai
Forked from 0w0z/Dify2OpenaiApiTurn Dify into OpenAI
Utilize the unlimited free GPT-3.5-Turbo API service provided by the login-free ChatGPT Web.
ONNX implementation of Whisper. PyTorch free.
A Gradio web UI for Large Language Models.
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.