Highlights
- Pro
Starred repositories
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A collection of prompts, system prompts and LLM instructions
Real time transcription with OpenAI Whisper.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
Fast and memory-efficient exact attention
A generative speech model for daily dialogue.
Easiest 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, a…
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
SoftVC VITS Singing Voice Conversion
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
Using OpenAI's Whisper to automatically generate YouTube subtitles
LLM prompts, llama3 prompts, llama2 prompts
LLM plugin providing access to local Ollama models using HTTP API
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models
A tweak to get Spotify Premium for free, just like Spotilife
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).
aider is AI pair programming in your terminal
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.