Block or Report
Block or report kosmasCogninn
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Real time transcription with OpenAI Whisper.
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
A Web UI for easy subtitle using whisper model.
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
Faster Whisper transcription with CTranslate2
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
OpenAI Whisper ASR Webservice API
A nearly-live implementation of OpenAI's Whisper.
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references und…
All the resources you need to get to Senior Engineer and beyond
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Learn how to design systems at scale and prepare for system design interviews
Everything you need to know to get the job.
The official evaluation suite and dynamic data release for MixEval.
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Port of OpenAI's Whisper model in C/C++
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
Large Action Model framework to develop AI Web Agents
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
SDK for creating whiteboards and canvas experiences on the web.
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Video+code lecture on building nanoGPT from scratch
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.