Starred repositories
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
The official Pixel Streaming servers and frontend.
VILA - a multi-image visual language model with training, inference and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops)
A small tutorial repository on capturing images with semantic annotation from Unreal Engine to disk.
A hacker's AI voice assistant, built using Python and PyTorch.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
A playbook for systematically maximizing the performance of deep learning models.
Papers from the computer science community to read and discuss.
An opinionated list of awesome Python frameworks, libraries, software and resources.
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
Investment Research for Everyone, Everywhere.
The world's simplest facial recognition API for Python and the command line
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
A system for connecting players across Steam
A complete multiplayer shooter project. This project accompanies the Udemy course that teaches how to create it.
Best Practices, code samples, and documentation for Computer Vision.
10 Weeks, 20 Lessons, Data Science for All!
12 Weeks, 24 Lessons, AI for All!
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Control an AirSim drone/multicopter with your keyboard!
📚 A Collection of Free & Open Resources for University Coursework in Computer Science.
Solutions to all questions of the book Introduction to the Theory of Computation, 3rd edition by Michael Sipser
Framework and Language for Neurosymbolic Programming. Join Our Discord: https://discord.gg/RavzdND229
A simple terminal controller for the AirSim Python API.
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets, helping you improve the quality of your dataset images & labels and reduce your data o…