- Vancouver, Canada
Highlights
- Pro
Block or Report
Block or report Robinysh
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Awesome speech/audio LLMs, representation learning, and codec models
UI Library for Design Engineers. Animated components and effects you can copy and paste into your apps. Free. Open Source.
Minimalist developer portfolio using Next.js 14, React, TailwindCSS, Shadcn UI and Magic UI
Official implement of paper "AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation"
aider is AI pair programming in your terminal
Tools for handling speech data in machine learning projects.
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
Incredibly fast Whisper-large-v3
Large Action Model framework to develop AI Web Agents
An extremely fast Python package installer and resolver, written in Rust.
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
scalable and robust tree-based speculative decoding algorithm
The PyTorch-based audio source separation toolkit for researchers
PygmalionAI's large-scale inference engine
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 30.67% tasks (pass@1) in SWE-bench lite with each task costs less than $0.7.
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
List of books, blogs, newsletters and people!
Automate browser-based workflows with LLMs and Computer Vision
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf