🤖 Build voice-based LLM agents. Modular + open source.

Python 2,564 432 Updated Jul 26, 2024

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Python 772 59 Updated Jun 27, 2024

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 380 32 Updated Jun 9, 2024

YOLOv10: Real-Time End-to-End Object Detection

Python 8,552 747 Updated Jul 18, 2024

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Svelte 33,079 3,671 Updated Jul 27, 2024

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

16,806 2,206 Updated Jul 23, 2024

Official implementation of Half-Quadratic Quantization (HQQ)

Python 581 55 Updated Jul 21, 2024

Fast inference engine for Transformer models

C++ 3,095 274 Updated Jul 26, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 13,921 1,253 Updated Jul 26, 2024

🦀 Small exercises to get you used to reading and writing Rust code!

Rust 51,588 9,921 Updated Jul 25, 2024

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 4,126 508 Updated Jul 6, 2024
Python 265 30 Updated Jul 5, 2024

Official Implementation of EnCLAP (ICASSP 2024)

Python 88 4 Updated Jun 2, 2024

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

Jupyter Notebook 254 21 Updated Jul 9, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 3,490 362 Updated Jul 21, 2024

Real-time stream processing for python

Python 1,230 146 Updated Jun 18, 2024

Split audio based on Pyannote's VAD

Python 3 Updated Apr 3, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,270 714 Updated Jun 24, 2024

NOMAD is a fully unsupervised non-matching reference audio quality metric

Python 22 1 Updated May 27, 2024

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

TypeScript 24,890 4,217 Updated Jul 25, 2024

a pptx to markdown converter

Python 467 73 Updated May 3, 2024

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,169 479 Updated Jun 18, 2024
Python 1 Updated Feb 23, 2024
Jupyter Notebook 82 7 Updated Jun 27, 2024

Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"

Python 113 2 Updated May 8, 2024

Reference-aware automatic speech evaluation toolkit

Python 80 5 Updated Feb 22, 2024

ESPnet Model Zoo

Python 243 41 Updated Jul 9, 2023