LLM-based TTS model providing full-stack inference, training, and deployment capabilities.

Python 252 26 Updated Jul 5, 2024

Multilingual Voice Understanding Model

Python 221 20 Updated Jul 5, 2024

Material for cuda-mode lectures

Jupyter Notebook 1,607 153 Updated Jun 13, 2024

screen sharing for developers https://screego.net/

Go 7,209 517 Updated Jun 23, 2024

noise reduction

Python 14 3 Updated Jul 3, 2024

Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.

Jupyter Notebook 9,223 2,761 Updated Jul 2, 2024

The official repository of the paper "(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts"

481 17 Updated Jul 5, 2024

An extremely fast Python package installer and resolver, written in Rust.

Rust 14,631 414 Updated Jul 5, 2024

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

Python 391 46 Updated Jul 1, 2024

An open-source package providing standardized tools for sound event analysis and data management.

Python 16 1 Updated Jun 6, 2024

A library built for easier audio self-supervised training and downstream task evaluation

Python 73 8 Updated Apr 18, 2024

Using GPT to parse PDF

Python 1,917 124 Updated Jul 4, 2024

Segment a given audio into utterances using a trained end-to-end ASR model.

Python 73 9 Updated Oct 9, 2020

[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

Python 78 3 Updated Jun 17, 2024

This repository contains the Python implementation of a Sound Event Detection system working in real time.

Python 37 5 Updated Oct 10, 2022

Everyone can use English

TypeScript 21,378 3,418 Updated Jul 4, 2024

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 865 151 Updated Jul 5, 2023

Golang distributed systems study, from the bilibili video "Writing a Simple Distributed System in Go"

Go 2 Updated Jun 6, 2024

Speech Algorithms

C 727 240 Updated Jun 9, 2024
C++ 2,579 342 Updated Jul 5, 2024

🔍 An open-source internet search engine developed in Go, with the accompanying tutorial "Build Your Own Internet Search Engine"

Go 501 75 Updated Jun 4, 2024

ONNX-TensorRT: TensorRT backend for ONNX

C++ 2,838 538 Updated Jun 18, 2024

All Algorithms implemented in Rust

Rust 21,508 2,095 Updated Jul 4, 2024

LLM101n: Let's build a Storyteller

13,984 652 Updated Jun 28, 2024

Mac app for crushing remote tech interviews with AI

Swift 3,974 285 Updated Aug 9, 2023

Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark

Python 9 Updated Jun 27, 2024

Open source real-time translation app for Android that runs locally

C++ 5,354 402 Updated Jul 4, 2024

Forced alignment of Paraformer's output tokens with the encoder's alpha coefficients

Python 5 Updated May 30, 2024

🦅 A Go framework for the API or Microservice

Go 1,984 224 Updated Jun 22, 2024