Skip to content
View zge's full-sized avatar

Block or report zge

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
18 results for source starred repositories
Clear filter

Robust Speech Recognition via Large-Scale Weak Supervision

Python 67,841 8,008 Updated Sep 10, 2024

A resource for learning about Machine learning & Deep Learning

Python 7,474 2,675 Updated Aug 17, 2024

Trax — Deep Learning with Clear Code and Speed

Python 8,047 813 Updated Sep 10, 2024

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,324 1,334 Updated Jan 20, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,563 2,423 Updated Sep 22, 2024

CommonMark spec, with reference implementations in C and JavaScript

Python 4,872 314 Updated Sep 13, 2024

Source code complementing our paper for acoustic event classification using convolutional neural networks.

Python 65 28 Updated Jan 31, 2021

Grapheme to phoneme conversion with deep learning.

Python 349 38 Updated Dec 8, 2023

Audio super resolution using neural networks

Python 1,158 205 Updated Oct 24, 2023

The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.

Python 124 19 Updated Sep 7, 2021

End-to-End Neural Diarization

Python 367 57 Updated Aug 30, 2021

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,572 224 Updated Sep 20, 2024

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Python 327 61 Updated Jul 21, 2024

End-to-End Speech Processing Toolkit

Python 8,317 2,157 Updated Sep 21, 2024

Large, modern dataset for speech recognition

Shell 631 62 Updated Feb 26, 2024

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,216 480 Updated Sep 9, 2024

A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and displaying Matplotlib images easier with OpenCV and Python.

Python 4,529 1,025 Updated Jun 24, 2024

Trainable algorithm for accurate force alignment

Rust 5 Updated Oct 19, 2015