zge

Zhenhao Ge zge

4 followers · 1 following

United States

Stars

18 results for source starred repositories

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 67,841 8,008 Updated Sep 10, 2024

aladdinpersson / Machine-Learning-Collection

A resource for learning about Machine learning & Deep Learning

Python 7,474 2,675 Updated Aug 17, 2024

google / trax

Trax — Deep Learning with Clear Code and Speed

Python 8,047 813 Updated Sep 10, 2024

bentrevett / pytorch-seq2seq

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,324 1,334 Updated Jan 20, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,563 2,423 Updated Sep 22, 2024

commonmark / commonmark-spec

CommonMark spec, with reference implementations in C and JavaScript

Python 4,872 314 Updated Sep 13, 2024

kahst / AcousticEventDetection

Source code complementing our paper for acoustic event classification using convolutional neural networks.

Python 65 28 Updated Jan 31, 2021

as-ideas / DeepPhonemizer

Grapheme to phoneme conversion with deep learning.

Python 349 38 Updated Dec 8, 2023

kuleshov / audio-super-res

Audio super resolution using neural networks

Python 1,158 205 Updated Oct 24, 2023

zkx06111 / WSRGlow

The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.

Python 124 19 Updated Sep 7, 2021

hitachi-speech / EEND

End-to-End Neural Diarization

Python 367 57 Updated Aug 30, 2021

wq2012 / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,572 224 Updated Sep 20, 2024

lochenchou / MOSNet

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Python 327 61 Updated Jul 21, 2024

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,317 2,157 Updated Sep 21, 2024

SpeechColab / GigaSpeech

Large, modern dataset for speech recognition

Shell 631 62 Updated Feb 26, 2024

s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,216 480 Updated Sep 9, 2024

PyImageSearch / imutils

A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and displaying Matplotlib images easier with OpenCV and Python.

Python 4,529 1,025 Updated Jun 24, 2024

MLSpeech / AutoAligner

Trainable algorithm for accurate force alignment

Rust 5 Updated Oct 19, 2015
