Stars
Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
A fast and lightweight python-based CTC beam search decoder for speech recognition.
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
Punctuation Restoration using Transformer Models for High-and Low-Resource Languages
Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
This repository provides code for machine learning algorithms for edge devices developed at Microsoft Research India.
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
Training and Detecting Objects with YOLO3
A collection of design patterns/idioms in Python