Stars
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Speech Enhancement Generative Adversarial Network in TensorFlow
speech enhancement\speech seperation\sound source localization
real time face swap and one-click video deepfake with only a single image
A statistical model-based Speech Enhancement Using MMSE-STSA
Ai-Sherry / Sixty-years-of-frequency-domain-monaural-speech-enhancement
Forked from cszheng-ioa/Sixty-years-of-frequency-domain-monaural-speech-enhancementA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
A generative speech model for daily dialogue.
使用 spleeter 将视频中的人声提取出来(去除背景音),再对视频中的声音进行分析,分成静音部分和非静音部分,分别施加不同的速度,最后合成到一个新视频。
城市声音分类 Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN
基于Tensorflow实现声音分类,博客地址:
使用Tensorflow实现声纹识别
A Deep LSTM-CNN-HMM Neural Network system for Speaker Identification