Skip to content

Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speech enhancement with this repository! Don't forget to ⭐ if you find it helpful.

License

Notifications You must be signed in to change notification settings

DmitryRyumin/Awesome-Speech-Enhancement

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Awesome-Speech-Enhancement

General Information Awesome Version License: MIT
Repository Size and Activity GitHub repo size GitHub commit activity (branch)
Contribution Statistics GitHub contributors GitHub closed issues GitHub issues GitHub closed pull requests GitHub pull requests
Other Metrics GitHub last commit GitHub watchers GitHub forks GitHub Repo stars Visitors
Application App

Contributors



Note

Contributions to improve the completeness of this list are greatly appreciated. If you come across any overlooked papers, please feel free to create pull requests, open issues or contact me via email. Your participation is crucial to making this repository even better.


List of sections

Papers

Conferences

Conference
Title Repo Paper
Diffusion-based Speech Enhancement in Matched and Mismatched Conditions using a Heun-based Sampler IEEE Xplore
arXiv
Unsupervised Speech Enhancement with Diffusion-based Generative Models WEB Page
GitHub
IEEE Xplore
arXiv
Boosting Speech Enhancement with Clean Self-Supervised Features via Conditional Variational Autoencoders GitHub IEEE Xplore
Diffusion-based Speech Enhancement with a Weighted Generative-Supervised Learning Loss GitHub IEEE Xplore
arXiv
AV2WAV: Diffusion-based Re-Synthesis from Continuous Self-Supervised Features for Audio-Visual Speech Enhancement WEB Page IEEE Xplore
arXiv
A Closer Look at Wav2vec2 Embeddings for On-Device Single-Channel Speech Enhancement IEEE Xplore
arXiv
GTCRN: A Speech Enhancement Model Requiring Ultralow Computational Resources GitHub IEEE Xplore

Journals

Journal
Title Repo Paper
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation Year WEB Page
GitHub
IEEE Xplore
arXiv

Code

About Repo
A Training Code Template for DNN-based Speech Enhancement GitHub

Datasets

Title Homepage Paper
VoiceBank + DEMAND Homepage WEB Page

Performance Measures

SNR

Code for SNR Calculation Examples

Note

It is important to understand that these ranges are approximate and can change depending on the specific application and quality requirements for speech enhancement. In addition, the perception of sound quality can be subjective and influenced by individual user preferences. In the context of the table provided, higher SNR values indicate better signal quality with less noise or interference.

SNR Range Description
Very Low SNR (<0 dB) The noise power exceeds the signal power. The signal is almost entirely masked by noise, making recovery or processing extremely difficult.
Low SNR (0-10 dB) Low SNR values indicate a significant presence of noise in the signal. Processing and perception of such a signal can be challenging but possible in some cases with specialized noise reduction algorithms.
Moderate SNR (10-20 dB) These SNR values are considered acceptable for many signal processing applications. The signal contains a noticeable level of noise, but its quality is still sufficient for processing and perception.
High SNR (20-30 dB) High SNR values indicate good signal quality with a low noise level. Such signals are easily processed and perceived by humans.
Very High SNR (>30 dB) Very high SNR values correspond to signals with minimal noise levels. These signals are considered ideal for most signal processing applications.

SDR

Code for SDR Calculation Examples

Note

It is important to understand that these ranges are approximate and can change depending on the specific application and quality requirements for speech enhancement. In addition, the perception of sound quality can be subjective and influenced by individual user preferences. In the context of the table provided, higher SNR values indicate better signal quality with less distortion.

SDR Range Description
Very Low SDR (<0 dB) The distortion level exceeds the signal power. The signal is highly distorted, making it difficult to recover or process accurately.
Low SDR (0-5 dB) Low SDR values indicate the presence of significant distortion relative to the signal power. Processing and perception of such a signal can be challenging, requiring specialized distortion reduction methods.
Moderate SDR (5-10 dB) These SDR values indicate moderate distortion in the signal, which may degrade its quality, but still allows for some processing and perception.
High SDR (10-15 dB) High SDR values indicate good signal quality relative to the level of distortion. Such signals are relatively clean and easy to process.
Very High SDR (>15 dB) Very high SDR values correspond to signals with minimal distortion levels. These signals are considered ideal for most signal processing applications.

Star History

Star History Chart

About

Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speech enhancement with this repository! Don't forget to ⭐ if you find it helpful.

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published