Feature extraction

This part extracts features from speech data, including:

- mel-spectrogram
  - ppgvc_mel
  - mel
- linguistic representations
  - vq-wav2vec
  - conformer_ppg
  - hubert_soft and hubert_discrete
- utterance-level speaker representations
  - d-vector
- prosodic representations
  - ppgvc_logf0
  - fastspeech2 pitch + energy

Mel-Spectrograms

- ppgvc_mel: log-mel spectrograms from [ppg-vc](https://github.com/liusongxiang/ppg-vc)
- mel: log-mel spectrograms from [parallel_wave_gan](https://github.com/kan-bayashi/ParallelWaveGAN)
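
For readers unfamiliar with the term, a log-mel spectrogram spaces its filters evenly on the mel scale and then log-compresses the filter energies. The sketch below illustrates only those two ingredients. It is not the code used by either project above: the HTK-style mel formula, the base-10 logarithm, the floor value, and all function names are assumptions chosen for illustration.

```python
import math


def hz_to_mel(freq_hz):
    """Convert Hz to mel (HTK-style formula; assumed for illustration)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_center_frequencies(n_mels, fmin, fmax):
    """Center frequencies (Hz) of n_mels filters equally spaced on the mel scale."""
    mel_lo, mel_hi = hz_to_mel(fmin), hz_to_mel(fmax)
    step = (mel_hi - mel_lo) / (n_mels + 1)
    return [mel_to_hz(mel_lo + step * (i + 1)) for i in range(n_mels)]


def log_compress(mel_energies, floor=1e-10):
    """Base-10 log of each energy, floored to avoid log(0) (floor is hypothetical)."""
    return [math.log10(max(energy, floor)) for energy in mel_energies]


# Illustrative settings only; check each project's config for the real values.
centers = mel_center_frequencies(n_mels=80, fmin=80.0, fmax=7600.0)
```

The filter count and frequency range here are placeholders; the two feature types differ in exactly such settings, which is why each one is paired with its own vocoder.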

ppgvc_mel

It is compatible with the ppgvc_hifigan vocoder. It uses min-max normalization and is trained on the VCTK dataset.

./bin/feature_extraction_ppgvc_mel_f0.sh dataset
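
As a rough illustration of min-max normalization, the sketch below rescales each mel bin to a fixed range using per-bin minimum and maximum statistics. It is a minimal sketch under stated assumptions, not the script's actual implementation: the function names, the per-bin statistics, and the `[0, 1]` target range are all hypothetical.

```python
def compute_minmax(frames):
    """Return per-bin minimum and maximum over all frames."""
    columns = list(zip(*frames))  # transpose: one tuple per mel bin
    return [min(c) for c in columns], [max(c) for c in columns]


def minmax_normalize(frames, feat_min, feat_max, lo=0.0, hi=1.0):
    """Map each bin from [feat_min, feat_max] to [lo, hi] (range is an assumption)."""
    normalized = []
    for frame in frames:
        row = []
        for value, vmin, vmax in zip(frame, feat_min, feat_max):
            span = vmax - vmin
            # Constant bins have zero span; map them to `lo` instead of dividing by zero.
            scaled = 0.0 if span == 0 else (value - vmin) / span
            row.append(lo + scaled * (hi - lo))
        normalized.append(row)
    return normalized


def minmax_denormalize(frames, feat_min, feat_max, lo=0.0, hi=1.0):
    """Invert minmax_normalize using the same statistics."""
    restored = []
    for frame in frames:
        row = []
        for value, vmin, vmax in zip(frame, feat_min, feat_max):
            scaled = (value - lo) / (hi - lo)
            row.append(vmin + scaled * (vmax - vmin))
        restored.append(row)
    return restored


# Toy log-mel "spectrogram": 3 frames x 2 mel bins (made-up values).
toy_logmel = [[-4.0, 0.0], [-2.0, 2.0], [0.0, 4.0]]
toy_min, toy_max = compute_minmax(toy_logmel)
toy_norm = minmax_normalize(toy_logmel, toy_min, toy_max)
toy_back = minmax_denormalize(toy_norm, toy_min, toy_max)
```

In practice the statistics would be computed once over the training set and reused at inference time, so that the features stay in the range the vocoder expects.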
