VoiceConversionGANs

GAN series for voice conversion on VCC2018 dataset

This is a voice conversion repository including cyclegan-vc, stargan-vc, stargan-vc2 and some other variants

This work is still in progress, more GAN models will be included

This work is based on repository stargan-vc, stargan-vc2 and cyclegan-vc

Requirements:

Python3
PyTorch 0.4.1
Pyworld

Models:

stgan: stargan-vc1 from https://github.com/liusongxiang/StarGAN-Voice-Conversion
stgan2: stargan-vc2 from https://github.com/SamuelBroughton/StarGAN-Voice-Conversion-2
stgan1_cin: stargan-vc1 + generator with conditional instance normalization + speaker classifier
stgan2_new: stargan-vc2 + patchgan discriminator + only target condition in generator and discriminator + no speaker classifier + gradient penalty
stgan2_ls: stargan-vc2 + projection discriminator (as in the paper) + source and target conditions in generator and discriminator + LSGAN adversarial loss
cycgan: cyclegan-vc1

Preprocess

./run_pre.sh

Modify according to your own conda env and hyper-params

Train:

./run_train.sh

Modify according to your own conda env and hyper-params

Convert

./run_convert.sh

Objective Evaluation

./run_eval.sh

This evaluation calculate Mel Cepstral Distortion (MCD) and Modulation Spectral Distance (MSD) as in stargan-vc2 paper.

However, this script can not get the same score as the paper.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
cycgan		cycgan
stgan		stgan
stgan1_cin		stgan1_cin
stgan2		stgan2
stgan2_ls		stgan2_ls
stgan2_new		stgan2_new
.gitignore		.gitignore
README.md		README.md
convert.py		convert.py
data_loader.py		data_loader.py
evaluate.py		evaluate.py
logger.py		logger.py
main.py		main.py
main_cyc.py		main_cyc.py
main_st1cin.py		main_st1cin.py
main_st2.py		main_st2.py
main_st2ls.py		main_st2ls.py
main_st2new.py		main_st2new.py
preprocess.py		preprocess.py
run_convert.sh		run_convert.sh
run_eval.sh		run_eval.sh
run_pre.sh		run_pre.sh
run_train.sh		run_train.sh
run_train_cyc.sh		run_train_cyc.sh
run_train_ls.sh		run_train_ls.sh
speaker_used.json		speaker_used.json
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VoiceConversionGANs

This is a voice conversion repository including cyclegan-vc, stargan-vc, stargan-vc2 and some other variants

This work is based on repository stargan-vc, stargan-vc2 and cyclegan-vc

Requirements:

Models:

Preprocess

Train:

Convert

Objective Evaluation

About

Releases

Packages

Languages

entn-at/VoiceConversionGANs

Folders and files

Latest commit

History

Repository files navigation

VoiceConversionGANs

This is a voice conversion repository including cyclegan-vc, stargan-vc, stargan-vc2 and some other variants

This work is based on repository stargan-vc, stargan-vc2 and cyclegan-vc

Requirements:

Models:

Preprocess

Train:

Convert

Objective Evaluation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages