Skip to content

Commit

Permalink
update
Browse files Browse the repository at this point in the history
  • Loading branch information
aijianiula0601@gmail.com committed Feb 24, 2022
1 parent e766f73 commit 4b0ec37
Show file tree
Hide file tree
Showing 2 changed files with 5 additions and 11 deletions.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
# FlowCPCVC

### Audio Demo for "FlowCPCVC: A flow contrastive predictive coding voice conversion system"
Audio Demo for "FlowCPCVC: A flow contrastive predictive coding voice conversion system"
14 changes: 4 additions & 10 deletions index.md
Original file line number Diff line number Diff line change
@@ -1,16 +1,10 @@
## FlowCPCVC
# <center> FlowCPCVC: A flow contrastive predictivecoding voice conversion system </center>

#### Audio Demo for "FlowCPCVC: A flow contrastive predictive coding voice conversion system"

---

## <center> FlowCPCVC: A flow contrastive predictivecoding voice conversion system </center>

#### <center> Jia-Hong Huang, Wen Xu, Yu-Le Li </center>
<center> Jia-Hong Huang, Wen Xu, Yu-Le Li </center>

## Abstract

#### The task of voice conversion(VC) system is to change the voice iden-tity from one speaker into another while keeping the linguistic contentunchanged. Any-to-any voice conversion had gained increasing popular-ity in many applications. In the paper, we presented FlowCPCVC, anzero-shot voice conversion(VC) system, which is light-weight end-to-endsystem and can achieve state-of-art quality conversion. Our model baseon VAE, which combined speaker encoder and contend extractor witha fancy method to train our model. The speaker verification aim toextract speaker’s timbre and Contrastive predictive coding(CPC) mod-ule help to extract linguistic content to guide the flow module discardthe tone and keep the linguistic content. Using the speaker verifica-tion module, our framework can extract the tone embedding vector fromspeech to assist to finish the zero-shot VC task. Although our frame-work force on any-to-any task, it can also extend to any-to-many taskfor better robustness. Experimental results show that the FlowCPCVCsystem, compared with other zero-shot systems, can achieve state-of-art quality for voice conversion.
The task of voice conversion(VC) system is to change the voice iden-tity from one speaker into another while keeping the linguistic contentunchanged. Any-to-any voice conversion had gained increasing popular-ity in many applications. In the paper, we presented FlowCPCVC, anzero-shot voice conversion(VC) system, which is light-weight end-to-endsystem and can achieve state-of-art quality conversion. Our model baseon VAE, which combined speaker encoder and contend extractor witha fancy method to train our model. The speaker verification aim toextract speaker’s timbre and Contrastive predictive coding(CPC) mod-ule help to extract linguistic content to guide the flow module discardthe tone and keep the linguistic content. Using the speaker verifica-tion module, our framework can extract the tone embedding vector fromspeech to assist to finish the zero-shot VC task. Although our frame-work force on any-to-any task, it can also extend to any-to-many taskfor better robustness. Experimental results show that the FlowCPCVCsystem, compared with other zero-shot systems, can achieve state-of-art quality for voice conversion.

## Compared systems

Expand All @@ -20,7 +14,7 @@
---
## Any-to-Any task

#### speakers come from vctk in test-dataset, all speakers are unseen during training
speakers come from vctk in test-dataset, all speakers are unseen during training

### Man-to-female

Expand Down

0 comments on commit 4b0ec37

Please sign in to comment.