Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any current results? #2

Open
unilight opened this issue Mar 16, 2023 · 2 comments
Open

Any current results? #2

unilight opened this issue Mar 16, 2023 · 2 comments

Comments

@unilight
Copy link

Hi Mingjie, very cool project and very nice work!! This is exactly what we need -- more comparisons and analyses. Just wondering if you have any insights you can share? Although I bet you would rather write them in a paper :)

@MingjieChen
Copy link
Owner

Hello Wen-Chin,

I can share some insights I found in experiments.

  1. Some HUBERT based linguistic encoders (i.e. hubert_soft, content_vec) still cause speaker information leakage, even though some disentanglement learning methods have been applied.
  2. DiffWave as a decoder generates good quality waveforms but it ignores given target speaker information in inference. It reconstructs source speech. I am still looking into this problem.

In terms of results, I am currently still debugging and running trainings with limited number of GPUs in our lab.
So I still need some time (e.g. one or two months) to get some formal results that can be shared.

I am happy if you would like to give suggestions, pull requests or more collaborations.

@unilight
Copy link
Author

Hmm, I am happy to collaborate, but I am not sure what the end goal of this project this thus not sure how I can help.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants