Any current results? #2

unilight · 2023-03-16T10:06:05Z

Hi Mingjie, very cool project and very nice work!! This is exactly what we need -- more comparisons and analyses. Just wondering if you have any insights you can share? Although I bet you would rather write them in a paper :)

MingjieChen · 2023-03-16T11:01:55Z

Hello Wen-Chin,

I can share some insights from my experiments.

Some HuBERT-based linguistic encoders (i.e. hubert_soft, content_vec) still leak speaker information, even though disentanglement learning methods have been applied to them.
DiffWave as a decoder generates good-quality waveforms, but at inference it ignores the given target speaker information and reconstructs the source speech instead. I am still looking into this problem.

In terms of results, I am still debugging and running training jobs with the limited number of GPUs in our lab.
So I need some more time (e.g. one or two months) before I have formal results that can be shared.

I would be happy to receive suggestions, pull requests, or further collaboration.

unilight · 2023-03-17T08:26:05Z

Hmm, I am happy to collaborate, but I am not sure what the end goal of this project is, and thus not sure how I can help.
