You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi Mingjie, very cool project and very nice work!! This is exactly what we need -- more comparisons and analyses. Just wondering if you have any insights you can share? Although I bet you would rather write them in a paper :)
The text was updated successfully, but these errors were encountered:
Some HUBERT based linguistic encoders (i.e. hubert_soft, content_vec) still cause speaker information leakage, even though some disentanglement learning methods have been applied.
DiffWave as a decoder generates good quality waveforms but it ignores given target speaker information in inference. It reconstructs source speech. I am still looking into this problem.
In terms of results, I am currently still debugging and running trainings with limited number of GPUs in our lab.
So I still need some time (e.g. one or two months) to get some formal results that can be shared.
I am happy if you would like to give suggestions, pull requests or more collaborations.
Hi Mingjie, very cool project and very nice work!! This is exactly what we need -- more comparisons and analyses. Just wondering if you have any insights you can share? Although I bet you would rather write them in a paper :)
The text was updated successfully, but these errors were encountered: