This is a TTS model based on VITS that can control the output speech emotion through natural language and control the speaker through reference audio.
-
Updated
Aug 19, 2024 - Python
This is a TTS model based on VITS that can control the output speech emotion through natural language and control the speaker through reference audio.
Add a description, image, and links to the emotion-styled topic page so that developers can more easily learn about it.
To associate your repository with the emotion-styled topic, visit your repo's landing page and select "manage topics."