Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mumbling Voice #40

Open
Blandrust opened this issue May 8, 2023 · 2 comments
Open

Mumbling Voice #40

Blandrust opened this issue May 8, 2023 · 2 comments

Comments

@Blandrust
Copy link

Hey there,

I'm trying to fine-tune the TTS model for the German language, but I'm fairly new to this field. I've tried various approaches and datasets, such as the German part of the M-AI Labs dataset, and the Mozilla Common Voice dataset. I've also adjusted the Coqui recipe you provided to fit with the new datasets.

I've tried training the model for 100k steps and even up to 500k steps, but I can't seem to get the model to learn German. It's always kind of mumbling and not learning anything at all, even though the loss is still decreasing after 100k steps and onwards.

Has anyone experienced this kind of issue before? Do I need to train the model for a longer period, or am I missing something crucial in my approach?

Thanks in advance.

@harshvardhan-truefan
Copy link

Hi,
I'm facing the same issue as well. I tried to fine-tune the yourtts model on my dataset, but the test audios that I'm getting in the end is a person mumbling, and is not able to form sentences. Any idea about how to solve this issue ?

Thanks and Regards,
Harsh

@thivux
Copy link

thivux commented Apr 10, 2024

did anyone find the solution?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants