-
Notifications
You must be signed in to change notification settings - Fork 487
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Alignment problems with German text? #38
Comments
Does your training data contain beginning / ending silences? It's better to trim silences before training. I sometimes got bad results with long audio (for example, see #24). What's the output of the following command?
If the |
The |
@r9y9, it seems this is a general problem, see ##27 -- When I use a test.txt with more than one line, the first entry is generated correctly, the others are just inaudible. Everything is fine until
in When I start synthesis.py for each entry directly, all of them are generated ok... I'm investigating what could be going on... |
Oh, thank you for the report. I can reproduce.... I found a really stupid bug. Fix with tests coming shortly. |
@imdatsolak I think I fixed the bug. Could you confirm if it works? There's a fix for only incremental inference, so you don't need to re-train your model. |
I'll check and let you know within next 10 minutes :-) |
@r9y9, your fix works. Thanks again. Closing this issue. |
Hi @r9y9, I'm training on German audio. I have added the german characters (Ä, Ö, Ü, ß, ä, ö, ü) to the symbolset and am using basic_cleaners.
The problem is the alignment on test-audio. Look at some of the samples. And, of course, the audio is horrible too. I have tested with up to 500k steps. Always the same results. When I generate audio with synthesis, I have similar results. Any hints where I'd need to add more info?
Thanks for any recommendations... (I converted the German training data to ljspeech format...)
The text was updated successfully, but these errors were encountered: