Alignment problems with German text? #38

imdatceleste · 2018-02-05T16:03:43Z

Hi @r9y9, I'm training on German audio. I have added the German characters (Ä, Ö, Ü, ß, ä, ö, ü) to the symbol set and am using basic_cleaners.

The problem is the alignment on test-audio. Look at some of the samples. And, of course, the audio is horrible too. I have tested with up to 500k steps. Always the same results. When I generate audio with synthesis, I have similar results. Any hints where I'd need to add more info?

Thanks for any recommendations... (I converted the German training data to ljspeech format...)

r9y9 · 2018-02-06T02:52:30Z

Does your training data contain beginning / ending silences? It's better to trim silences before training.
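As a rough illustration of what trimming leading and trailing silence means, here is a minimal pure-Python sketch. The function name `trim_silence` and the fixed amplitude `threshold` are assumptions made for this example and are not part of deepvoice3_pytorch. In practice a library routine such as `librosa.effects.trim`, which thresholds in decibels, is the more usual choice.

```python
def trim_silence(samples, threshold=0.01):
    """Drop leading and trailing samples whose absolute amplitude is below `threshold`.

    `samples` is a sequence of floats in [-1.0, 1.0]. Quiet samples in the
    middle of the utterance are kept; only the two ends are trimmed.
    """
    start = 0
    end = len(samples)
    # Advance past the quiet samples at the beginning.
    while start < end and abs(samples[start]) < threshold:
        start += 1
    # Walk back past the quiet samples at the end.
    while end > start and abs(samples[end - 1]) < threshold:
        end -= 1
    return samples[start:end]


if __name__ == "__main__":
    audio = [0.0, 0.001, 0.5, 0.0, -0.3, 0.002, 0.0]
    print(trim_silence(audio))
```

A fixed amplitude threshold is the simplest possible criterion; noisy recordings usually need an energy-based or decibel-based threshold over short frames instead.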

I sometimes got bad results with long audio (for example, see #24). What's the output of the following command?

python compute_timestamp_ratio.py --hparams="your hyper params" ${your_data_path}

deepvoice3_pytorch/compute_timestamp_ratio.py, line 48 in cda1ca4:

    print(input_timestamps, output_timestamps, output_timestamps / input_timestamps)

If output_timestamps / input_timestamps is larger than 2, I'd try increasing outputs_per_step or decreasing downsample_step to balance the input/output lengths, and change the model architecture accordingly.
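To make the ratio concrete, here is a small sketch of the arithmetic. It is not the code of compute_timestamp_ratio.py: the function name `timestamp_ratio` and its arguments are hypothetical, and it assumes that one decoder step emits `outputs_per_step` frames. The effect of downsample_step is deliberately left out.

```python
def timestamp_ratio(text_lengths, frame_counts, outputs_per_step=1):
    """Return (input_timestamps, output_timestamps, ratio) summed over a dataset.

    text_lengths: number of input symbols per utterance.
    frame_counts: number of acoustic frames per utterance.
    Assumption: each decoder step emits `outputs_per_step` frames.
    """
    if len(text_lengths) != len(frame_counts):
        raise ValueError("text_lengths and frame_counts must have equal length")
    input_timestamps = sum(text_lengths)
    # Ceiling division: a partially filled last step still counts as a step.
    output_timestamps = sum(-(-n // outputs_per_step) for n in frame_counts)
    return input_timestamps, output_timestamps, output_timestamps / input_timestamps


if __name__ == "__main__":
    texts = [10, 20]    # characters per utterance
    frames = [40, 80]   # frames per utterance
    for r in (1, 4):
        print(r, timestamp_ratio(texts, frames, outputs_per_step=r))
```

Under this assumption, raising outputs_per_step from 1 to 4 brings the toy ratio from 4.0 down to 1.0, which is the kind of balancing suggested above.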

imdatceleste · 2018-02-06T06:59:23Z

The output_timestamps/input_timestamps ratio is 1.43. I'll check whether I have pauses at the beginning and end and let you know.
EDIT: yes, there was silence at the beginning and at the end. I removed it and will try training again and let you know. Thank you very much.

imdatceleste · 2018-02-06T13:47:52Z

@r9y9, it seems this is a general problem, see #27 -- When I use a test.txt with more than one line, the first entry is generated correctly, the others are just inaudible. Everything is fine until

mel_outputs, linear_outputs, alignments, done = model(...

in synthesis.py:tts(.... But then the call to model(... returns super-fast and the result is inaudible or actually just no audio at all...

When I start synthesis.py for each entry directly, all of them are generated ok...

I'm investigating what could be going on...

r9y9 · 2018-02-06T15:04:10Z

Oh, thank you for the report. I can reproduce.... I found a really stupid bug. Fix with tests coming shortly.

forgot to clear buffer property ref #38

r9y9 · 2018-02-06T15:46:08Z

@imdatsolak I think I fixed the bug. Could you confirm that it works? The fix only touches incremental inference, so you don't need to re-train your model.
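For readers wondering how a buffer that is not cleared can break every utterance after the first, here is a toy sketch. It is not the repository's code: the class `IncrementalAverager`, its methods, and the `decode` helper are hypothetical stand-ins for a layer that caches past inputs during step-by-step decoding.

```python
class IncrementalAverager:
    """Toy layer that averages the last `kernel_size` inputs, one step at a time."""

    def __init__(self, kernel_size):
        self.kernel_size = kernel_size
        self.input_buffer = None  # cached history, built lazily

    def incremental_forward(self, x):
        if self.input_buffer is None:
            # A fresh utterance starts from zero padding.
            self.input_buffer = [0.0] * self.kernel_size
        # Shift the history left by one step and append the new input.
        self.input_buffer = self.input_buffer[1:] + [x]
        return sum(self.input_buffer) / self.kernel_size

    def clear_buffer(self):
        self.input_buffer = None


def decode(layer, inputs, clear=True):
    """Run one utterance through the layer, optionally resetting its cache first."""
    if clear:
        layer.clear_buffer()
    return [layer.incremental_forward(x) for x in inputs]


if __name__ == "__main__":
    layer = IncrementalAverager(kernel_size=3)
    utterance = [3.0, 3.0, 3.0]
    print(decode(layer, utterance))               # first utterance
    print(decode(layer, utterance, clear=False))  # stale history leaks in
    print(decode(layer, utterance))               # cleared: matches the first
```

Without the reset, the second call starts from the history left behind by the first one, so identical input gives different output. This matches the symptom reported here, where only the first line of test.txt was synthesized correctly.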

imdatceleste · 2018-02-06T15:46:48Z

I'll check and let you know within the next 10 minutes :-)

imdatceleste · 2018-02-06T15:52:06Z

It seems to work; the alignments look a lot better. Unfortunately, I re-started training :-( and am only at step 30k, so I'll need to continue training overnight, and the final result should be available tomorrow. Thank you very much. I'll let you know once I have more results...
EDIT:
It works now. When it reaches save_checkpoint, it (correctly) generates the following alignment files (before the fix, the first looked good and the others looked like the one at the top of this issue):

Thanks again!! 👍

imdatceleste · 2018-02-06T17:46:42Z

@r9y9, your fix works. Thanks again. Closing this issue.

r9y9 added a commit that referenced this issue Feb 6, 2018

Fix non-deterministic inference

6a3fef1

forgot to clear buffer property ref #38

imdatceleste closed this as completed Feb 6, 2018

imdatceleste mentioned this issue Feb 19, 2018

Issue training with DeepVoice3 model with LJSpeech Data #43

Closed
