can we calculate text'phoneme duration time form StochasticDurationPredictor or DurationPredictor ? #591

CasonTsai · 2024-08-05T07:34:29Z

amazing work! excuse me, how to extract text'phoneme duration time form StochasticDurationPredictor or DurationPredictor ? I want to extract the delay time of the phoneme corresponding to each piece of text。

Plachtaa · 2024-08-05T07:37:01Z

You may use the attn output from the forward method as the phoneme-audio alignment result

CasonTsai · 2024-08-05T08:05:46Z

You may use the attn output from the forward method as the phoneme-audio alignment result

thanks for replying,i will experience in inference

CasonTsai · 2024-08-05T10:17:06Z

You may use the attn output from the forward method as the phoneme-audio alignment result

hello,i print the attn output in inferncing the model ,but I don’t know the correspondence between phoneme duration time and attn output of text，thank you for your reply

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

can we calculate text'phoneme duration time form StochasticDurationPredictor or DurationPredictor ? #591

can we calculate text'phoneme duration time form StochasticDurationPredictor or DurationPredictor ? #591

CasonTsai commented Aug 5, 2024

Plachtaa commented Aug 5, 2024

CasonTsai commented Aug 5, 2024

CasonTsai commented Aug 5, 2024

can we calculate text'phoneme duration time form StochasticDurationPredictor or DurationPredictor ? #591

can we calculate text'phoneme duration time form StochasticDurationPredictor or DurationPredictor ? #591

Comments

CasonTsai commented Aug 5, 2024

Plachtaa commented Aug 5, 2024

CasonTsai commented Aug 5, 2024

CasonTsai commented Aug 5, 2024