Variance embeds in multispeaker models #1299

Open
1 task done
ariikamusic opened this issue Oct 12, 2024 · 0 comments

Acknowledgement

  • I have read Getting-Started and FAQ

🐛 Describe the bug

I am writing this issue on behalf of multiple community members on Discord who are experiencing, or have experienced, this problem.

Whenever variance parameters (pitch, tension, voicing) are generated for a multispeaker DiffSinger model, OpenUTAU seems to run the variance model with only the first speaker embed listed in the configuration file, regardless of which speaker is selected in OpenUTAU. Affected users report that the generated variance curves and/or pitch stay very similar, if not identical, across all speakers.
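
To make the suspected behavior concrete, here is a minimal sketch in Python. It is not OpenUTAU's code, and I have not confirmed the cause in the source; the speaker names, embed values, and function names are all hypothetical and only illustrate the difference between what we seem to observe and what we would expect.

```python
# Hypothetical speaker embeds, in the order they might be listed in a
# configuration file. Names and values are made up for illustration.
SPEAKER_EMBEDS = {
    "speaker_a": [1.0, 0.0],
    "speaker_b": [0.0, 1.0],
}


def pick_embed_suspected(selected_speaker: str) -> list[float]:
    """Suspected behavior: the selection is ignored and the first
    embed in the configuration is always used."""
    return next(iter(SPEAKER_EMBEDS.values()))


def pick_embed_expected(selected_speaker: str) -> list[float]:
    """Expected behavior: the embed of the selected speaker is used."""
    return SPEAKER_EMBEDS[selected_speaker]


if __name__ == "__main__":
    for name in SPEAKER_EMBEDS:
        print(name, pick_embed_suspected(name), pick_embed_expected(name))
```

If something like the first function is what happens, every speaker would produce the same variance output, which matches what users describe.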

Explains how to reproduce the bug

N/A

OS & Version

Windows

Logs

N/A
