Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: resolve inconsistent sample rates and improve Train tab layout #2234

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

Wernervanrun
Copy link

Pull request checklist

  • The PR has a proper title. Use Semantic Commit Messages. (No more branch-name title please)
  • Make sure this is ready to be merged into the relevant branch. Please don't create a PR and let it hang for a few days.
  • Ensure you can run the codes you submitted successfully. These submissions will be prioritized for review:
    • Introduce improvements in program execution speed;
    • Introduce improvements in synthesis quality;
    • Fix existing bugs reported by user feedback (or you met);
    • Introduce more convenient user operations.

PR type

  • Bug fix / synthesis quality improvement

Description

This commit addresses two issues on the Train tab:

  1. Inconsistent Target Sample Rates:

    • Resolved an issue where switching versions led to inconsistent Target sample rate options. Additionally, I want to point out a missing v2 - 40k configuration file. Despite v2 having a 40k pretrained model:
      • assets/pretrained (32k, 40k, 48k)
      • assets/pretrained_v2 (32k, 40k, 48k)
      • configs/v1 (32k, 40k, 48k)
      • configs/v2 (32k, 48k)
    • Tests showed that the default settings 40k - v2 generated an incorrect 40k - v1 config file. The default settings have been updated to use 48k - v2 to align the sample rates correctly.

    Now:

    • v1 will consistently show 32k, 40k, and 48k
    • v2 will consistently show 32k and 48k

    Note: Ensures that selecting the 40k option for v2 no longer results in an incorrect configuration file.

  2. Repositioned 'Version' Field:

    • Moved the 'Version' field to appear before the 'Target sample rate' field on the Train tab. This improves the logical flow and user experience by ensuring that the 'Version' selection is made before choosing the 'Target sample rate' it is updating.

    Changes:

    • Updated layout on the Train tab to reposition the 'Version' field.

    Note: No functional changes to the data processing were made.

This commit addresses two issues on the Train tab:

1. **Inconsistent Target Sample Rates:**
   - Resolved an issue where switching versions led to inconsistent Target sample rates. Additionally, it fixes a missing v2 - 40k configuration file. Despite v2 having a 40k pretrained model, configurations were mismatched:
     - assets/pretrained  (32k, 40k, 48k)
     - assets/pretrained_v2  (32k, 40k, 48k)
     - configs/v1 (32k, 40k, 48k)
     - configs/v2 (32k, 48k)
   - Tests showed that a 40k - v2 setting generated an incorrect 40k - v1 config file. The default settings have been updated to use 48k - v2 to align the sample rates correctly.

   Now:
   - v1 will consistently show 32k, 40k, and 48k
   - v2 will consistently show 32k and 48k

   **Note:** Ensures that selecting the 40k option for v2 no longer results in an incorrect configuration file, aligning the sample rates across both versions.

2. **Repositioned 'Version' Field:**
   - Moved the 'Version' field to appear before the 'Target sample rate' field on the Train tab. This improves the logical flow and user experience by ensuring that the 'Version' selection is made before choosing the 'Target sample rate'.

   Changes:
   - Updated layout on the Train tab to reposition the 'Version' field.
   - Ensured that field updates are reflected correctly.

   **Note:** Aligns with a more intuitive user workflow by ensuring users select the version first before setting the target sample rate. No functional changes to the data processing were made.
Noticed that the code was making an exception with the line:
`if version19 == "v1" or sr2 == "40k":`
This approach isn't ideal, as it causes v2 - 40k to fallback to v1 - 40k. While this fix re-adds the necessary code, a better long-term solution might be to copy the v1 - 40k config into the v2 config folder to avoid such exceptions in the future.
fumiama added a commit to fumiama/Retrieval-based-Voice-Conversion-WebUI that referenced this pull request Aug 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant