Modified the inference UI and added some practical features #1330

Open
wants to merge 5 commits into main

Conversation

DD-MASTERT

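Added a parameter to the get_tts_wav function that adjusts the length of the silence segments, and exposed it as an adjustable Slider in the GPT sampling parameters section.
After inference, the audio is now saved to the cache directory as moys/temp/audio.wav. The purpose is to let the newly added download controls below take a path and, with one click, save the inferred audio to that directory (multiple files saved to the same directory get non-duplicate names). Because the cache file is overwritten on every run, no extra cache accumulates; there is only ever one audio.wav.
Next is the model configuration section (the control code is at lines 738-832), which makes it easy to switch models quickly.
Finally, the text-splitting section at the bottom has a new push button that sends the split result to the text input box.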
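
For readers who want a concrete picture of the first two changes, here is a minimal sketch. It is not the code from this PR: the helper names `silence_sample_count` and `export_cached_audio`, their parameters, and the `_1`, `_2` suffix scheme are all assumptions made for illustration. Only the idea of an adjustable pause length and the single cache file moys/temp/audio.wav come from the description above.

```python
import shutil
from pathlib import Path


def silence_sample_count(sampling_rate: int, pause_seconds: float) -> int:
    """Number of zero samples to insert between synthesized segments.

    Hypothetical helper: pause_seconds stands in for the value a UI
    slider would supply.
    """
    if pause_seconds < 0:
        raise ValueError("pause_seconds must be non-negative")
    return round(sampling_rate * pause_seconds)


def export_cached_audio(cache_file: str, target_dir: str, stem: str = "audio") -> Path:
    """Copy the single cached WAV into target_dir without overwriting.

    Hypothetical helper: if "<stem>.wav" already exists, try
    "<stem>_1.wav", "<stem>_2.wav", and so on until a free name is found.
    """
    src = Path(cache_file)
    dest_dir = Path(target_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)

    dest = dest_dir / f"{stem}{src.suffix}"
    index = 1
    while dest.exists():
        dest = dest_dir / f"{stem}_{index}{src.suffix}"
        index += 1

    shutil.copy2(src, dest)  # the cache file itself stays in place
    return dest
```

Under these assumptions, a download button callback would call something like `export_cached_audio("moys/temp/audio.wav", user_supplied_dir)` after each inference, so the cache keeps exactly one file while the target directory collects uniquely named copies.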

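In short, these are all small tweaks that make the inference UI easier to use; nothing substantial was changed. If it looks fine to you, feel free to merge. Separately, I strongly recommend upgrading gradio. The new version requires some changes to the button code, which is a bit of a hassle, but its audio component lets you trim a clip of the reference audio directly in the reference-audio field. That is very convenient, because you no longer need a separate program for trimming.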

1 participant