Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

将音频切分更改为fap的音频切分与响度匹配,以及完成剩余的i18n #937

Closed
wants to merge 10 commits into from
Closed
Show file tree
Hide file tree
Changes from 4 commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -255,6 +255,7 @@ Special thanks to the following projects and contributors:
- [gradio](https://github.com/gradio-app/gradio)
- [faster-whisper](https://github.com/SYSTRAN/faster-whisper)
- [FunASR](https://github.com/alibaba-damo-academy/FunASR)
- [audio-preprocess](https://github.com/fishaudio/audio-preprocess)

## Thanks to all contributors for their efforts

Expand Down
2 changes: 1 addition & 1 deletion docs/cn/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -244,7 +244,7 @@ python ./tools/asr/fasterwhisper_asr.py -i <input> -o <output> -l <language>
- [SubFix](https://github.com/cronrpc/SubFix)
- [FFmpeg](https://github.com/FFmpeg/FFmpeg)
- [gradio](https://github.com/gradio-app/gradio)

- [audio-preprocess](https://github.com/fishaudio/audio-preprocess)
## 感谢所有贡献者的努力

<a href="https://github.com/RVC-Boss/GPT-SoVITS/graphs/contributors" target="_blank">
Expand Down
1 change: 1 addition & 0 deletions docs/ja/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -222,6 +222,7 @@ python ./tools/asr/fasterwhisper_asr.py -i <input> -o <output> -l <language>
- [SubFix](https://github.com/cronrpc/SubFix)
- [FFmpeg](https://github.com/FFmpeg/FFmpeg)
- [gradio](https://github.com/gradio-app/gradio)
- [audio-preprocess](https://github.com/fishaudio/audio-preprocess)

## すべてのコントリビューターに感謝します

Expand Down
1 change: 1 addition & 0 deletions docs/ko/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -226,6 +226,7 @@ python ./tools/asr/fasterwhisper_asr.py -i <input> -o <output> -l <language>
- [SubFix](https://github.com/cronrpc/SubFix)
- [FFmpeg](https://github.com/FFmpeg/FFmpeg)
- [gradio](https://github.com/gradio-app/gradio)
- [audio-preprocess](https://github.com/fishaudio/audio-preprocess)

## 모든 기여자들에게 감사드립니다 ;)

Expand Down
7 changes: 6 additions & 1 deletion i18n/locale/en_US.json
Original file line number Diff line number Diff line change
Expand Up @@ -31,10 +31,15 @@
"凑四句一切": "Slice once every 4 sentences",
"按英文句号.切": "Slice by English punct",
"threshold:音量小于这个值视作静音的备选切割点": "Noise gate threshold (loudness below this value will be treated as noise",
"min_length:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值": "Minimum length",
"min_duration:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_duration: The minimum length for each segment. If the first segment is too short, it will be combined with subsequent segments until it exceeds this value.",
"max_duration:每段最长多长": "Maxium duration",
"min_interval:最短切割间隔": "Minumum interval for audio cutting",
"hop_size:怎么算音量曲线,越小精度越大计算量越高(不是精度越大效果越好)": "hop_size: FO hop size, the smaller the value, the higher the accuracy)",
"max_sil_kept:切完后静音最多留多长": "Maximum length for silence to be kept",
"对于过短音频的处理方法,勾选则合并,不勾选则抛弃": "Check to enable the short audio pieces",
"目标响度": "Target Loudness",
"峰值响度": "Peak Loudness",
"是否匹配响度": "Check to enable loudness normalization",
"开启语音切割": "Start audio slicer",
"终止语音切割": "Stop audio cutting",
"max:归一化后最大值多少": "Loudness multiplier after normalized",
Expand Down
7 changes: 6 additions & 1 deletion i18n/locale/es_ES.json
Original file line number Diff line number Diff line change
Expand Up @@ -19,10 +19,15 @@
"凑四句一切": "Completa cuatro oraciones para rellenar todo",
"按英文句号.切": "Cortar por puntos en inglés.",
"threshold:音量小于这个值视作静音的备选切割点": "umbral: puntos de corte alternativos considerados como silencio si el volumen es menor que este valor",
"min_length:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_length: duración mínima de cada segmento, si el primer segmento es demasiado corto, se conecta continuamente con los siguientes hasta que supera este valor",
"min_duration:每段最短多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_duration: Duración mínima de cada segmento. Si el primer segmento es demasiado corto, se combinará con los siguientes segmentos hasta superar este valor.",
"max_duration:每段最长多长": "max_duration: Duración máxima",
"min_interval:最短切割间隔": "min_interval: intervalo mínimo de corte",
"hop_size:怎么算音量曲线,越小精度越大计算量越高(不是精度越大效果越好)": "hop_size: cómo calcular la curva de volumen, cuanto más pequeño, mayor precisión pero mayor carga computacional (mayor precisión no significa mejor rendimiento)",
"max_sil_kept:切完后静音最多留多长": "max_sil_kept: duración máxima del silencio después del corte",
"对于过短音频的处理方法,勾选则合并,不勾选则抛弃": "Marcar para activar la fusión de clips de audio cortos",
"目标响度": "Loudness objetivo",
"峰值响度": "Loudness de pico",
"是否匹配响度": "Marcar para activar la correspondencia de loudness",
"开启语音切割": "Habilitar la división de voz",
"终止语音切割": "Terminar la división de voz",
"max:归一化后最大值多少": "max: valor máximo después de la normalización",
Expand Down
7 changes: 6 additions & 1 deletion i18n/locale/fr_FR.json
Original file line number Diff line number Diff line change
Expand Up @@ -19,10 +19,15 @@
"凑四句一切": "Composez quatre phrases pour tout remplir",
"按英文句号.切": "Découpez par des points en anglais",
"threshold:音量小于这个值视作静音的备选切割点": "seuil: le volume inférieur à cette valeur est considéré comme un point de coupe silencieux alternatif",
"min_length:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_length: longueur minimale de chaque segment, si le premier segment est trop court, il est continué avec le segment suivant jusqu'à dépasser cette valeur",
"min_furation:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_duration : Durée minimale de chaque segment. Si le premier segment est trop court, il sera combiné avec les segments suivants jusqu'à dépasser cette valeur.",
"max_duration:每段最长多长": "Maximum duration: Durée maximale",
"min_interval:最短切割间隔": "min_interval: intervalle de coupe minimum",
"hop_size:怎么算音量曲线,越小精度越大计算量越高(不是精度越大效果越好)": "hop_size: comment calculer la courbe de volume, plus petit pour une précision plus élevée mais une charge de calcul plus élevée (ce n'est pas une meilleure précision)",
"max_sil_kept:切完后静音最多留多长": "max_sil_kept: durée maximale de silence après la coupe",
"对于过短音频的处理方法,勾选则合并,不勾选则抛弃": "Cocher pour activer la fusion des clips audio courts",
"目标响度": "Loudness cible",
"峰值响度": "Loudness de crête",
"是否匹配响度": "Cocher pour activer l'ajustement de loudness",
"开启语音切割": "Activer le découpage vocal",
"终止语音切割": "Arrêter le découpage vocal",
"max:归一化后最大值多少": "max: valeur maximale après normalisation",
Expand Down
7 changes: 6 additions & 1 deletion i18n/locale/it_IT.json
Original file line number Diff line number Diff line change
Expand Up @@ -11,10 +11,15 @@
"音频自动切分输入路径,可文件可文件夹": "Percorso di input per la segmentazione automatica dell'audio, può essere un file o una cartella",
"切分后的子音频的输出根目录": "Directory radice di output per gli audio segmentati",
"threshold:音量小于这个值视作静音的备选切割点": "threshold: Punto di taglio alternativo considerato silenzioso se il volume è inferiore a questo valore",
"min_length:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_length: Lunghezza minima di ogni segmento. Se il primo segmento è troppo corto, verrà unito agli segmenti successivi fino a superare questo valore",
"min_duration:每段最短多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_duration: Quanto breve può essere ciascun segmento, se il primo segmento è troppo breve, continuerà ad essere unito ai segmenti successivi fino a superare questo valore.",
"max_duration:每段最长多长": "Maximum duration: Durata massima",
"min_interval:最短切割间隔": "min_interval: Intervallo minimo di taglio",
"hop_size:怎么算音量曲线,越小精度越大计算量越高(不是精度越大效果越好)": "hop_size: Come calcolare la curva del volume. Più piccolo è, maggiore è la precisione ma aumenta la complessità computazionale (non significa che una maggiore precisione dà risultati migliori)",
"max_sil_kept:切完后静音最多留多长": "max_sil_kept: Massima durata del silenzio dopo il taglio",
"对于过短音频的处理方法,勾选则合并,不勾选则抛弃": "Seleziona per abilitare l'unione di clip audio brevi",
"目标响度": "Loudness obiettivo",
"峰值响度": "Loudness di picco",
"是否匹配响度": "Seleziona per abilitare il matching di loudness",
"开启语音切割": "Attivare la segmentazione vocale",
"终止语音切割": "Terminare la segmentazione vocale",
"max:归一化后最大值多少": "max: Massimo valore dopo la normalizzazione",
Expand Down
7 changes: 6 additions & 1 deletion i18n/locale/ja_JP.json
Original file line number Diff line number Diff line change
Expand Up @@ -19,10 +19,15 @@
"凑四句一切": "4つの文で埋める",
"按英文句号.切": "英文のピリオドで切ってください",
"threshold:音量小于这个值视作静音的备选切割点": "閾値:この値未満の音量は静音と見なされ、代替のカットポイントとして扱われます",
"min_length:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_length:各セグメントの最小長さ。最初のセグメントが短すぎる場合、連続して後続のセグメントに接続され、この値を超えるまで続きます。",
"min_duration:每段最短多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_duration: 各セグメントの最短長さ。最初のセグメントが短すぎる場合は、この値を超えるまで次のセグメントと連続して結合されます。",
"max_duration:每段最长多长": "maximum duration: 最大持続時間",
"min_interval:最短切割间隔": "min_interval:最短カット間隔",
"hop_size:怎么算音量曲线,越小精度越大計算量越高(不是精度越大效果越好)": "hop_size:音量曲線を計算する方法。値が小さいほど精度が高くなり、計算量が増加します(精度が高いほど効果が良いわけではありません)。",
"max_sil_kept:切完后静音最多留多长": "max_sil_kept:切り終えた後、最大でどれだけ静かにするか",
"对于过短音频的处理方法,勾选则合并,不勾选则抛弃": "对短いオーディオクリップのマージを有効にするためにチェック",
"目标响度": "目標ラウドネス",
"峰值响度": "ピークラウドネス",
"是否匹配响度": "ラウドネスマッチングを有効にするためにチェック",
"开启语音切割": "音声の分割を開始",
"终止语音切割": "音声の分割を停止",
"max:归一化后最大值多少": "max:正規化後の最大値",
Expand Down
7 changes: 6 additions & 1 deletion i18n/locale/ko_KR.json
Original file line number Diff line number Diff line change
Expand Up @@ -19,10 +19,15 @@
"凑四句一切": "네 문장의 세트를 완성하세요.",
"按英文句号.切": "영어 문장으로 분리하기",
"threshold:音量小于这个值视作静音的备选切割点": "임계 값: 이 값보다 작은 볼륨은 대체 분리 지점으로 간주됩니다.",
"min_length:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值": "최소 길이: 각 세그먼트의 최소 길이. 첫 번째 세그먼트가 너무 짧으면 계속해서 뒷부분과 연결하여 이 값 이상이 될 때까지",
"min_duration:每段最短多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_duration: 각 세그먼트의 최소 길이가 얼마인지, 첫 번째 세그먼트가 너무 짧다면 이 값을 초과할 때까지 계속해서 다음 세그먼트와 결합됩니다.",
"max_duration:每段最长多长": "Maximum duration: 최대 지속 시간",
"min_interval:最短切割间隔": "최소 분리 간격",
"hop_size:怎么算音量曲线,越小精度越大计算量越高(不是精度越大效果越好)": "hop 크기: 볼륨 곡선을 계산하는 방법. 작을수록 정확도가 높아지지만 계산량이 높아집니다 (정확도가 높다고 효과가 좋아지지 않음)",
"max_sil_kept:切完后静音最多留多长": "최대 유지되는 정적 길이 (분리 후)",
"对于过短音频的处理方法,勾选则合并,不勾选则抛弃": "짧은 오디오 클립 합치기 활성화 체크",
"目标响度": "목표 라우드니스",
"峰值响度": "피크 라우드니스",
"是否匹配响度": "라우드니스 매칭 활성화 체크",
"开启语音切割": "음성 분리 활성화",
"终止语音切割": "음성 분리 종료",
"max:归一化后最大值多少": "최대 값 (정규화 후)",
Expand Down
7 changes: 6 additions & 1 deletion i18n/locale/pt_BR.json
Original file line number Diff line number Diff line change
Expand Up @@ -11,10 +11,15 @@
"音频自动切分输入路径,可文件可文件夹": "Caminho de entrada automático de corte de áudio, pode ser um arquivo ou uma pasta",
"切分后的子音频的输出根目录": "Diretório raiz de saída do sub-áudio após o corte",
"threshold:音量小于这个值视作静音的备选切割点": "Limiar: O volume menor que este valor é considerado como um ponto de corte mudo alternativo",
"min_length:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_length: O comprimento mínimo de cada parágrafo, se o primeiro for muito curto, conecte-o continuamente aos próximos até ultrapassar este valor",
"min_duration:每段最短多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_duration: Duração mínima de cada segmento. Se o primeiro segmento for muito curto, ele será combinado com os segmentos seguintes até ultrapassar esse valor.",
"max_duration:每段最长多长": "Maximum duration: Duração máxima",
"min_interval:最短切割间隔": "min_interval: O intervalo de corte mínimo",
"hop_size:怎么算音量曲线,越小精度越大计算量越高(不是精度越大效果越好)": "HOP_SIZE: Como calcular a curva de volume, quanto menor a precisão, maior a quantidade de cálculos (não significa que quanto maior a precisão, melhor o efeito)",
"max_sil_kept:切完后静音最多留多长": "max_sil_kept: Depois de cortar, por quanto tempo no máximo o silêncio é mantido",
"对于过短音频的处理方法,勾选则合并,不勾选则抛弃": "Marcar para ativar a união de clipes de áudio curtos",
"目标响度": "Loudness alvo",
"峰值响度": "Loudness de pico",
"是否匹配响度": "Marcar para ativar o ajuste de loudness",
"开启语音切割": "Ativar corte de voz",
"终止语音切割": "Encerrar corte de voz",
"max:归一化后最大值多少": "MAX: Qual é o valor máximo após a normalização?",
Expand Down
7 changes: 6 additions & 1 deletion i18n/locale/zh_CN.json
Original file line number Diff line number Diff line change
Expand Up @@ -31,10 +31,15 @@
"凑四句一切": "凑四句一切",
"按英文句号.切": "按英文句号.切",
"threshold:音量小于这个值视作静音的备选切割点": "threshold:音量小于这个值视作静音的备选切割点",
"min_length:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_length:每段最小多长,如果第一段太短一直和后面段连起来直到超过这个值",
"min_duration:每段最短多长,如果第一段太短一直和后面段连起来直到超过这个值": "min_duration:每段最短多长,如果第一段太短一直和后面段连起来直到超过这个值",
"max_duration:每段最长多长": "max_duration:每段最长多长",
"min_interval:最短切割间隔": "min_interval:最短切割间隔",
"hop_size:怎么算音量曲线,越小精度越大计算量越高(不是精度越大效果越好)": "hop_size:怎么算音量曲线,越小精度越大计算量越高(不是精度越大效果越好)",
"max_sil_kept:切完后静音最多留多长": "max_sil_kept:切完后静音最多留多长",
"对于过短音频的处理方法,勾选则合并,不勾选则抛弃": "对于过短音频的处理方法,勾选则合并,不勾选则抛弃",
"目标响度": "目标响度",
"峰值响度": "峰值响度",
"是否匹配响度": "是否匹配响度",
"开启语音切割": "开启语音切割",
"终止语音切割": "终止语音切割",
"max:归一化后最大值多少": "max:归一化后最大值多少",
Expand Down
3 changes: 2 additions & 1 deletion requirements.txt
Original file line number Diff line number Diff line change
Expand Up @@ -25,4 +25,5 @@ jieba_fast
jieba
LangSegment>=0.2.0
Faster_Whisper
wordsegment
wordsegment
pyloudnorm>=0.1.1
Loading