voco-tts

직접 녹음한 데이터셋으로 파인튜닝한 모델 데모는 여기에서 확인할 수 있습니다

모델 트레이닝 순서는 아래와 같습니다. 보다 높은 정확도를 위해 직접 녹음한 데이터셋 대신 LJ Speech Dataset을 이용해 훈련을 진행하겠습니다.

git clone https://github.com/go-ggle/voco-tts.git

apt-get install espeak

pip install -r requirements.txt

cd monotonic_align
python setup.py build_ext --inplace

cd monotonic_align
mkdir monotonic_align
python setup.py build_ext --inplace

wget https://data.keithito.com/data/speech/LJSpeech-1.1.tar.bz2
tar -xvf LJSpeech-1.1.tar.gz

sed -i -- 's,DUMMY1,LJSpeech-1.1/wavs,g' filelists/ljs_audio_text*.txt.cleaned

vi filelists/ljs_audio_text_train_filelist.txt.cleaned
:500,$d
:wq

vi filelists/ljs_audio_text_val_filelist.txt.cleaned
:150,$d
:wq

wget --no-check-certificate 'https://docs.google.com/uc?export=download&id=1gTp1FXfkiLVlpKh2Vg5FygaCKbsQ97FG' -O configs/train_base.json

python train_ms.py -c configs/train_base.json -m train_base

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
configs		configs
filelists		filelists
monotonic_align		monotonic_align
resources		resources
text		text
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
attentions.py		attentions.py
commons.py		commons.py
data_utils.py		data_utils.py
inference.ipynb		inference.ipynb
losses.py		losses.py
mel_processing.py		mel_processing.py
models.py		models.py
modules.py		modules.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
train.py		train.py
train_ms.py		train_ms.py
transforms.py		transforms.py
utils.py		utils.py

Provide feedback