KoreanTTS

This project implements Korean TTS by combining a Tacotron2 model with a vocoder model (Griffin-Lim, WaveNet, MelGAN).

Based on

Dataset

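Korean Single Speaker Speech (KSS)
- Professional female voice actor (12 hours, wav, 44,100 Hz, 12,853 files, 3 GB)
Voice of actress Yoo In-na (유인나)
- KBS radio program "유인나의 볼륨을 높여요" (3 hours, wav, 16,000 Hz, 3,327 files, 480.6 MB)
- Google Speech to Text API
- Kakao Speech API
Voice of pet trainer Kang Hyung-wook (강형욱)
- ETRI Korean speech recognition API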

The audio data used for training is not shared because of copyright restrictions. Please obtain it from the original sources.

KSS: https://www.kaggle.com/bryanpark/korean-single-speaker-speech-dataset
KBS radio: http://program.kbs.co.kr/2fm/radio/uvolum/pc/index.html

Preprocessing

Convert each wav file to a numpy file
Bundle the 'audio', 'mel', 'linear', and 'text' fields together and save them
This creates Data/kss/"audio file name.npz"
The mel-spectrogram and linear-spectrogram serve as the ground-truth targets
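
The preprocessing steps above can be sketched roughly as follows. This is a minimal NumPy-only illustration, not the project's actual script: the sample rate, FFT size, hop length, number of mel bands, and the helper names (`mel_filterbank`, `linear_spectrogram`, `build_example`, `save_example`) are all assumptions, and log scaling or normalization of the spectrograms is left out. A synthetic tone stands in for a real wav file.

```python
import os
import tempfile

import numpy as np

# Illustrative assumptions only; the project's real settings are not given here.
SAMPLE_RATE = 22050
N_FFT = 1024
HOP_LENGTH = 256
N_MELS = 80


def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + np.asarray(hz, dtype=np.float64) / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=np.float64) / 2595.0) - 1.0)


def mel_filterbank(sr, n_fft, n_mels):
    """Triangular mel filters with shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    edges = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    bank = np.zeros((n_mels, fft_freqs.size))
    for i in range(n_mels):
        lo, mid, hi = edges[i], edges[i + 1], edges[i + 2]
        rising = (fft_freqs - lo) / (mid - lo)
        falling = (hi - fft_freqs) / (hi - mid)
        bank[i] = np.maximum(0.0, np.minimum(rising, falling))
    return bank


def linear_spectrogram(audio, n_fft, hop):
    """Magnitude STFT with shape (frames, n_fft // 2 + 1)."""
    padded = np.pad(audio, n_fft // 2, mode="reflect")
    n_frames = 1 + (padded.size - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = padded[idx] * np.hanning(n_fft)
    return np.abs(np.fft.rfft(frames, axis=1))


def build_example(audio, text, sr=SAMPLE_RATE):
    """Bundle the waveform, both spectrogram targets, and the transcript."""
    linear = linear_spectrogram(audio, N_FFT, HOP_LENGTH)
    mel = linear @ mel_filterbank(sr, N_FFT, N_MELS).T
    return {
        "audio": audio.astype(np.float32),
        "mel": mel.astype(np.float32),
        "linear": linear.astype(np.float32),
        "text": np.array(text),
    }


def save_example(path_or_file, example):
    """Write one utterance as a single .npz archive."""
    np.savez(path_or_file, **example)


# Stand-in for one decoded wav file: a one-second 440 Hz tone.
t = np.arange(SAMPLE_RATE) / SAMPLE_RATE
demo = build_example(0.5 * np.sin(2.0 * np.pi * 440.0 * t), "안녕하세요")

# A temporary directory is used here in place of Data/kss/.
out_path = os.path.join(tempfile.mkdtemp(), "sample.npz")
save_example(out_path, demo)
print(out_path, demo["linear"].shape, demo["mel"].shape)
```

Loading the archive back with `np.load(out_path)` gives access to the same four keys, so a training loop could read one file per utterance.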

Project progress

Four training runs were carried out in total.

Tacotron2 + GriffinLim + Single speaker
Tacotron2 + GriffinLim + Multi-speaker (Deep Voice 2)
Tacotron2 + MelGAN + Single speaker
Tacotron2 + MelGAN + Multi-speaker (transfer learning)

Results

Tacotron2 + GriffinLim + Multi-speaker (KSS + Yoo In-na), KSS data
- Alignment (50,000 steps)
Tacotron2 + GriffinLim + Multi-speaker (KSS + Yoo In-na), Yoo In-na data
- Alignment (90,000 steps)
Tacotron2 + MelGAN + Single speaker (KSS)
- Alignment (90,000 steps)
