Speech samples for "Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora"

Authors: Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa

Speech samples for the paper "Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora" which is submitted to INTERSPEECH 2019.

A pre-print version of this paper can be found at https://arxiv.org/abs/1904.00771

WaveNet vocoder

samples synthesized using WaveNet vocoder

1st sample

	SD	MU	EN	CO
XS01	► Play	► Play	► Play	► Play
XS02	► Play	► Play	► Play	► Play
S03	► Play	► Play	► Play	► Play
S04	► Play	► Play	► Play	► Play
S05	► Play	► Play	► Play	► Play
M06	► Play	► Play	► Play	► Play
M07	► Play	► Play	► Play	► Play
M08	► Play	► Play	► Play	► Play
L09	► Play	► Play	► Play	► Play
XL10	► Play	► Play	► Play	► Play

2nd sample

	SD	MU	EN	CO
XS01	► Play	► Play	► Play	► Play
XS02	► Play	► Play	► Play	► Play
S03	► Play	► Play	► Play	► Play
S04	► Play	► Play	► Play	► Play
S05	► Play	► Play	► Play	► Play
M06	► Play	► Play	► Play	► Play
M07	► Play	► Play	► Play	► Play
M08	► Play	► Play	► Play	► Play
L09	► Play	► Play	► Play	► Play
XL10	► Play	► Play	► Play	► Play

Acknowledgement

The synthetic speech samples were constructed using a Japanese multi-speaker speech database owned by KDDI Research