Speech samples for "Training a Neural Speech Waveform Model using Spectral Losses of Short-Time Fourier Transform and Continuous Wavelet Transform"
Authors: Shinji Takaki, Hirokazu Kameoka, Junichi Yamagishi Paper
1: Author of the danger trail, Philip Steels, etc.
NAT:
WORLD
STFT
STFT+CWT
CWT
AbS:
2: To my surprise he began to show actual enthusiasm in my favor.
NAT:
WORLD
STFT
STFT+CWT
CWT
AbS:
3: In a flash Philip followed its direction.
NAT:
WORLD
STFT
STFT+CWT
CWT
AbS:
4: Much, replied Jeanne, as tersely.
NAT:
WORLD
STFT
STFT+CWT
CWT
AbS:
5: I suppose you picked that lingo up among the Indians.
NAT:
WORLD
STFT
STFT+CWT
CWT
AbS:
Acknowledgement
WORLD: https://github.com/mmorise/World
These synthetic speech samples were constructed using the CMU Arctic database. The CMU_ARCTIC databases were constructed at the Language Technologies Institute at Carnegie Mellon University. See http://festvox.org/cmu_arctic/ for more details.