.. samples-xin documentation master file, created by
   sphinx-quickstart on Sun Apr 25 22:58:24 2021.
   You can adapt this file completely to your liking, but it should at least
   contain the root `toctree` directive.

.. _label-nsf-v4:

Cyclic-noise-NSF (CMU samples)
******************************

Messages
--------

* Paper link: Wang, X. & Yamagishi, J. Using Cyclic Noise as the Source Signal for Neural Source-Filter-Based Speech Waveform Model. In Proc. Interspeech, 1992–1996 (2020). `doi:10.21437/Interspeech.2020-1018 `__

* BibTeX::

    @inproceedings{wang2020cyclic,
      address = {ISCA},
      author = {Wang, Xin and Yamagishi, Junichi},
      booktitle = {Proc. Interspeech},
      doi = {10.21437/Interspeech.2020-1018},
      pages = {1992--1996},
      publisher = {ISCA},
      title = {{Using Cyclic Noise as the Source Signal for Neural Source-Filter-Based Speech Waveform Model}},
      url = {http://www.isca-speech.org/archive/Interspeech{\_}2020/abstracts/1018.html},
      year = {2020}
    }

* This page lists samples from the `CMU_ARCTIC database `_.

* The samples are copy-synthesis waveforms, i.e., waveforms generated from natural acoustic features. They were evaluated in the listening test.

* Results of the significance test can be found `here `_.

* Code is available. You need both the `CURRENNT toolkit `_ and `scripts `_. `This subfolder `_ in the script repository was made for this project.

* A new implementation based on PyTorch is also `available `_.

* Slides for the Interspeech 2020 presentation can be found on `this page `_. Alternatively, download the `PPT `__ or `PDF `__ directly.

|

Audio samples
-------------

All models were trained on the SLT, CLB, RMS, and BDL voices in a speaker-independent way.

It may take a few minutes to load all the speech samples. You can also download all samples from `this dropbox link `_.

SLT voice
=========

.. raw:: html

   <p>Utterances: slt_arctic_b0474.wav, slt_arctic_b0476.wav, slt_arctic_b0478.wav, slt_arctic_b0475.wav, slt_arctic_b0477.wav</p>
   <p>Systems: Nat, WaveNet, Sin, Pul, Rno, Cno\({}_{\beta_1}\), Cno\({}_{\beta_2}\), Cno\({}_{\beta_3}\), Cno\({}_{\beta_{tr}}\), Rno\({}_{no Lmask}\), Cno\({}_{\beta_2 no Lmask}\)</p>

CLB voice
=========

.. raw:: html

   <p>Utterances: clb_arctic_b0474.wav, clb_arctic_b0476.wav, clb_arctic_b0478.wav, clb_arctic_b0475.wav, clb_arctic_b0477.wav</p>
   <p>Systems: Nat, WaveNet, Sin, Pul, Rno, Cno\({}_{\beta_1}\), Cno\({}_{\beta_2}\), Cno\({}_{\beta_3}\), Cno\({}_{\beta_{tr}}\), Rno\({}_{no Lmask}\), Cno\({}_{\beta_2 no Lmask}\)</p>

BDL voice
=========

.. raw:: html

   <p>Utterances: bdl_arctic_b0474.wav, bdl_arctic_b0476.wav, bdl_arctic_b0478.wav, bdl_arctic_b0475.wav, bdl_arctic_b0477.wav</p>
   <p>Systems: Nat, WaveNet, Sin, Pul, Rno, Cno\({}_{\beta_1}\), Cno\({}_{\beta_2}\), Cno\({}_{\beta_3}\), Cno\({}_{\beta_{tr}}\), Rno\({}_{no Lmask}\), Cno\({}_{\beta_2 no Lmask}\)</p>

RMS voice
=========

.. raw:: html

   <p>Utterances: rms_arctic_b0474.wav, rms_arctic_b0476.wav, rms_arctic_b0478.wav, rms_arctic_b0475.wav, rms_arctic_b0477.wav</p>
   <p>Systems: Nat, WaveNet, Sin, Pul, Rno, Cno\({}_{\beta_1}\), Cno\({}_{\beta_2}\), Cno\({}_{\beta_3}\), Cno\({}_{\beta_{tr}}\), Rno\({}_{no Lmask}\), Cno\({}_{\beta_2 no Lmask}\)</p>


.. toctree::
   :hidden:
   :maxdepth: 1