Selected accepted publications and their samples/codes:
"The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance"
Lin Zhang, Xin Wang, Erica Cooper, Nicholas Evans, Junichi Yamagishi
IEEE/ACM Transactions on Audio Speech and Language Processing
Preprint
"Outlier-Aware Training for Improving Group Accuracy Disparities"
Li-Kuang Chen, Canasai Kruengkrai, Junichi Yamagishi
AACL-IJCNLP 2022 Student Research Workshop (SRW)
Preprint,
Codes,
Pre-trained models
"Investigating Active-learning-based Training Data Selection for Speech Spoofing Countermeasure"
Xin Wang, Junichi Yamagishi
The 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
Preprint
"Mitigating the Diminishing Effect of Elastic Weight Consolidation"
Canasai Kruengkrai, Junichi Yamagishi
COLING 2022
PDF,
Codes
"Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions"
Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko
Interspeech 2022
Preprint,
Samples
"The VoiceMOS Challenge 2022"
Wen-Chin Huang, Erica Cooper, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi
Interspeech 2022
Preprint,
CodaLab,
website
"DDS: A new device-degraded speech dataset for speech enhancement"
Haoyu Li, Junichi Yamagishi
Interspeech 2022
Preprint,
Database DDS database (DAPS portion).
DDS database (VCTK portion part 1).
DDS database (VCTK portion part 2)
"Spoofing-Aware Attention based ASV Back-end with Multiple Enrollment Utterances and a Sampling Strategy for the SASV Challenge 2022"
Chang Zeng, Lin Zhang, Meng Liu, Junichi Yamagishi
Interspeech 2022
Preprint
"Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models"
Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, N. Tomashenko
ISCA Speaker Odyssey Workshop 2022
Preprint,
Samples,
Codes
"Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation"
Hemlata Tak, Massimiliano Todisco, Xin Wang, Jee-weon Jung, Junichi Yamagishi, Nicholas Evans
ISCA Speaker Odyssey Workshop 2022
Preprint
"Investigating self-supervised front ends for speech spoofing countermeasures"
Xin Wang, Junichi Yamagishi
ISCA Speaker Odyssey Workshop 2022
Preprint
"Master Face Attacks on Face Recognition Systems"
Huy H. Nguyen, Sébastien Marcel, Junichi Yamagishi, Isao Echizen
IEEE Transactions on Biometrics, Behavior, and Identity Science
Paper
"The VoicePrivacy 2020 Challenge: Results and findings"
Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Jose Patino, Brij Mohan Lal Srivastava, Paul-Gauthier Noé, Andreas Nautsch, Nicholas Evans, Junichi Yamagishi, Benjamin O'Brien, Anaïs Chanclu, Jean-François Bonastre, Massimiliano Todisco, Mohamed Maouche
The Special Issue on Voice Privacy (Computer Speech and Language Journal - Elsevier)
Paper,
Preprint,
challenge website
"SVSNet: An End-to-end Speaker Voice Similarity Assessment Model"
Cheng-Hung Hu, Yu-Huai Peng, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang
IEEE Signal Processing Letters
Preprint
"Estimating the confidence of speech spoofing countermeasure"
Xin Wang, Junichi Yamagishi
ICASSP 2022
Preprint
"Generalization Ability of MOS Prediction Networks"
Erica Cooper, Wen-Chin Huang, Tomoki Toda, Junichi Yamagishi
ICASSP 2022
Preprint,
Codes
"On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis"
Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David Cox, James Glass
ICASSP 2022
Preprint,
Samples
"Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances"
Chang Zeng, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi
ICASSP 2022
Preprint,
Codes
"Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds"
Xuan Shi, Erica Cooper, Junichi Yamagishi
IEEE/ACM Transactions on Audio Speech and Language Processing
paper
"Optimizing Tandem Speaker Verification and Anti-Spoofing Systems"
Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi
IEEE/ACM Transactions on Audio Speech and Language Processing
paper
"Effectiveness of Detection-based and Regression-based Approaches for Estimating Mask-Wearing Ratio"
Khanh-Duy Nguyen, Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, Isao Echizen
The International Workshop on Face and Gesture Analysis for COVID-19 (FG4COVID19) held in conjunction with FG 2021
Preprint
"Revisiting Speech Content Privacy"
Jennifer Williams, Junichi Yamagishi, Paul-Gauthier Noe, Cassia Valentini Botinhao, Jean-Francois Bonastre
1st ISCA Symposium on Security and Privacy in Speech Communication
Preprint
"Benchmarking and challenges in security and privacy for voice biometrics"
Jean-Francois Bonastre, Héctor Delgado, Nicholas Evans, Tomi Kinnunen, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noé, Jose Patino, Md Sahidullah, Brij Mohan Lal Srivastava,
Paul-Gauthier Noé, Kong Aik Lee, Massimiliano Todisco, Natalia Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi
1st ISCA Symposium on Security and Privacy in Speech Communication
Preprint
"ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection"
Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas Evans, Héctor Delgado
The ASVspoof 2021 Workshop
Preprint challenge website
"Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection"
Lin Zhang, Xin Wang, Erica Cooper, Junichi Yamagishi
The ASVspoof 2021 Workshop
Preprint,
Database
"OpenForensics: Large-Scale Challenging Dataset For Multi-Face Forgery Detection And Segmentation In-The-Wild"
Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen
ICCV 2021
Preprint,
Database
"A Multi-Level Attention Model for Evidence-Based Fact Checking"
Canasai Kruengkrai, Junichi Yamagishi, Xin Wang
Findings of ACL 2021
Preprint,
Code
"How do Voices from Past Speech Synthesis Challenges Compare Today?"
Erica Cooper, Junichi Yamagishi
ISCA Speech Synthesis Workshop 2021
Preprint
"Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis"
Erica Cooper, Xin Wang, Junichi Yamagishi
ISCA Speech Synthesis Workshop 2021
Preprint,
Samples
"Exploring Disentanglement with Multilingual and Monolingual VQ-VAE"
Jennifer Williams, Jason Fong, Erica Cooper, Junichi Yamagishi
ISCA Speech Synthesis Workshop 2021
Preprint,
Samples
"Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement"
Haoyu Li, Junichi Yamagishi
IEEE/ACM Transactions on Audio Speech and Language Processing
Preprint,
Samples,
Codes
"An Initial Investigation for Detecting Partially Spoofed Audio"
Lin Zhang, Xin Wang, Erica Cooper, Junichi Yamagishi, Jose Patino, Nicholas Evans
Interspeech 2021
Preprint,
Samples,
Database
"A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection"
Xin Wang, Junich Yamagishi
Interspeech 2021
Preprint,
Codes
"ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech"
Andreas Nautsch, Xin Wang, Nicholas Evans, Tomi Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado, Md Sahidullah, Junichi Yamagishi, Kong Aik Lee
IEEE Transactions on Biometrics, Behavior, and Identity Science
Preprint
"End-to-End Text-to-Speech using Latent Duration based on VQ-VAE"
Yusuke Yasuda, Xin Wang, Junichi Yamagishi
ICASSP 2021
Preprint,
Samples
"Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm"
Jennifer Williams, Yi Zhao, Erica Cooper, Junichi Yamagishi
ICASSP 2021
Preprint,
Samples
"How Similar or Different Is Rakugo Speech Synthesizer to Professional Performers?"
Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Junichi Yamagishi
ICASSP 2021
Preprint
"Enhancing Low-Quality Voice Recordings Using Disentangled Channel Factor and Neural Waveform Model"
Haoyu Li, Yang Ai, Junichi Yamagishi
IEEE SLT 2021
Preprint,
Samples
"Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation"
Yang Ai, Haoyu Li, Xin Wang, Junichi Yamagishi, Zhenhua Ling
IEEE SLT 2021
Preprint,
Samples