Publications

(2021). Revisiting Speech Content Privacy. Proceedings of the Symposium of the Security & Privacy in Speech Communication.

PDF Cite

(2021). A Multi-Level Attention Model for Evidence-Based Fact Checking. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.

PDF Cite DOI

(2021). Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection. Proc. 2021 Automatic Speaker Verification and Spoofing Countermeasures Challenge (ASVspoof 2021 Workshop).

PDF Cite

(2021). Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021.

PDF Cite DOI

(2021). How Similar or Different is Rakugo Speech Synthesizer to Professional Performers?. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021.

PDF Cite DOI

(2021). Enhancing Low-Quality Voice Recordings Using Disentangled Channel Factor and Neural Waveform Model. IEEE Spoken Language Technology Workshop, SLT 2021, Shenzhen, China, January 19-22, 2021.

PDF Cite DOI

(2021). End-to-End anti-spoofing with RawNet2. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021.

PDF Cite DOI

(2021). Capsule-Forensics Networks for Deepfake Detection. Chapter 13, Handbook of Digital Face Manipulation and Detection - From DeepFakes to Morphing Attacks.

Cite

(2020). Viable Threat on News Reading: Generating Biased News Using Natural Language Models. Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science.

PDF Cite DOI

(2020). Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020.

PDF Cite DOI

(2020). Using Cyclic Noise as the Source Signal for Neural Source-Filter-Based Speech Waveform Model. Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020.

PDF Cite DOI

(2020). Transferring Neural Speech Waveform Synthesizers to Musical Instrument Sounds Generation. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020.

PDF Cite DOI

(2020). The Privacy ZEBRA: Zero Evidence Biometric Recognition Assessment. Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020.

PDF Cite DOI

(2020). Spoofing Attack Detection Using the Non-Linear Fusion of Sub-Band Classifiers. Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020.

PDF Cite DOI

(2020). Noise Tokens: Learning Neural Noise Templates for Environment-Aware Speech Enhancement. Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020.

PDF Cite DOI

(2020). Introducing the VoicePrivacy Initiative. Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020.

PDF Cite DOI

(2020). iMetricGAN: Intelligibility Enhancement for Speech-in-Noise Using Generative Adversarial Network-Based Metric Learning. Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020.

PDF Cite DOI

(2020). Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-Based Detection. Advanced Information Networking and Applications - Proceedings of the 34th International Conference on Advanced Information Networking and Applications, AINA-2020, Caserta, Italy, 15-17 April.

PDF Cite DOI

(2020). Generating Master Faces for Use in Performing Wolf Attacks on Face Recognition Systems. 2020 IEEE International Joint Conference on Biometrics, IJCB 2020, Houston, TX, USA, September 28 - October 1, 2020.

PDF Cite DOI

(2020). Design Choices for X-Vector Based Speaker Anonymization. Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020.

PDF Cite DOI

(2020). Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS?. Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020.

PDF Cite DOI

(2019). Neural Source-filter-based Waveform Model for Statistical Parametric Speech Synthesis. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2019, Brighton, United Kingdom, May 12-17, 2019.

PDF Cite DOI

(2019). Multi-task Learning for Detecting and Segmenting Manipulated Facial Images and Videos. 10th IEEE International Conference on Biometrics Theory, Applications and Systems, BTAS 2019, Tampa, FL, USA, September 23-26, 2019.

PDF Cite DOI

(2019). MOSNet: Deep Learning based Objective Assessment for Voice Conversion. Proc. Conference of the International Speech Communication Association 2019 (Interspeech 2019).

PDF Cite

(2019). Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet. Interspeech 2019, 20th Annual Conference of the International Speech Communication Association, Graz, Austria, 15-19 September 2019.

PDF Cite DOI

(2019). Investigation of Enhanced Tacotron Text-to-speech Synthesis Systems with Self-attention for Pitch Accent Language. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2019, Brighton, United Kingdom, May 12-17, 2019.

PDF Cite DOI

(2019). Capsule-forensics: Using Capsule Networks to Detect Forged Images and Videos. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2019, Brighton, United Kingdom, May 12-17, 2019.

PDF Cite DOI

(2019). Bootstrapping Non-Parallel Voice Conversion from Speaker-Adaptive Text-to-Speech. IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019, Singapore, December 14-18, 2019.

PDF Cite DOI

(2019). ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. Interspeech 2019, 20th Annual Conference of the International Speech Communication Association, Graz, Austria, 15-19 September 2019.

PDF Cite DOI