Audio Samples from the paper _Mitigating Language Mismatch in SSL-Based Speaker Anonymization, Interspeech 2025
By Zhe Zhang1, Wen-Chin Huang2, Xin Wang1, Xiaoxiao Miao3, Junichi Yamagishi1
1 National Institute of Informatics, Japan
2 Nagoya University, Japan
3 Duke Kunshan University, China
This page provides audio samples from our speaker anonymization experiments. Samples are in two languages:
For each utterance, we first list:
Then, the table below shows the results for the three SSL-based methods:
The methods are grouped by speaker anonimizers or resynthesis:
Original:
VPC Baseline B2 (McAdams):
Method Group | Resynthesis | Selection | OHNN |
---|---|---|---|
HU-EN | |||
HU-JA | |||
mHU-JA |
Original:
VPC Baseline B2 (McAdams):
Method Group | Resynthesis | Selection | OHNN |
---|---|---|---|
HU-EN | |||
HU-JA | |||
mHU-JA |
Original:
VPC Baseline B2 (McAdams):
Method Group | Resynthesis | Selection | OHNN |
---|---|---|---|
HU-EN | |||
HU-JA | |||
mHU-JA |
Original:
VPC Baseline B2 (McAdams):
Method Group | Resynthesis | Selection | OHNN |
---|---|---|---|
HU-EN | |||
HU-JA | |||
mHU-JA |
Original:
VPC Baseline B2 (McAdams):
Method Group | Resynthesis | Selection | OHNN |
---|---|---|---|
HU-EN | |||
HU-JA | |||
mHU-JA |
Original:
VPC Baseline B2 (McAdams):
Method Group | Resynthesis | Selection | OHNN |
---|---|---|---|
HU-EN | |||
HU-JA | |||
mHU-JA |