Speaker Anonymization Using X-vector and Neural Waveform Models

Fuming Fang, Xin Wang, Junichi Yamagishi, Isao Echizen, Massimiliano Todisco, Nicholas Evans, Jean-François Bonastre
preprint paper

Female speaker:
Utterance: I can't believe the buzz.
Natural:

Dissimilarity score PPG: 6th sigmoid PPG: softmax
Copy synthesis
0.2
0.4
0.6


Male speaker:
Utterance: Military chiefs were playing it straight.
Natural:

Dissimilarity score PPG: 6th sigmoid PPG: softmax
Copy synthesis
0.2
0.4
0.6