Skip to main content

SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET

Publication ,  Conference
Miao, X; Wang, X; Cooper, E; Yamagishi, J; Evans, N; Todisco, M; Bonastre, JF; Rouvier, M
Published in: ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings
January 1, 2024

The success of deep learning in speaker recognition relies heavily on the use of large datasets. However, the data-hungry nature of deep learning methods has already being questioned on account the ethical, privacy, and legal concerns that arise when using large-scale datasets of natural speech collected from real human speakers. For example, the widely-used VoxCeleb2 dataset for speaker recognition is no longer accessible from the official website. To mitigate these concerns, this work presents an initiative to generate a privacy-friendly synthetic VoxCeleb2 dataset that ensures the quality of the generated speech in terms of privacy, utility, and fairness. We also discuss the challenges of using synthetic data for the downstream task of speaker verification.

Duke Scholars

Published In

ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings

DOI

ISSN

1520-6149

Publication Date

January 1, 2024

Start / End Page

11421 / 11425
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Miao, X., Wang, X., Cooper, E., Yamagishi, J., Evans, N., Todisco, M., … Rouvier, M. (2024). SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET. In ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (pp. 11421–11425). https://doi.org/10.1109/ICASSP48485.2024.10446513
Miao, X., X. Wang, E. Cooper, J. Yamagishi, N. Evans, M. Todisco, J. F. Bonastre, and M. Rouvier. “SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET.” In ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 11421–25, 2024. https://doi.org/10.1109/ICASSP48485.2024.10446513.
Miao X, Wang X, Cooper E, Yamagishi J, Evans N, Todisco M, et al. SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET. In: ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings. 2024. p. 11421–5.
Miao, X., et al. “SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET.” ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 2024, pp. 11421–25. Scopus, doi:10.1109/ICASSP48485.2024.10446513.
Miao X, Wang X, Cooper E, Yamagishi J, Evans N, Todisco M, Bonastre JF, Rouvier M. SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings. 2024. p. 11421–11425.

Published In

ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings

DOI

ISSN

1520-6149

Publication Date

January 1, 2024

Start / End Page

11421 / 11425