Scholars@Duke publication: SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET

SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET

Publication , Conference

Miao, X; Wang, X; Cooper, E; Yamagishi, J; Evans, N; Todisco, M; Bonastre, JF; Rouvier, M

Published in: ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings

January 1, 2024

The success of deep learning in speaker recognition relies heavily on the use of large datasets. However, the data-hungry nature of deep learning methods has already being questioned on account the ethical, privacy, and legal concerns that arise when using large-scale datasets of natural speech collected from real human speakers. For example, the widely-used VoxCeleb2 dataset for speaker recognition is no longer accessible from the official website. To mitigate these concerns, this work presents an initiative to generate a privacy-friendly synthetic VoxCeleb2 dataset that ensures the quality of the generated speech in terms of privacy, utility, and fairness. We also discuss the challenges of using synthetic data for the downstream task of speaker verification.

Duke Scholars

Author Xiaoxiao Miao DKU Faculty

Published In

ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings

DOI

10.1109/ICASSP48485.2024.10446513

ISSN

1520-6149

Publication Date

January 1, 2024

Start / End Page

11421 / 11425

Citation

APA

Chicago

ICMJE

MLA

NLM

Miao, X., Wang, X., Cooper, E., Yamagishi, J., Evans, N., Todisco, M., … Rouvier, M. (2024). SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET. In ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (pp. 11421–11425). https://doi.org/10.1109/ICASSP48485.2024.10446513

Miao, X., X. Wang, E. Cooper, J. Yamagishi, N. Evans, M. Todisco, J. F. Bonastre, and M. Rouvier. “SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET.” In ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 11421–25, 2024. https://doi.org/10.1109/ICASSP48485.2024.10446513.

Miao X, Wang X, Cooper E, Yamagishi J, Evans N, Todisco M, et al. SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET. In: ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings. 2024. p. 11421–5.

Miao, X., et al. “SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET.” ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 2024, pp. 11421–25. Scopus, doi:10.1109/ICASSP48485.2024.10446513.

Miao X, Wang X, Cooper E, Yamagishi J, Evans N, Todisco M, Bonastre JF, Rouvier M. SYNVOX2: TOWARDS A PRIVACY-FRIENDLY VOXCELEB2 DATASET. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings. 2024. p. 11421–11425.

Published In

ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings

DOI

10.1109/ICASSP48485.2024.10446513

ISSN

1520-6149

Publication Date

January 1, 2024

Start / End Page

11421 / 11425