Skip to main content

Multi-Objective Progressive Clustering for Semi-Supervised Domain Adaptation in Speaker Verification

Publication ,  Conference
Li, Z; Lin, Y; Jiang, N; Qin, X; Zhao, G; Wu, H; Li, M
Published in: ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings
January 1, 2024

Utilizing the pseudo-labeling algorithm with large-scale unlabeled data becomes crucial for semi-supervised domain adaptation in speaker verification tasks. In this paper, we propose a novel pseudo-labeling method named Multi-objective Progressive Clustering (MoPC), specifically designed for semi-supervised domain adaptation. Firstly, we utilize limited labeled data from the target domain to derive domain-specific descriptors based on multiple distinct objectives, namely within-graph denoising, intra-class denoising and inter-class denoising. Then, the Infomap algorithm is adopted for embedding clustering, and the descriptors are leveraged to further refine the target domain's pseudo-labels. Moreover, to further improve the quality of pseudo labels, we introduce the subcenter-purification and progressive-merging strategy for label denoising. Our proposed MoPC method achieves 4.95% EER and ranked the 1st place on the evaluation set of VoxSRC 2023 track 3. We also conduct additional experiments on the FFSVC dataset and yield promising results.

Duke Scholars

Published In

ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings

DOI

ISSN

1520-6149

Publication Date

January 1, 2024

Start / End Page

12236 / 12240
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Li, Z., Lin, Y., Jiang, N., Qin, X., Zhao, G., Wu, H., & Li, M. (2024). Multi-Objective Progressive Clustering for Semi-Supervised Domain Adaptation in Speaker Verification. In ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (pp. 12236–12240). https://doi.org/10.1109/ICASSP48485.2024.10447138
Li, Z., Y. Lin, N. Jiang, X. Qin, G. Zhao, H. Wu, and M. Li. “Multi-Objective Progressive Clustering for Semi-Supervised Domain Adaptation in Speaker Verification.” In ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 12236–40, 2024. https://doi.org/10.1109/ICASSP48485.2024.10447138.
Li Z, Lin Y, Jiang N, Qin X, Zhao G, Wu H, et al. Multi-Objective Progressive Clustering for Semi-Supervised Domain Adaptation in Speaker Verification. In: ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings. 2024. p. 12236–40.
Li, Z., et al. “Multi-Objective Progressive Clustering for Semi-Supervised Domain Adaptation in Speaker Verification.” ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 2024, pp. 12236–40. Scopus, doi:10.1109/ICASSP48485.2024.10447138.
Li Z, Lin Y, Jiang N, Qin X, Zhao G, Wu H, Li M. Multi-Objective Progressive Clustering for Semi-Supervised Domain Adaptation in Speaker Verification. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings. 2024. p. 12236–12240.

Published In

ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings

DOI

ISSN

1520-6149

Publication Date

January 1, 2024

Start / End Page

12236 / 12240