SIMPLE ATTENTION MODULE BASED SPEAKER VERIFICATION WITH ITERATIVE NOISY LABEL DETECTION

Publication, Conference
Qin, X; Li, N; Weng, C; Su, D; Li, M
Published in: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
January 1, 2022

Recently, attention mechanisms such as the squeeze-and-excitation (SE) module and the convolutional block attention module (CBAM) have achieved great success in deep learning-based speaker verification systems. This paper introduces a simple yet effective alternative, the simple attention module (SimAM), for speaker verification. SimAM is a plug-and-play module that adds no extra model parameters. In addition, we propose a noisy label detection method that iteratively filters samples with noisy labels out of the training data, since a large-scale dataset labeled by human annotators or automated processes may contain noisy labels. An over-parameterized deep neural network (DNN) can memorize such noisy labels, which results in a performance drop. Experiments are conducted on the VoxCeleb dataset. The speaker verification model with SimAM achieves an equal error rate (EER) of 0.675% on the VoxCeleb1 original test trials. The proposed iterative noisy label detection method further reduces the EER to 0.643%.
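
The abstract does not spell out how SimAM computes its attention weights. As a rough illustration only, the sketch below follows the energy-based weighting described in the original SimAM paper (Yang et al., 2021) rather than anything stated on this page: each activation is scaled by a sigmoid of its normalized squared deviation from the channel mean, so no learnable parameters are involved. The function name `simam`, the nested-list layout `[channels][height][width]`, and the default `e_lambda=1e-4` are assumptions made for this example, not the authors' implementation.

```python
import math


def simam(feature_map, e_lambda=1e-4):
    """SimAM-style reweighting of a feature map laid out as [channels][height][width].

    Hypothetical helper for illustration; it has no trainable parameters.
    e_lambda is a small regularizer (assumed default).
    """
    output = []
    for channel in feature_map:
        values = [v for row in channel for v in row]
        n = len(values) - 1  # number of "other" positions in the channel
        if n < 1:
            raise ValueError("each channel needs at least two positions")
        mu = sum(values) / len(values)
        # Squared deviation of every position from the channel mean.
        sq_dev = [[(v - mu) ** 2 for v in row] for row in channel]
        variance = sum(d for row in sq_dev for d in row) / n
        denom = 4.0 * (variance + e_lambda)
        # Inverse energy d / denom + 0.5 is passed through a sigmoid and
        # used as a per-position weight on the original activation.
        output.append([
            [v / (1.0 + math.exp(-(d / denom + 0.5))) for v, d in zip(row, d_row)]
            for row, d_row in zip(channel, sq_dev)
        ])
    return output


# Usage: one channel with a 2x2 map; the outlier 10.0 receives the largest weight.
weighted = simam([[[1.0, 2.0], [3.0, 10.0]]])
```

In a speaker verification network such a function would typically sit inside each residual block, applied to the block's convolutional output before the skip connection is added; that placement is likewise an assumption here.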

Published In

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

DOI

10.1109/ICASSP43922.2022.9746294

ISSN

1520-6149

Publication Date

January 1, 2022

Volume

2022-May

Start / End Page

6722 / 6726
 

Citation

APA
Qin, X., Li, N., Weng, C., Su, D., & Li, M. (2022). SIMPLE ATTENTION MODULE BASED SPEAKER VERIFICATION WITH ITERATIVE NOISY LABEL DETECTION. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 2022-May, pp. 6722–6726). https://doi.org/10.1109/ICASSP43922.2022.9746294

Chicago
Qin, X., N. Li, C. Weng, D. Su, and M. Li. “SIMPLE ATTENTION MODULE BASED SPEAKER VERIFICATION WITH ITERATIVE NOISY LABEL DETECTION.” In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2022-May:6722–26, 2022. https://doi.org/10.1109/ICASSP43922.2022.9746294.

ICMJE
Qin X, Li N, Weng C, Su D, Li M. SIMPLE ATTENTION MODULE BASED SPEAKER VERIFICATION WITH ITERATIVE NOISY LABEL DETECTION. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2022. p. 6722–6.

MLA
Qin, X., et al. “SIMPLE ATTENTION MODULE BASED SPEAKER VERIFICATION WITH ITERATIVE NOISY LABEL DETECTION.” ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, vol. 2022-May, 2022, pp. 6722–26. Scopus, doi:10.1109/ICASSP43922.2022.9746294.

NLM
Qin X, Li N, Weng C, Su D, Li M. SIMPLE ATTENTION MODULE BASED SPEAKER VERIFICATION WITH ITERATIVE NOISY LABEL DETECTION. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2022. p. 6722–6726.
