Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification

Publication, Conference
Lin, Y; Qin, X; Jiang, N; Zhao, G; Li, M
Published in: 2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023
January 1, 2023

It is widely acknowledged that discriminative representations for speaker verification can be extracted from verbal speech. However, how much speaker information non-verbal vocalizations carry remains unclear. This paper explores speaker verification based on the most ubiquitous form of non-verbal voice, laughter. First, we use a semi-automatic pipeline to collect a new dataset, Haha-Pod, from open-source podcast media. The dataset contains laughter clips from over 240 speakers, with corresponding high-quality verbal speech. Second, we propose a Two-Stage Teacher-Student (2S-TS) framework to minimize the within-speaker embedding distance between verbal and non-verbal (laughter) signals. Using Haha-Pod as a test set, we design two trial sets (S2L-Eval) to verify a speaker's identity from laugh sounds. Experimental results demonstrate that our method significantly improves performance on the S2L-Eval trial sets with only a minor degradation on the VoxCeleb1 test set. The resources for the Haha-Pod dataset can be found at https://github.com/nevermoreLin/HahaPod.
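The abstract does not specify the distance measure or the scoring rule, so the following is only a minimal sketch of the general idea, assuming cosine distance between fixed-length speaker embeddings. The function names (`within_speaker_distance`, `accept_trial`) and the threshold value are hypothetical and are not taken from the paper or its repository. It shows a within-speaker distance between paired speech and laughter embeddings, which a teacher-student setup of this kind would try to reduce, and a speech-enrollment versus laughter-test trial decision.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def within_speaker_distance(speech_embs, laugh_embs):
    """Mean cosine distance over paired (speech, laughter) embeddings.

    Assumption: pair i holds one speech and one laughter embedding
    from the same speaker. A lower value means the two signal types
    land closer together in the embedding space.
    """
    distances = [
        1.0 - cosine_similarity(s, l)
        for s, l in zip(speech_embs, laugh_embs)
    ]
    return sum(distances) / len(distances)


def accept_trial(enroll_emb, test_emb, threshold=0.5):
    """Accept a trial if the cosine score reaches the threshold.

    Assumption: enroll_emb comes from verbal speech and test_emb from
    laughter; the threshold of 0.5 is an arbitrary illustrative value.
    """
    return cosine_similarity(enroll_emb, test_emb) >= threshold
```

In practice the embeddings would come from a trained speaker encoder and the threshold would be tuned on held-out trials; both are outside the scope of this sketch.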

Published In

2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023

DOI

10.1109/ASRU57964.2023.10389664

Publication Date

January 1, 2023
 

Citation

APA
Lin, Y., Qin, X., Jiang, N., Zhao, G., & Li, M. (2023). Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification. In 2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023. https://doi.org/10.1109/ASRU57964.2023.10389664

Chicago
Lin, Y., X. Qin, N. Jiang, G. Zhao, and M. Li. “Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification.” In 2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023, 2023. https://doi.org/10.1109/ASRU57964.2023.10389664.

ICMJE
Lin Y, Qin X, Jiang N, Zhao G, Li M. Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification. In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023. 2023.

MLA
Lin, Y., et al. “Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification.” 2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023, 2023. Scopus, doi:10.1109/ASRU57964.2023.10389664.

NLM
Lin Y, Qin X, Jiang N, Zhao G, Li M. Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification. 2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023. 2023.
