Scholars@Duke publication: Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification

Publication, Conference

Lin, Y; Qin, X; Jiang, N; Zhao, G; Li, M

Published in: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023)

January 1, 2023

It is widely acknowledged that discriminative representations for speaker verification can be extracted from verbal speech. However, how much speaker information non-verbal vocalization carries remains unclear. This paper explores speaker verification based on the most ubiquitous form of non-verbal voice, laughter. First, we use a semi-automatic pipeline to collect a new dataset, Haha-Pod, from open-source podcast media. The dataset contains laughter clips from over 240 speakers, with corresponding high-quality verbal speech. Second, we propose a Two-Stage Teacher-Student (2S-TS) framework to minimize the within-speaker embedding distance between verbal and non-verbal (laughter) signals. Using Haha-Pod as a test set, we design two trial sets (S2L-Eval) to verify a speaker's identity from laughter. Experimental results demonstrate that our method significantly improves performance on the S2L-Eval test set with only a minor degradation on the VoxCeleb1 test set. The resources for the Haha-Pod dataset can be found at https://github.com/nevermoreLin/HahaPod.
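The abstract does not state which distance or loss the 2S-TS framework uses. As a rough illustration of the general idea of pulling a speaker's laughter embedding toward the same speaker's speech embedding, the sketch below assumes cosine distance and a simple mean over paired embeddings. The function names and the toy vectors are hypothetical and are not taken from the paper or its repository.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def within_speaker_loss(speech_embs, laughter_embs):
    """Mean cosine distance over paired embeddings.

    Assumption: speech_embs[i] (e.g. from a frozen teacher) and
    laughter_embs[i] (e.g. from a trainable student) belong to the
    same speaker. A smaller value means the pairs are closer together.
    """
    pairs = list(zip(speech_embs, laughter_embs))
    return sum(cosine_distance(s, l) for s, l in pairs) / len(pairs)


# Toy example: the first pair is aligned, the second is orthogonal.
speech = [[1.0, 0.0], [0.0, 1.0]]
laughter = [[1.0, 0.0], [1.0, 0.0]]
loss = within_speaker_loss(speech, laughter)
```

In a real training setup this quantity would be computed on neural network outputs and minimized by gradient descent; the pure-Python version here only shows what is being measured.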

Duke Scholars

Author Ming Li DKU Faculty

Published In

2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023)

DOI

10.1109/ASRU57964.2023.10389664

Publication Date

January 1, 2023

Citation

APA

Lin, Y., Qin, X., Jiang, N., Zhao, G., & Li, M. (2023). Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification. In 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023). https://doi.org/10.1109/ASRU57964.2023.10389664

Chicago

Lin, Y., X. Qin, N. Jiang, G. Zhao, and M. Li. “Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification.” In 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023), 2023. https://doi.org/10.1109/ASRU57964.2023.10389664.

ICMJE

Lin Y, Qin X, Jiang N, Zhao G, Li M. Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification. In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023). 2023.

MLA

Lin, Y., et al. “Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification.” 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023), 2023. Scopus, doi:10.1109/ASRU57964.2023.10389664.

NLM

Lin Y, Qin X, Jiang N, Zhao G, Li M. Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification. 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023). 2023.
