Skip to main content

Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection

Publication ,  Conference
Ma, M; Wu, H; Wang, X; Yang, L; Wang, J; Li, M
Published in: 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021
January 24, 2021

In this paper, we propose a deep convolutional neural network-based acoustic word embedding system for code-switching query by example spoken term detection. Different from previous configurations, we combine audio data in two languages for training instead of only using one single language. We trans-form the acoustic features of keyword templates and searching content segments obtained in a sliding manner to fixed-dimensional vectors and calculate the distances between them. An auxiliary variability-invariant loss is also applied to training data within the same word but different speakers. This strategy is used to prevent the extractor from encoding undesired speaker- or accent-related information into the acoustic word embeddings. Experimental results show that our proposed sys-tem produces promising searching results in the code-switching test scenario. With the employment of variability-invariant loss, the searching performance is further enhanced.

Duke Scholars

Published In

2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021

DOI

Publication Date

January 24, 2021
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Ma, M., Wu, H., Wang, X., Yang, L., Wang, J., & Li, M. (2021). Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection. In 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021. https://doi.org/10.1109/ISCSLP49672.2021.9362056
Ma, M., H. Wu, X. Wang, L. Yang, J. Wang, and M. Li. “Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection.” In 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021, 2021. https://doi.org/10.1109/ISCSLP49672.2021.9362056.
Ma M, Wu H, Wang X, Yang L, Wang J, Li M. Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection. In: 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021. 2021.
Ma, M., et al. “Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection.” 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021, 2021. Scopus, doi:10.1109/ISCSLP49672.2021.9362056.
Ma M, Wu H, Wang X, Yang L, Wang J, Li M. Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection. 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021. 2021.

Published In

2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021

DOI

Publication Date

January 24, 2021