Skip to main content

Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances

Publication ,  Conference
Wu, H; Li, M; Cai, Z; Zhong, H
Published in: 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings
July 2, 2018

In the task of the unsupervised query by example spoken term detection (QbE-STD), we concatenate the features extracted by a Self-Organizing Map (SOM) and features learned by an unsupervised GMM based model at the feature level to enhance the performance. More specifically, The SOM features are represented by the distances between the current feature vector and the weight vectors of SOM neurons learned in an unsupervised manner. After fetching these features, we apply sub-sequence Dynamic Time Warping (S-DTW) to detect the occurrences of keywords in the test data. We evaluate the performance of these features on the TIMIT English database. After concatenating the SOM features and the GMM based features together, we achieve an improvement of 7.77% and 7.74% on Mean Average Precision (MAP) and P@10 on average.

Duke Scholars

Published In

2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings

DOI

Publication Date

July 2, 2018

Start / End Page

245 / 249
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Wu, H., Li, M., Cai, Z., & Zhong, H. (2018). Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances. In 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings (pp. 245–249). https://doi.org/10.1109/ISCSLP.2018.8706580
Wu, H., M. Li, Z. Cai, and H. Zhong. “Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances.” In 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, 245–49, 2018. https://doi.org/10.1109/ISCSLP.2018.8706580.
Wu H, Li M, Cai Z, Zhong H. Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances. In: 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings. 2018. p. 245–9.
Wu, H., et al. “Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances.” 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, 2018, pp. 245–49. Scopus, doi:10.1109/ISCSLP.2018.8706580.
Wu H, Li M, Cai Z, Zhong H. Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances. 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings. 2018. p. 245–249.

Published In

2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings

DOI

Publication Date

July 2, 2018

Start / End Page

245 / 249