Skip to main content

The DKU-JNU-EMA electromagnetic articulography database on Mandarin and Chinese dialects with tandem feature based acoustic-to-articulatory inversion

Publication ,  Conference
Cai, Z; Qin, X; Cai, D; Li, M; Liu, X; Zhong, H
Published in: 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings
July 2, 2018

This paper presents the acquisition of the Duke Kunshan University Jinan University Electromagnetic Articulography (DKU-JNU-EMA) database in terms of aligned acoustics and articulatory data on Mandarin and Chinese dialects. This database currently includes data from multiple individuals in Mandarin and three Chinese dialects, namely Cantonese, Hakka, Teochew. There are 2-7 native speakers for each language or dialect. Acoustic data is obtained by one head-mounted close talk microphone while articulatory data is obtained by the NDI electromagnetic articulography wave research system. The DKU-JNU-EMA database is now in preparation for public release to help advance research in areas of acoustic-to-articulatory inversion, speech production, dialect recognition, and experimental phonetics. Along with the database, we propose an acoustic-to-articulatory inversion baseline using deep neural networks. Moreover, we show that by concatenating the dimension reduced phoneme posterior probability feature with MFCC features at the feature level as tandem feature, the inversion system performance is enhanced.

Duke Scholars

Published In

2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings

DOI

Publication Date

July 2, 2018

Start / End Page

235 / 239
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Cai, Z., Qin, X., Cai, D., Li, M., Liu, X., & Zhong, H. (2018). The DKU-JNU-EMA electromagnetic articulography database on Mandarin and Chinese dialects with tandem feature based acoustic-to-articulatory inversion. In 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings (pp. 235–239). https://doi.org/10.1109/ISCSLP.2018.8706629
Cai, Z., X. Qin, D. Cai, M. Li, X. Liu, and H. Zhong. “The DKU-JNU-EMA electromagnetic articulography database on Mandarin and Chinese dialects with tandem feature based acoustic-to-articulatory inversion.” In 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, 235–39, 2018. https://doi.org/10.1109/ISCSLP.2018.8706629.
Cai Z, Qin X, Cai D, Li M, Liu X, Zhong H. The DKU-JNU-EMA electromagnetic articulography database on Mandarin and Chinese dialects with tandem feature based acoustic-to-articulatory inversion. In: 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings. 2018. p. 235–9.
Cai, Z., et al. “The DKU-JNU-EMA electromagnetic articulography database on Mandarin and Chinese dialects with tandem feature based acoustic-to-articulatory inversion.” 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, 2018, pp. 235–39. Scopus, doi:10.1109/ISCSLP.2018.8706629.
Cai Z, Qin X, Cai D, Li M, Liu X, Zhong H. The DKU-JNU-EMA electromagnetic articulography database on Mandarin and Chinese dialects with tandem feature based acoustic-to-articulatory inversion. 2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings. 2018. p. 235–239.

Published In

2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings

DOI

Publication Date

July 2, 2018

Start / End Page

235 / 239