Scholars@Duke publication: Automatic emotional spoken language text corpus construction from written dialogs in fictions

Automatic emotional spoken language text corpus construction from written dialogs in fictions

Publication , Conference

Chen, J; Liu, C; Li, M

Published in: 2017 7th International Conference on Affective Computing and Intelligent Interaction Acii 2017

July 2, 2017

In this paper, we propose a novel method to automatically construct emotional spoken language text corpus from written dialogs, and release a large scale Chinese emotional text dataset with short conversations extracted from thousands of fictions using the proposed method. The emotional spoken language transcript resources in Chinese are relatively limited. However, constructing a large scale supervised corpus manually is neither efficient nor low-cost. This motivates us to try alternative efficient and effective approaches. First, we build a small scale emotion dictionary manually instead of a large scale corpus. Each word in dictionary has an emotion tag. Then, we use the emotional words to search emotional dialogs heuristically in fictions and classify them automatically. Second, we share our work to boost the performance of emotion recognition on spoken languages using the proposed new database. The labeled dialogs can be used for supervised learning while the unlabeled ones provide better word embeddings for the semantic level emotion recognition. We use the dialogs corpus as an auxiliary dataset in speech emotion recognition. We carry out experiments on automatic speech recognition (ASR) generated texts from the speech signals in Chinese Natural Emotional Audio-Visual Database (CHEAVD). It is an eight emotion states recognition task. We obtain a baseline average macro precision (MAP) of 37.08% and accuracy of 31.13% in terms of text-based method. With the labeled dialogs to pre-train neural networks and over-sampling the minority classes, we achieve an optimized MAP of 47.50% and the accuracy of 43.91%, which outperforms the baseline by 10.42% and 12.78% respectively.

Duke Scholars

Author Ming Li DKU Faculty

Published In

2017 7th International Conference on Affective Computing and Intelligent Interaction Acii 2017

DOI

10.1109/ACII.2017.8273619

Publication Date

July 2, 2017

Volume

2018-January

Start / End Page

319 / 324

Citation

APA

Chicago

ICMJE

MLA

NLM

Chen, J., Liu, C., & Li, M. (2017). Automatic emotional spoken language text corpus construction from written dialogs in fictions. In 2017 7th International Conference on Affective Computing and Intelligent Interaction Acii 2017 (Vol. 2018-January, pp. 319–324). https://doi.org/10.1109/ACII.2017.8273619

Chen, J., C. Liu, and M. Li. “Automatic emotional spoken language text corpus construction from written dialogs in fictions.” In 2017 7th International Conference on Affective Computing and Intelligent Interaction Acii 2017, 2018-January:319–24, 2017. https://doi.org/10.1109/ACII.2017.8273619.

Chen J, Liu C, Li M. Automatic emotional spoken language text corpus construction from written dialogs in fictions. In: 2017 7th International Conference on Affective Computing and Intelligent Interaction Acii 2017. 2017. p. 319–24.

Chen, J., et al. “Automatic emotional spoken language text corpus construction from written dialogs in fictions.” 2017 7th International Conference on Affective Computing and Intelligent Interaction Acii 2017, vol. 2018-January, 2017, pp. 319–24. Scopus, doi:10.1109/ACII.2017.8273619.

Chen J, Liu C, Li M. Automatic emotional spoken language text corpus construction from written dialogs in fictions. 2017 7th International Conference on Affective Computing and Intelligent Interaction Acii 2017. 2017. p. 319–324.

Published In

2017 7th International Conference on Affective Computing and Intelligent Interaction Acii 2017

DOI

10.1109/ACII.2017.8273619

Publication Date

July 2, 2017

Volume

2018-January

Start / End Page

319 / 324