The SYSU system for CCPR 2016 multimodal emotion recognition challenge
In this paper, we propose a multimodal emotion recognition system that combines information from facial, text, and speech data. First, we propose a residual network architecture within the convolutional neural network (CNN) framework to improve facial expression recognition performance, and we perform video frame selection to fine-tune our pre-trained model. Second, whereas text emotion recognition conventionally deals with clean, error-free texts, here we adopt an automatic speech recognition (ASR) engine to transcribe the speech into text and then apply a Support Vector Machine (SVM) on top of bag-of-words (BoW) features to predict the emotion labels. Third, we extract openSMILE-based utterance-level features and MFCC-GMM-based zero-order statistics features for subsequent SVM modeling in the speech-based subsystem. Finally, score-level fusion is used to combine the multimodal information. Experiments were carried out on the CCPR 2016 Multimodal Emotion Recognition Challenge database; our proposed multimodal system achieved 36% macro-average precision on the test set, outperforming the baseline by 6% absolute.
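To illustrate the score-level fusion step, the following is a minimal sketch of fusing per-class scores from the three subsystems by a weighted sum and taking the arg-max class. The scores and fusion weights are invented for illustration only; the actual weights and score normalization used in the system are not specified here.

```python
import numpy as np

# Hypothetical per-utterance class scores from the three subsystems
# (rows = utterances, columns = emotion classes); values are illustrative.
face_scores = np.array([[0.6, 0.3, 0.1],
                        [0.2, 0.5, 0.3]])
text_scores = np.array([[0.5, 0.4, 0.1],
                        [0.1, 0.7, 0.2]])
speech_scores = np.array([[0.7, 0.2, 0.1],
                          [0.3, 0.3, 0.4]])

# Assumed fusion weights (not from the paper); chosen to sum to 1.
weights = {"face": 0.4, "text": 0.2, "speech": 0.4}

# Score-level fusion: weighted sum of subsystem scores per class.
fused = (weights["face"] * face_scores
         + weights["text"] * text_scores
         + weights["speech"] * speech_scores)

# Predicted emotion class index for each utterance.
predictions = fused.argmax(axis=1)
print(predictions.tolist())  # → [0, 1]
```

In practice the fusion weights would be tuned on a held-out development set, and each subsystem's scores would typically be normalized (e.g. to posterior probabilities) before combination.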