Skip to main content

Speaker personality classification using systems based on acoustic-lexical cues and an optimal tree-structured Bayesian network

Publication ,  Conference
Audhkhasi, K; Metallinou, A; Li, M; Narayanan, SS
Published in: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
December 1, 2012

Automatic classification of human personality along the Big Five dimensions is an interesting problem with several practical applications. This paper makes some contributions in this regard. First, we propose a few automatically- derived personality-discriminating lexical features which provide information complementary to the conventional acoustic-prosodic cues. We also design a frame-level Gaussian mixture model based system which adds complimentary information to the systems trained on global statistical functionals. Next, we note that the Big Five dimensions are correlated and thus model the dependency between these dimensions in the form of an optimal tree-structured Bayesian network. Our final sub-system consists of within class covariance normalization followed by L1-regularized logistic regression. Fusion of all these sub-systems achieves better classification performance than independently trained classifiers using just acoustic features.

Duke Scholars

Published In

13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

Publication Date

December 1, 2012

Volume

1

Start / End Page

262 / 265
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Audhkhasi, K., Metallinou, A., Li, M., & Narayanan, S. S. (2012). Speaker personality classification using systems based on acoustic-lexical cues and an optimal tree-structured Bayesian network. In 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 (Vol. 1, pp. 262–265).
Audhkhasi, K., A. Metallinou, M. Li, and S. S. Narayanan. “Speaker personality classification using systems based on acoustic-lexical cues and an optimal tree-structured Bayesian network.” In 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, 1:262–65, 2012.
Audhkhasi K, Metallinou A, Li M, Narayanan SS. Speaker personality classification using systems based on acoustic-lexical cues and an optimal tree-structured Bayesian network. In: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. 2012. p. 262–5.
Audhkhasi, K., et al. “Speaker personality classification using systems based on acoustic-lexical cues and an optimal tree-structured Bayesian network.” 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, vol. 1, 2012, pp. 262–65.
Audhkhasi K, Metallinou A, Li M, Narayanan SS. Speaker personality classification using systems based on acoustic-lexical cues and an optimal tree-structured Bayesian network. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. 2012. p. 262–265.

Published In

13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

Publication Date

December 1, 2012

Volume

1

Start / End Page

262 / 265