Skip to main content
Journal cover image

Automatic intelligibility classification of sentence-level pathological speech

Publication ,  Journal Article
Kim, J; Kumar, N; Tsiartas, A; Li, M; Narayanan, SS
Published in: Computer Speech and Language
January 1, 2015

Pathological speech usually refers to the condition of speech distortion resulting from atypicalities in voice and/or in the articulatory mechanisms owing to disease, illness or other physical or biological insult to the production system. Although automatic evaluation of speech intelligibility and quality could come in handy in these scenarios to assist experts in diagnosis and treatment design, the many sources and types of variability often make it a very challenging computational processing problem. In this work we propose novel sentence-level features to capture abnormal variation in the prosodic, voice quality and pronunciation aspects in pathological speech. In addition, we propose a post-classification posterior smoothing scheme which refines the posterior of a test sample based on the posteriors of other test samples. Finally, we perform feature-level fusions and subsystem decision fusion for arriving at a final intelligibility decision. The performances are tested on two pathological speech datasets, the NKI CCRT Speech Corpus (advanced head and neck cancer) and the TORGO database (cerebral palsy or amyotrophic lateral sclerosis), by evaluating classification accuracy without overlapping subjects' data among training and test partitions. Results show that the feature sets of each of the voice quality subsystem, prosodic subsystem, and pronunciation subsystem, offer significant discriminating power for binary intelligibility classification. We observe that the proposed posterior smoothing in the acoustic space can further reduce classification errors. The smoothed posterior score fusion of subsystems shows the best classification performance (73.5% for unweighted, and 72.8% for weighted, average recalls of the binary classes).

Duke Scholars

Published In

Computer Speech and Language

DOI

EISSN

1095-8363

ISSN

0885-2308

Publication Date

January 1, 2015

Volume

29

Issue

1

Start / End Page

132 / 144

Related Subject Headings

  • Speech-Language Pathology & Audiology
  • 46 Information and computing sciences
  • 40 Engineering
  • 2004 Linguistics
  • 1702 Cognitive Sciences
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Kim, J., Kumar, N., Tsiartas, A., Li, M., & Narayanan, S. S. (2015). Automatic intelligibility classification of sentence-level pathological speech. Computer Speech and Language, 29(1), 132–144. https://doi.org/10.1016/j.csl.2014.02.001
Kim, J., N. Kumar, A. Tsiartas, M. Li, and S. S. Narayanan. “Automatic intelligibility classification of sentence-level pathological speech.” Computer Speech and Language 29, no. 1 (January 1, 2015): 132–44. https://doi.org/10.1016/j.csl.2014.02.001.
Kim J, Kumar N, Tsiartas A, Li M, Narayanan SS. Automatic intelligibility classification of sentence-level pathological speech. Computer Speech and Language. 2015 Jan 1;29(1):132–44.
Kim, J., et al. “Automatic intelligibility classification of sentence-level pathological speech.” Computer Speech and Language, vol. 29, no. 1, Jan. 2015, pp. 132–44. Scopus, doi:10.1016/j.csl.2014.02.001.
Kim J, Kumar N, Tsiartas A, Li M, Narayanan SS. Automatic intelligibility classification of sentence-level pathological speech. Computer Speech and Language. 2015 Jan 1;29(1):132–144.
Journal cover image

Published In

Computer Speech and Language

DOI

EISSN

1095-8363

ISSN

0885-2308

Publication Date

January 1, 2015

Volume

29

Issue

1

Start / End Page

132 / 144

Related Subject Headings

  • Speech-Language Pathology & Audiology
  • 46 Information and computing sciences
  • 40 Engineering
  • 2004 Linguistics
  • 1702 Cognitive Sciences
  • 0801 Artificial Intelligence and Image Processing