Denoising Autoencoder-Based Language Feature Compensation

Publication, Journal Article
Miao, X; Xu, J; Wang, J
Published in: Jisuanji Yanjiu Yu Fazhan Computer Research and Development
May 1, 2019

Language identification (LID) accuracy often drops significantly when the durations of the test and training utterances are mismatched. This paper proposes a method to compensate language features using a denoising autoencoder (DAE). DAE-based language feature compensation maps language features from variable-length utterances into a fixed-length representation, which helps mitigate both the length mismatch and the unbalanced phoneme distribution. The algorithm first converts the speech signal to low-level acoustic features by framing and transforming, and then estimates its i-vector and phonetic vector. These two vectors are concatenated and fed into the DAE-based language feature compensation unit. The compensated i-vector output by the DAE and the original i-vector are each presented to the back-end classifier to obtain two score vectors, which are then fused at the score level to obtain the final result. Tests on NIST-LRE07 demonstrate that this feature compensation method improves identification performance across various test speech durations. Compared with traditional LID systems, performance improves by 3.16% for 30 s test utterances and by 2.90% for 10 s test utterances. Compared with the end-to-end LID system, performance on 3 s test utterances improves by 3.21%.
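The data flow described in the abstract (concatenate the i-vector and phonetic vector, pass them through the DAE, score both the compensated and original i-vectors, then fuse the scores) can be sketched roughly as follows. This is only an illustrative sketch and not the authors' implementation: the vector dimensions, the single-hidden-layer DAE with untrained random weights, the cosine-scoring back end, and the equal-weight fusion are all assumptions, since the abstract does not specify them. All function and constant names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# All sizes below are assumptions for illustration; the abstract gives none.
IVEC_DIM = 400     # assumed i-vector dimension
PHON_DIM = 40      # assumed phonetic-vector dimension
HIDDEN_DIM = 256   # assumed DAE hidden-layer width
NUM_LANGS = 14     # assumed number of target languages


def init_dae(in_dim, hidden_dim, out_dim):
    """Create a one-hidden-layer DAE with random, untrained placeholder weights."""
    return {
        "W1": rng.normal(0.0, 0.05, (in_dim, hidden_dim)),
        "b1": np.zeros(hidden_dim),
        "W2": rng.normal(0.0, 0.05, (hidden_dim, out_dim)),
        "b2": np.zeros(out_dim),
    }


def compensate(dae, ivector, phonetic_vector):
    """Concatenate the two vectors and map them to a compensated i-vector."""
    x = np.concatenate([ivector, phonetic_vector])
    hidden = np.tanh(x @ dae["W1"] + dae["b1"])
    return hidden @ dae["W2"] + dae["b2"]


def cosine_scores(vec, language_means):
    """Assumed back end: cosine similarity against one mean vector per language."""
    vec = vec / np.linalg.norm(vec)
    means = language_means / np.linalg.norm(language_means, axis=1, keepdims=True)
    return means @ vec


def fuse(scores_a, scores_b, weight=0.5):
    """Score-level fusion as a weighted sum (the weight is an assumption)."""
    return weight * scores_a + (1.0 - weight) * scores_b


def identify(ivector, phonetic_vector, dae, language_means):
    """Score the original and compensated i-vectors, then fuse the two score vectors."""
    compensated = compensate(dae, ivector, phonetic_vector)
    scores_original = cosine_scores(ivector, language_means)
    scores_compensated = cosine_scores(compensated, language_means)
    fused = fuse(scores_compensated, scores_original)
    return int(np.argmax(fused)), fused


# Usage with synthetic stand-ins for real i-vectors and phonetic vectors.
dae = init_dae(IVEC_DIM + PHON_DIM, HIDDEN_DIM, IVEC_DIM)
language_means = rng.normal(size=(NUM_LANGS, IVEC_DIM))
ivector = rng.normal(size=IVEC_DIM)
phonetic_vector = rng.normal(size=PHON_DIM)
language_index, fused_scores = identify(ivector, phonetic_vector, dae, language_means)
```

In a real system the DAE weights would be trained, presumably so that features from short utterances are mapped toward those of longer ones, and the back-end classifier and fusion weights would be tuned on development data; none of those details are given in the abstract.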

Published In

Jisuanji Yanjiu Yu Fazhan Computer Research and Development

DOI

10.7544/issn1000-1239.2019.20180471

ISSN

1000-1239

Publication Date

May 1, 2019

Volume

56

Issue

5

Start / End Page

1082 / 1091

Related Subject Headings

  • Software Engineering

Citation

APA: Miao, X., Xu, J., & Wang, J. (2019). Denoising Autoencoder-Based Language Feature Compensation. Jisuanji Yanjiu Yu Fazhan Computer Research and Development, 56(5), 1082–1091. https://doi.org/10.7544/issn1000-1239.2019.20180471
Chicago: Miao, X., J. Xu, and J. Wang. “Denoising Autoencoder-Based Language Feature Compensation.” Jisuanji Yanjiu Yu Fazhan Computer Research and Development 56, no. 5 (May 1, 2019): 1082–91. https://doi.org/10.7544/issn1000-1239.2019.20180471.
ICMJE: Miao X, Xu J, Wang J. Denoising Autoencoder-Based Language Feature Compensation. Jisuanji Yanjiu Yu Fazhan Computer Research and Development. 2019 May 1;56(5):1082–91.
MLA: Miao, X., et al. “Denoising Autoencoder-Based Language Feature Compensation.” Jisuanji Yanjiu Yu Fazhan Computer Research and Development, vol. 56, no. 5, May 2019, pp. 1082–91. Scopus, doi:10.7544/issn1000-1239.2019.20180471.
NLM: Miao X, Xu J, Wang J. Denoising Autoencoder-Based Language Feature Compensation. Jisuanji Yanjiu Yu Fazhan Computer Research and Development. 2019 May 1;56(5):1082–1091.
