Skip to main content

Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system

Publication ,  Conference
Hong, QY; Li, L; Li, M; Huang, L; Wan, L; Zhang, J
Published in: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
January 1, 2015

To deal with the performance degradation of speaker recognition due to duration mismatch between enrollment and test utterances, a novel strategy to modify the standard normal prior distribution of the i-vector during probabilistic linear discriminant analysis (PLDA) modeling is employed. This new modified-prior PLDA model incorporates the covariance matrix scaled with duration of each utterance for each speaker, which achieves more discriminative characteristics by learning the duration variability as well as session variation in the i-vector space. Furthermore, an efficient Quality Measure Function (QMF) method which adopts duration variation as a compensation technique is employed to eliminate the linear shift in the score domain. To evaluate the robustness of the proposed approach, experiments were conducted on the NIST SRE10 core-core task in condition-5 with varying test utterance duration, in which the i-vectors of test utterances were extracted from full segment and randomly truncated segments of duration 10s and 20s. The results demonstrated the efficiency of modified-prior PLDA in different duration conditions, and the combined score calibration further improved the performance of speaker recognition.

Duke Scholars

Published In

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

EISSN

1990-9772

ISSN

2308-457X

Publication Date

January 1, 2015

Volume

2015-January

Start / End Page

1037 / 1041
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Hong, Q. Y., Li, L., Li, M., Huang, L., Wan, L., & Zhang, J. (2015). Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Vol. 2015-January, pp. 1037–1041).
Hong, Q. Y., L. Li, M. Li, L. Huang, L. Wan, and J. Zhang. “Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system.” In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2015-January:1037–41, 2015.
Hong QY, Li L, Li M, Huang L, Wan L, Zhang J. Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2015. p. 1037–41.
Hong, Q. Y., et al. “Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system.” Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, vol. 2015-January, 2015, pp. 1037–41.
Hong QY, Li L, Li M, Huang L, Wan L, Zhang J. Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2015. p. 1037–1041.

Published In

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

EISSN

1990-9772

ISSN

2308-457X

Publication Date

January 1, 2015

Volume

2015-January

Start / End Page

1037 / 1041