Skip to main content

Speaker verification with the mixture of Gaussian factor analysis based representation

Publication ,  Conference
Li, M
Published in: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
August 4, 2015

This paper presents a generalized i-vector representation framework using the mixture of Gaussian (MoG) factor analysis for speaker verification. Conventionally, a single standard factor analysis is adopted to generate a low rank total variability subspace where the mean supervector is assumed to be Gaussian distributed. The energy that can't be represented by the low rank space is modeled by a single multivariate Gaussian. However, due to the sparsity of the frame level posterior probability and the short duration characteristics, some dimensions of the first-order statistics may not be Gaussian distributed. Therefore, we replace the single Gaussian with a mixture of Gaussians to better represent the residual energy. Experimental results on the NIST SRE 2010 condition 5 female task and the RSR 2015 part 1 female task show that the MoG i-vector outperforms the i-vector baseline by more than 10% relatively for both text independent and text dependent speaker verification tasks, respectively.

Duke Scholars

Published In

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

DOI

ISSN

1520-6149

Publication Date

August 4, 2015

Volume

2015-August

Start / End Page

4679 / 4683
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Li, M. (2015). Speaker verification with the mixture of Gaussian factor analysis based representation. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 2015-August, pp. 4679–4683). https://doi.org/10.1109/ICASSP.2015.7178858
Li, M. “Speaker verification with the mixture of Gaussian factor analysis based representation.” In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2015-August:4679–83, 2015. https://doi.org/10.1109/ICASSP.2015.7178858.
Li M. Speaker verification with the mixture of Gaussian factor analysis based representation. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2015. p. 4679–83.
Li, M. “Speaker verification with the mixture of Gaussian factor analysis based representation.” ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, vol. 2015-August, 2015, pp. 4679–83. Scopus, doi:10.1109/ICASSP.2015.7178858.
Li M. Speaker verification with the mixture of Gaussian factor analysis based representation. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2015. p. 4679–4683.

Published In

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

DOI

ISSN

1520-6149

Publication Date

August 4, 2015

Volume

2015-August

Start / End Page

4679 / 4683