HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines

Publication, Conference
Qin, X; Bu, H; Li, M
Published in: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
May 1, 2020

This paper presents HI-MIA, a far-field text-dependent speaker verification database. We aim to meet the data requirements of far-field, microphone-array-based speaker verification, since most publicly available databases are single-channel, close-talking, and text-independent. The database contains recordings of 340 people in rooms designed for the far-field scenario. Recordings are captured by multiple microphone arrays placed at different directions and distances from the speaker, together with a high-fidelity close-talking microphone. We also propose a set of end-to-end neural-network-based baseline systems that are trained on single-channel data. In addition, we propose a testing-background-aware enrollment augmentation strategy to further improve performance. Results show that the fusion systems achieve a 3.29% equal error rate (EER) in the far-field enrollment, far-field testing task and a 4.02% EER in the close-talking enrollment, far-field testing task.

Duke Scholars

Published In

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

DOI

10.1109/ICASSP40776.2020.9054423

ISSN

1520-6149

Publication Date

May 1, 2020

Volume

2020-May

Start / End Page

7609 / 7613
 

Citation

APA: Qin, X., Bu, H., & Li, M. (2020). HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 2020-May, pp. 7609–7613). https://doi.org/10.1109/ICASSP40776.2020.9054423

Chicago: Qin, X., H. Bu, and M. Li. “HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines.” In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2020-May:7609–13, 2020. https://doi.org/10.1109/ICASSP40776.2020.9054423.

ICMJE: Qin X, Bu H, Li M. HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2020. p. 7609–13.

MLA: Qin, X., et al. “HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines.” ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, vol. 2020-May, 2020, pp. 7609–13. Scopus, doi:10.1109/ICASSP40776.2020.9054423.

NLM: Qin X, Bu H, Li M. HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2020. p. 7609–7613.