Skip to main content

HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines

Publication ,  Conference
Qin, X; Bu, H; Li, M
Published in: ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings
May 1, 2020

This paper presents a far-field text-dependent speaker verification database named HI-MIA. We aim to meet the data requirement for far-field microphone array based speaker verification since most of the publicly available databases are single channel close-talking and text-independent. The database contains recordings of 340 people in rooms designed for the far-field scenario. Recordings are captured by multiple microphone arrays located in different directions and distance to the speaker and a high-fidelity close-talking microphone. Besides, we propose a set of end-to-end neural network based baseline systems that adopt single-channel data for training. Moreover, we propose a testing background aware enrollment augmentation strategy to further enhance the performance. Results show that the fusion systems could achieve 3.29% EER in the far-field enrollment far field testing task and 4.02% EER in the close-talking enrollment and far-field testing task.

Duke Scholars

Published In

ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings

DOI

ISSN

1520-6149

Publication Date

May 1, 2020

Volume

2020-May

Start / End Page

7609 / 7613
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Qin, X., Bu, H., & Li, M. (2020). HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines. In ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (Vol. 2020-May, pp. 7609–7613). https://doi.org/10.1109/ICASSP40776.2020.9054423
Qin, X., H. Bu, and M. Li. “HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines.” In ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 2020-May:7609–13, 2020. https://doi.org/10.1109/ICASSP40776.2020.9054423.
Qin X, Bu H, Li M. HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines. In: ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings. 2020. p. 7609–13.
Qin, X., et al. “HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines.” ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, vol. 2020-May, 2020, pp. 7609–13. Scopus, doi:10.1109/ICASSP40776.2020.9054423.
Qin X, Bu H, Li M. HI-MIA: A Far-Field Text-Dependent Speaker Verification Database and the Baselines. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings. 2020. p. 7609–7613.

Published In

ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings

DOI

ISSN

1520-6149

Publication Date

May 1, 2020

Volume

2020-May

Start / End Page

7609 / 7613