Skip to main content

Cross-domain speaker recognition using domain adversarial siamese network with a domain discriminator

Publication ,  Journal Article
Chen, Z; Miao, X; Xiao, R; Wang, W
Published in: Electronics Letters
July 9, 2020

With the widespread use of automatic speaker recognition in realistic world, it suffers a lot when there is a domain mismatch, including channel, language, distance etc. Recent research studies have introduced the adversarial-learning mechanism into deep neural networks to reduce the distribution mismatch between different domains. However, they only aligned the domain distributions between the background training and evaluation data. Few focused on the diverse distribution underlying the enrol and test data. In this Letter, the authors propose a domain adversarial siamese (DAS) network trying to eliminate the domain influence on speech representation. Specifically, they feed a network with speech pairs from the same speaker. Then a domain discriminator is introduced to capture the domain consistence or difference between pairs. Final embeddings become domain-invariant and more speaker-discriminative. A cross-channel data set is sort out from NIST speaker recognition evaluation and more experiments are conducted on AISHELL-Wake-Up-1 data set. Results show that DAS performs equally effective with typical domain adversarial methods, improving results at least 10% on energy efficiency rating. Furthermore, it is proved to be more valid for scenarios such as unbalanced data amount and unknown domain, achieving relatively 11% improvements.

Duke Scholars

Published In

Electronics Letters

DOI

ISSN

0013-5194

Publication Date

July 9, 2020

Volume

56

Issue

14

Start / End Page

737 / 739

Related Subject Headings

  • Electrical & Electronic Engineering
  • 4009 Electronics, sensors and digital hardware
  • 4006 Communications engineering
  • 1005 Communications Technologies
  • 0906 Electrical and Electronic Engineering
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Chen, Z., Miao, X., Xiao, R., & Wang, W. (2020). Cross-domain speaker recognition using domain adversarial siamese network with a domain discriminator. Electronics Letters, 56(14), 737–739. https://doi.org/10.1049/el.2020.0673
Chen, Z., X. Miao, R. Xiao, and W. Wang. “Cross-domain speaker recognition using domain adversarial siamese network with a domain discriminator.” Electronics Letters 56, no. 14 (July 9, 2020): 737–39. https://doi.org/10.1049/el.2020.0673.
Chen Z, Miao X, Xiao R, Wang W. Cross-domain speaker recognition using domain adversarial siamese network with a domain discriminator. Electronics Letters. 2020 Jul 9;56(14):737–9.
Chen, Z., et al. “Cross-domain speaker recognition using domain adversarial siamese network with a domain discriminator.” Electronics Letters, vol. 56, no. 14, July 2020, pp. 737–39. Scopus, doi:10.1049/el.2020.0673.
Chen Z, Miao X, Xiao R, Wang W. Cross-domain speaker recognition using domain adversarial siamese network with a domain discriminator. Electronics Letters. 2020 Jul 9;56(14):737–739.

Published In

Electronics Letters

DOI

ISSN

0013-5194

Publication Date

July 9, 2020

Volume

56

Issue

14

Start / End Page

737 / 739

Related Subject Headings

  • Electrical & Electronic Engineering
  • 4009 Electronics, sensors and digital hardware
  • 4006 Communications engineering
  • 1005 Communications Technologies
  • 0906 Electrical and Electronic Engineering
  • 0801 Artificial Intelligence and Image Processing