Skip to main content
Journal cover image

Variance Normalised Features for Language and Dialect Discrimination

Publication ,  Journal Article
Miao, X; McLoughlin, I; Song, Y
Published in: Circuits Systems and Signal Processing
July 1, 2021

This paper proposes novel features for automated language and dialect identification that aim to improve discriminative power by ensuring that each element of the feature vector has a normalised contribution to inter-class variance. The method firstly computes inter- and intra-class frequency variance statistics and then distributes the overall spectral variance across spectral regions which are sized to contain near-equal-variance difference. Spectral features are average pooled within regions to obtain variance normalised features (VNFs). The proposed VNFs are low complexity drop-in replacements for MFCC, SDC, PLP or other input features used for speech-related tasks. In this paper, they are evaluated in three types of system, against MFCCs, for two data-constrained language and dialect identification tasks. VNFs demonstrate good results, comfortably outperforming MFCCs at most dimension sizes, and yielding particularly good performance for the most challenging data-constrained 3s utterance length in the LID task.

Duke Scholars

Published In

Circuits Systems and Signal Processing

DOI

EISSN

1531-5878

ISSN

0278-081X

Publication Date

July 1, 2021

Volume

40

Issue

7

Start / End Page

3621 / 3638

Related Subject Headings

  • Industrial Engineering & Automation
  • 4009 Electronics, sensors and digital hardware
  • 0906 Electrical and Electronic Engineering
  • 0102 Applied Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Miao, X., McLoughlin, I., & Song, Y. (2021). Variance Normalised Features for Language and Dialect Discrimination. Circuits Systems and Signal Processing, 40(7), 3621–3638. https://doi.org/10.1007/s00034-020-01641-1
Miao, X., I. McLoughlin, and Y. Song. “Variance Normalised Features for Language and Dialect Discrimination.” Circuits Systems and Signal Processing 40, no. 7 (July 1, 2021): 3621–38. https://doi.org/10.1007/s00034-020-01641-1.
Miao X, McLoughlin I, Song Y. Variance Normalised Features for Language and Dialect Discrimination. Circuits Systems and Signal Processing. 2021 Jul 1;40(7):3621–38.
Miao, X., et al. “Variance Normalised Features for Language and Dialect Discrimination.” Circuits Systems and Signal Processing, vol. 40, no. 7, July 2021, pp. 3621–38. Scopus, doi:10.1007/s00034-020-01641-1.
Miao X, McLoughlin I, Song Y. Variance Normalised Features for Language and Dialect Discrimination. Circuits Systems and Signal Processing. 2021 Jul 1;40(7):3621–3638.
Journal cover image

Published In

Circuits Systems and Signal Processing

DOI

EISSN

1531-5878

ISSN

0278-081X

Publication Date

July 1, 2021

Volume

40

Issue

7

Start / End Page

3621 / 3638

Related Subject Headings

  • Industrial Engineering & Automation
  • 4009 Electronics, sensors and digital hardware
  • 0906 Electrical and Electronic Engineering
  • 0102 Applied Mathematics