Skip to main content

Wavoice: An mmWave-Assisted Noise-Resistant Speech Recognition System

Publication ,  Journal Article
Liu, T; Wang, C; Li, Z; Huang, MC; Xu, W; Lin, F
Published in: ACM Transactions on Sensor Networks
May 11, 2024

As automatic speech recognition evolves, deployment of the voice user interface (VUI) has boomingly expanded. Especially since the COVID-19 pandemic, the VUI has gained more attention in online communication owing to its non-contact property. However, the VUI struggles to be applied in public scenes due to the degradation of received audio signals caused by various ambient noises. In this article, we propose Wavoice, the first noise-resistant multi-modal speech recognition system that fuses two distinct voices sensing modalities (i.e., millimeter-wave signals and audio signals from a microphone) together. One key contribution is to model the inherent correlation between millimeter-wave and audio signals. Based on it, Wavoice facilitates the real-time noise-resistant voice activity detection and user targeting from multiple speakers. Additionally, we elaborate on two novel modules for multi-modal fusion embedded into the neural network, leading to accurate speech recognition. Extensive experiments prove the effectiveness of Wavoice under adverse conditions—that is, the character recognition error rate below 1% in a range of 7 m. In terms of robustness and accuracy, Wavoice considerably outperforms existing audio-only speech recognition methods with lower character error and word error rates.

Duke Scholars

Published In

ACM Transactions on Sensor Networks

DOI

EISSN

1550-4867

ISSN

1550-4859

Publication Date

May 11, 2024

Volume

20

Issue

4

Related Subject Headings

  • Networking & Telecommunications
  • 4009 Electronics, sensors and digital hardware
  • 1005 Communications Technologies
  • 0906 Electrical and Electronic Engineering
  • 0805 Distributed Computing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Liu, T., Wang, C., Li, Z., Huang, M. C., Xu, W., & Lin, F. (2024). Wavoice: An mmWave-Assisted Noise-Resistant Speech Recognition System. ACM Transactions on Sensor Networks, 20(4). https://doi.org/10.1145/3597457
Liu, T., C. Wang, Z. Li, M. C. Huang, W. Xu, and F. Lin. “Wavoice: An mmWave-Assisted Noise-Resistant Speech Recognition System.” ACM Transactions on Sensor Networks 20, no. 4 (May 11, 2024). https://doi.org/10.1145/3597457.
Liu T, Wang C, Li Z, Huang MC, Xu W, Lin F. Wavoice: An mmWave-Assisted Noise-Resistant Speech Recognition System. ACM Transactions on Sensor Networks. 2024 May 11;20(4).
Liu, T., et al. “Wavoice: An mmWave-Assisted Noise-Resistant Speech Recognition System.” ACM Transactions on Sensor Networks, vol. 20, no. 4, May 2024. Scopus, doi:10.1145/3597457.
Liu T, Wang C, Li Z, Huang MC, Xu W, Lin F. Wavoice: An mmWave-Assisted Noise-Resistant Speech Recognition System. ACM Transactions on Sensor Networks. 2024 May 11;20(4).

Published In

ACM Transactions on Sensor Networks

DOI

EISSN

1550-4867

ISSN

1550-4859

Publication Date

May 11, 2024

Volume

20

Issue

4

Related Subject Headings

  • Networking & Telecommunications
  • 4009 Electronics, sensors and digital hardware
  • 1005 Communications Technologies
  • 0906 Electrical and Electronic Engineering
  • 0805 Distributed Computing