Skip to main content

A DUAL-PATH FRAMEWORK WITH FREQUENCY-AND-TIME EXCITED NETWORK FOR ANOMALOUS SOUND DETECTION

Publication ,  Conference
Zhang, Y; Liu, J; Tian, Y; Liu, H; Li, M
Published in: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
January 1, 2024

In contrast to human speech, machine-generated sounds of the same type often exhibit consistent frequency characteristics and discernible temporal periodicity. However, leveraging these dual attributes in anomaly detection remains relatively under-explored. In this paper, we propose an automated dual-path framework that learns prominent frequency and temporal patterns for diverse machine types. One pathway uses a novel Frequency-and-Time Excited Network (FTE-Net) to learn the salient features across frequency and time axes of the spectrogram. It incorporates a Frequency-and-Time Chunkwise Encoder (FTC-Encoder) and an excitation network. The other pathway uses a 1D convolutional network for utterance-level spectrum. Experimental results on the DCASE 2023 task 2 dataset show the state-of-the-art performance of our proposed method. Moreover, visualizations of the intermediate feature maps in the excitation network are provided to illustrate the effectiveness of our method.

Duke Scholars

Published In

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

DOI

ISSN

1520-6149

Publication Date

January 1, 2024

Start / End Page

1266 / 1270
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Zhang, Y., Liu, J., Tian, Y., Liu, H., & Li, M. (2024). A DUAL-PATH FRAMEWORK WITH FREQUENCY-AND-TIME EXCITED NETWORK FOR ANOMALOUS SOUND DETECTION. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp. 1266–1270). https://doi.org/10.1109/ICASSP48485.2024.10448126
Zhang, Y., J. Liu, Y. Tian, H. Liu, and M. Li. “A DUAL-PATH FRAMEWORK WITH FREQUENCY-AND-TIME EXCITED NETWORK FOR ANOMALOUS SOUND DETECTION.” In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1266–70, 2024. https://doi.org/10.1109/ICASSP48485.2024.10448126.
Zhang Y, Liu J, Tian Y, Liu H, Li M. A DUAL-PATH FRAMEWORK WITH FREQUENCY-AND-TIME EXCITED NETWORK FOR ANOMALOUS SOUND DETECTION. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2024. p. 1266–70.
Zhang, Y., et al. “A DUAL-PATH FRAMEWORK WITH FREQUENCY-AND-TIME EXCITED NETWORK FOR ANOMALOUS SOUND DETECTION.” ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2024, pp. 1266–70. Scopus, doi:10.1109/ICASSP48485.2024.10448126.
Zhang Y, Liu J, Tian Y, Liu H, Li M. A DUAL-PATH FRAMEWORK WITH FREQUENCY-AND-TIME EXCITED NETWORK FOR ANOMALOUS SOUND DETECTION. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2024. p. 1266–1270.

Published In

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

DOI

ISSN

1520-6149

Publication Date

January 1, 2024

Start / End Page

1266 / 1270