Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation

Publication, Conference
Li, T; Chen, J; Hou, H; Li, M
Published in: 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021
January 24, 2021

Models based on Convolutional Neural Networks (CNNs) or Long Short-term Memory (LSTM) networks, taking spectrograms or waveforms as input, are commonly used for deep learning based audio source separation. In this paper, we propose a Sliced Attention-based neural network (Sams-Net) in the spectrogram domain for the music source separation task. It enables spectral feature interactions through a multi-head attention mechanism, is easier to parallelize than LSTMs, and has a larger receptive field than CNNs. Experimental results on the MUSDB18 dataset show that the proposed method, with fewer parameters, outperforms most state-of-the-art DNN-based methods.
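To illustrate the idea of multi-head attention applied to slices of a spectrogram, the following is a minimal NumPy sketch, not the authors' implementation. It assumes the slices are non-overlapping chunks along the time axis and uses random, untrained projection weights. The abstract does not specify the slicing scheme, and the function names and parameters (`sliced_attention`, `slice_len`, `num_heads`) are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def multi_head_self_attention(x, weights, num_heads):
    """Scaled dot-product self-attention over one slice.

    x: array of shape (frames, features).
    weights: tuple (w_q, w_k, w_v, w_o), each of shape (features, features).
    Returns the attended output (frames, features) and the attention
    maps of shape (num_heads, frames, frames).
    """
    t, d = x.shape
    assert d % num_heads == 0, "features must be divisible by num_heads"
    d_head = d // num_heads
    w_q, w_k, w_v, w_o = weights

    def split(m):
        # (frames, features) -> (heads, frames, d_head)
        return m.reshape(t, num_heads, d_head).transpose(1, 0, 2)

    q, k, v = split(x @ w_q), split(x @ w_k), split(x @ w_v)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)
    attn = softmax(scores, axis=-1)
    heads = attn @ v
    merged = heads.transpose(1, 0, 2).reshape(t, d)
    return merged @ w_o, attn


def sliced_attention(spec, slice_len, num_heads, seed=0):
    """Apply attention independently within each time slice.

    spec: magnitude spectrogram of shape (frames, freq_bins).
    Assumption: non-overlapping slices of slice_len frames that share
    one set of randomly initialised (untrained) projection weights.
    """
    frames, bins_ = spec.shape
    assert frames % slice_len == 0, "frames must be divisible by slice_len"
    rng = np.random.default_rng(seed)
    weights = tuple(
        rng.standard_normal((bins_, bins_)) / np.sqrt(bins_) for _ in range(4)
    )
    outputs = []
    for start in range(0, frames, slice_len):
        chunk = spec[start:start + slice_len]
        out, _ = multi_head_self_attention(chunk, weights, num_heads)
        outputs.append(out)
    return np.concatenate(outputs, axis=0)


# Toy input: 8 frames, 16 frequency bins.
spec = np.abs(np.random.default_rng(1).standard_normal((8, 16)))
out = sliced_attention(spec, slice_len=4, num_heads=4)
```

Every frame in a slice attends to every other frame in that slice through a single matrix product. That is the property behind the abstract's claims of easier parallel computation than an LSTM and a wider receptive field than a small convolution kernel.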

Published In

2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021

DOI

10.1109/ISCSLP49672.2021.9362081

Publication Date

January 24, 2021
 

Citation

APA: Li, T., Chen, J., Hou, H., & Li, M. (2021). Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation. In 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021. https://doi.org/10.1109/ISCSLP49672.2021.9362081
Chicago: Li, T., J. Chen, H. Hou, and M. Li. “Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation.” In 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021, 2021. https://doi.org/10.1109/ISCSLP49672.2021.9362081.
ICMJE: Li T, Chen J, Hou H, Li M. Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation. In: 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021. 2021.
MLA: Li, T., et al. “Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation.” 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021, 2021. Scopus, doi:10.1109/ISCSLP49672.2021.9362081.
NLM: Li T, Chen J, Hou H, Li M. Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation. 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021. 2021.
