Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation
Publication
, Conference
Li, T; Chen, J; Hou, H; Li, M
Published in: 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021
January 24, 2021
Convolutional Neural Network (CNN) or Long Short-term Memory (LSTM) based models with the input of spectrogram or waveforms are commonly used for deep learning based audio source separation. In this paper, we propose a Sliced Attention-based neural network (Sams-Net) in the spectrogram domain for the music source separation task. It enables spectral feature interactions with multi-head attention mechanism, achieves easier parallel computing and has a larger receptive field com-pared with LSTMs and CNNs respectively. Experimental results on the MUSDB18 dataset show that the proposed method, with fewer parameters, outperforms most of the state-of-the-art DNN-based methods.
Duke Scholars
Published In
2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021
DOI
Publication Date
January 24, 2021
Citation
APA
Chicago
ICMJE
MLA
NLM
Li, T., Chen, J., Hou, H., & Li, M. (2021). Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation. In 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021. https://doi.org/10.1109/ISCSLP49672.2021.9362081
Li, T., J. Chen, H. Hou, and M. Li. “Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation.” In 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021, 2021. https://doi.org/10.1109/ISCSLP49672.2021.9362081.
Li T, Chen J, Hou H, Li M. Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation. In: 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021. 2021.
Li, T., et al. “Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation.” 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021, 2021. Scopus, doi:10.1109/ISCSLP49672.2021.9362081.
Li T, Chen J, Hou H, Li M. Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation. 2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021. 2021.
Published In
2021 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021
DOI
Publication Date
January 24, 2021