Scholars@Duke publication: Bilevel sparse models for polyphonic music transcription

Bilevel sparse models for polyphonic music transcription

Publication , Conference

Yakar, TB; Litman, R; Sprechmann, P; Bronstein, A; Sapiro, G

Published in: Proceedings of the 14th International Society for Music Information Retrieval Conference Ismir 2013

January 1, 2013

In this work, we propose a trainable sparse model for automatic polyphonic music transcription, which incorporates several successful approaches into a unified optimization framework. Our model combines unsupervised synthesis models similar to latent component analysis and nonnegative factorization with metric learning techniques that allow supervised discriminative learning. We develop efficient stochastic gradient training schemes allowing unsupervised, semi-, and fully supervised training of the model as well its adaptation to test data. We show efficient fixed complexity and latency approximation that can replace iterative minimization algorithms in time-critical applications. Experimental evaluation on synthetic and real data shows promising initial results.

Duke Scholars

Author Guillermo Sapiro Electrical and Computer Engineering

Published In

Proceedings of the 14th International Society for Music Information Retrieval Conference Ismir 2013

Publication Date

January 1, 2013

Start / End Page

65 / 70

Citation

APA

Chicago

ICMJE

MLA

NLM

Yakar, T. B., Litman, R., Sprechmann, P., Bronstein, A., & Sapiro, G. (2013). Bilevel sparse models for polyphonic music transcription. In Proceedings of the 14th International Society for Music Information Retrieval Conference Ismir 2013 (pp. 65–70).

Yakar, T. B., R. Litman, P. Sprechmann, A. Bronstein, and G. Sapiro. “Bilevel sparse models for polyphonic music transcription.” In Proceedings of the 14th International Society for Music Information Retrieval Conference Ismir 2013, 65–70, 2013.

Yakar TB, Litman R, Sprechmann P, Bronstein A, Sapiro G. Bilevel sparse models for polyphonic music transcription. In: Proceedings of the 14th International Society for Music Information Retrieval Conference Ismir 2013. 2013. p. 65–70.

Yakar, T. B., et al. “Bilevel sparse models for polyphonic music transcription.” Proceedings of the 14th International Society for Music Information Retrieval Conference Ismir 2013, 2013, pp. 65–70.

Yakar TB, Litman R, Sprechmann P, Bronstein A, Sapiro G. Bilevel sparse models for polyphonic music transcription. Proceedings of the 14th International Society for Music Information Retrieval Conference Ismir 2013. 2013. p. 65–70.

Published In

Proceedings of the 14th International Society for Music Information Retrieval Conference Ismir 2013

Publication Date

January 1, 2013

Start / End Page

65 / 70