Bilevel sparse models for polyphonic music transcription

Publication, Conference
Yakar, TB; Litman, R; Sprechmann, P; Bronstein, A; Sapiro, G
Published in: Proceedings of the 14th International Society for Music Information Retrieval Conference, ISMIR 2013
January 1, 2013

In this work, we propose a trainable sparse model for automatic polyphonic music transcription that incorporates several successful approaches into a unified optimization framework. Our model combines unsupervised synthesis models, similar to latent component analysis and nonnegative factorization, with metric learning techniques that allow supervised discriminative learning. We develop efficient stochastic gradient training schemes that allow unsupervised, semi-supervised, and fully supervised training of the model, as well as its adaptation to test data. We also show an efficient approximation with fixed complexity and latency that can replace iterative minimization algorithms in time-critical applications. Experimental evaluation on synthetic and real data shows promising initial results.
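As a rough illustration of the kind of synthesis model the abstract refers to, the sketch below codes a single spectral frame as a nonnegative, sparse combination of dictionary atoms. The abstract does not give the paper's formulation, so everything here is an assumption: the objective, the function name `nonneg_sparse_code`, the parameters `lam` and `n_iter`, and the toy dictionary `D` are all hypothetical. Running a fixed number of iterations is only one simple way to bound the cost of an iterative solver; the approximation proposed in the paper may work differently.

```python
import numpy as np

def nonneg_sparse_code(x, D, lam=0.1, n_iter=50):
    """Approximately solve  min_{z >= 0} 0.5 * ||x - D z||^2 + lam * sum(z)
    using a fixed number of projected proximal-gradient (ISTA-style) steps.

    Hypothetical helper for illustration; not the model from the paper.
    """
    # Step size 1/L, where L is the largest eigenvalue of D^T D.
    L = np.linalg.norm(D, 2) ** 2
    z = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ z - x)                   # gradient of the data term
        z = np.maximum(z - (grad + lam) / L, 0.0)  # shrink, then clip at zero
    return z

# Toy data (assumed): three "note" atoms over four frequency bins.
D = np.array([[1.0, 0.0, 0.0],
              [0.0, 1.0, 0.0],
              [0.0, 0.0, 1.0],
              [0.0, 0.0, 0.0]])
x = D @ np.array([1.0, 0.0, 0.5])  # frame containing atoms 0 and 2

z = nonneg_sparse_code(x, D, lam=0.01)
active = np.flatnonzero(z > 0.1)   # indices of atoms judged active
print(active)
```

In this toy setting the nonzero entries of `z` mark which atoms are active in the frame, which is the role note activations play in a transcription system.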

Published In

Proceedings of the 14th International Society for Music Information Retrieval Conference, ISMIR 2013

Publication Date

January 1, 2013

Start / End Page

65 / 70
 

Citation

APA: Yakar, T. B., Litman, R., Sprechmann, P., Bronstein, A., & Sapiro, G. (2013). Bilevel sparse models for polyphonic music transcription. In Proceedings of the 14th International Society for Music Information Retrieval Conference, ISMIR 2013 (pp. 65–70).

Chicago: Yakar, T. B., R. Litman, P. Sprechmann, A. Bronstein, and G. Sapiro. “Bilevel sparse models for polyphonic music transcription.” In Proceedings of the 14th International Society for Music Information Retrieval Conference, ISMIR 2013, 65–70, 2013.

ICMJE: Yakar TB, Litman R, Sprechmann P, Bronstein A, Sapiro G. Bilevel sparse models for polyphonic music transcription. In: Proceedings of the 14th International Society for Music Information Retrieval Conference, ISMIR 2013. 2013. p. 65–70.

MLA: Yakar, T. B., et al. “Bilevel sparse models for polyphonic music transcription.” Proceedings of the 14th International Society for Music Information Retrieval Conference, ISMIR 2013, 2013, pp. 65–70.

NLM: Yakar TB, Litman R, Sprechmann P, Bronstein A, Sapiro G. Bilevel sparse models for polyphonic music transcription. Proceedings of the 14th International Society for Music Information Retrieval Conference, ISMIR 2013. 2013. p. 65–70.
