Scholars@Duke publication: Large-scale Bayesian multi-label learning via topic-based label embeddings

Large-scale Bayesian multi-label learning via topic-based label embeddings

Publication , Conference

Raiy, P; Hu, C; Henao, R; Carin, L

Published in: Advances in Neural Information Processing Systems

January 1, 2015

We present a scalable Bayesian multi-label learning model based on learning lowdimensional label embeddings. Our model assumes that each label vector is generated as a weighted combination of a set of topics (each topic being a distribution over labels), where the combination weights (i.e., the embeddings) for each label vector are conditioned on the observed feature vector. This construction, coupled with a Bernoulli-Poisson link function for each label of the binary label vector, leads to a model with a computational cost that scales in the number of positive labels in the label matrix. This makes the model particularly appealing for real-world multi-label learning problems where the label matrix is usually very massive but highly sparse. Using a data-augmentation strategy leads to full local conjugacy in our model, facilitating simple and very efficient Gibbs sampling, as well as an Expectation Maximization algorithm for inference. Also, predicting the label vector at test time does not require doing an inference for the label embeddings and can be done in closed form. We report results on several benchmark data sets, comparing our model with various state-of-the art methods.

Duke Scholars

Author Ricardo Henao Biostatistics & Bioinformatics, Division of Translational Bi ...

Author Lawrence Carin Electrical and Computer Engineering

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2015

Volume

2015-January

Start / End Page

3222 / 3230

Related Subject Headings

4611 Machine learning
1702 Cognitive Sciences
1701 Psychology

Citation

APA

Chicago

ICMJE

MLA

NLM

Raiy, P., Hu, C., Henao, R., & Carin, L. (2015). Large-scale Bayesian multi-label learning via topic-based label embeddings. In Advances in Neural Information Processing Systems (Vol. 2015-January, pp. 3222–3230).

Raiy, P., C. Hu, R. Henao, and L. Carin. “Large-scale Bayesian multi-label learning via topic-based label embeddings.” In Advances in Neural Information Processing Systems, 2015-January:3222–30, 2015.

Raiy P, Hu C, Henao R, Carin L. Large-scale Bayesian multi-label learning via topic-based label embeddings. In: Advances in Neural Information Processing Systems. 2015. p. 3222–30.

Raiy, P., et al. “Large-scale Bayesian multi-label learning via topic-based label embeddings.” Advances in Neural Information Processing Systems, vol. 2015-January, 2015, pp. 3222–30.

Raiy P, Hu C, Henao R, Carin L. Large-scale Bayesian multi-label learning via topic-based label embeddings. Advances in Neural Information Processing Systems. 2015. p. 3222–3230.

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2015

Volume

2015-January

Start / End Page

3222 / 3230

Related Subject Headings

4611 Machine learning
1702 Cognitive Sciences
1701 Psychology