
Multi-task learning for analyzing and sorting large databases of sequential data

Publication, Journal Article
Ni, K; Paisley, J; Carin, L; Dunson, D
Published in: IEEE Transactions on Signal Processing
August 1, 2008

A new hierarchical nonparametric Bayesian framework is proposed for the problem of multi-task learning (MTL) with sequential data. The models for multiple tasks, each characterized by sequential data, are learned jointly, and the intertask relationships are obtained simultaneously. This MTL setting is used to analyze and sort large databases composed of sequential data, such as music clips. Within each data set, we represent the sequential data with an infinite hidden Markov model (iHMM), avoiding the problem of model selection (selecting a number of states). Across the data sets, the multiple iHMMs are learned jointly in an MTL setting, employing a nested Dirichlet process (nDP). The nDP-iHMM MTL method allows simultaneous task-level and data-level clustering, with which the individual iHMMs are enhanced and the between-task similarities are learned. Therefore, in addition to improved learning of each of the models via appropriate data sharing, the learned sharing mechanisms are used to infer interdata relationships of interest for data search. Specifically, the MTL-learned task-level sharing mechanisms are used to define the affinity matrix in a graph-diffusion sorting framework. To speed up the MCMC inference for large databases, the nDP-iHMM is truncated to yield a nested Dirichlet-distribution based HMM representation, which accommodates fast variational Bayesian (VB) analysis for large-scale inference. The effectiveness of the framework is demonstrated using a database composed of 2500 digital music pieces. © 2008 IEEE.
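
The graph-diffusion sorting step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact diffusion formulation, so what follows is a generic random-walk diffusion and should be read as an assumption rather than the authors' method: the affinity matrix is row-normalized into a transition matrix, unit mass is placed on a query item, the mass is propagated for a few steps, and the remaining items are ranked by the mass they receive. The function names, the number of steps, and the affinity values are all hypothetical.

```python
def row_normalize(affinity):
    """Turn a nonnegative affinity matrix into a row-stochastic transition matrix."""
    transition = []
    for row in affinity:
        total = sum(row)
        if total <= 0:
            raise ValueError("each item needs at least one positive affinity")
        transition.append([value / total for value in row])
    return transition


def diffuse(transition, query, steps):
    """Propagate unit mass from item `query` for `steps` random-walk steps."""
    n = len(transition)
    mass = [0.0] * n
    mass[query] = 1.0
    for _ in range(steps):
        mass = [sum(mass[i] * transition[i][j] for i in range(n)) for j in range(n)]
    return mass


def sort_by_diffusion(affinity, query, steps=3):
    """Rank all items other than the query by diffused mass, largest first."""
    mass = diffuse(row_normalize(affinity), query, steps)
    others = [j for j in range(len(affinity)) if j != query]
    return sorted(others, key=lambda j: mass[j], reverse=True)


# Hypothetical affinities (not from the paper): items 0 and 1 share strongly,
# items 2 and 3 share strongly, and the two groups share only weakly.
affinity = [
    [1.0, 0.9, 0.1, 0.0],
    [0.9, 1.0, 0.2, 0.1],
    [0.1, 0.2, 1.0, 0.8],
    [0.0, 0.1, 0.8, 1.0],
]
ranking = sort_by_diffusion(affinity, query=0, steps=3)
```

In the setting the abstract describes, the affinity entries would come from the learned task-level sharing between data sets rather than being written by hand. Using several diffusion steps lets items that are only indirectly connected to the query still receive a score.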

Published In

IEEE Transactions on Signal Processing

DOI

10.1109/TSP.2008.924798

ISSN

1053-587X

Publication Date

August 1, 2008

Volume

56

Issue

8 II

Start / End Page

3918 / 3931

Related Subject Headings

  • Networking & Telecommunications
 

Citation

APA: Ni, K., Paisley, J., Carin, L., & Dunson, D. (2008). Multi-task learning for analyzing and sorting large databases of sequential data. IEEE Transactions on Signal Processing, 56(8 II), 3918–3931. https://doi.org/10.1109/TSP.2008.924798

Chicago: Ni, K., J. Paisley, L. Carin, and D. Dunson. “Multi-task learning for analyzing and sorting large databases of sequential data.” IEEE Transactions on Signal Processing 56, no. 8 II (August 1, 2008): 3918–31. https://doi.org/10.1109/TSP.2008.924798.

ICMJE: Ni K, Paisley J, Carin L, Dunson D. Multi-task learning for analyzing and sorting large databases of sequential data. IEEE Transactions on Signal Processing. 2008 Aug 1;56(8 II):3918–31.

MLA: Ni, K., et al. “Multi-task learning for analyzing and sorting large databases of sequential data.” IEEE Transactions on Signal Processing, vol. 56, no. 8 II, Aug. 2008, pp. 3918–31. Scopus, doi:10.1109/TSP.2008.924798.

NLM: Ni K, Paisley J, Carin L, Dunson D. Multi-task learning for analyzing and sorting large databases of sequential data. IEEE Transactions on Signal Processing. 2008 Aug 1;56(8 II):3918–3931.
