Automated Social Text Annotation With Joint Multilabel Attention Networks.
Automated social text annotation is the task of suggesting a set of tags for documents shared on social media platforms. The automated annotation process can reduce users' cognitive overhead in tagging and improve tag management for better search, browsing, and recommendation of documents. It can be formulated as a multilabel classification problem. We propose a novel deep learning-based method for this problem and design an attention-based neural network with semantic-based regularization, which mimics users' reading and annotation behavior to form better document representations while leveraging the semantic relations among labels. The network models the title and the content of each document separately and applies an explicit, title-guided attention mechanism to each sentence. To exploit the correlation among labels, we propose two semantic-based loss regularizers, i.e., similarity and subsumption, which constrain the output of the network to conform to label semantics. The model with the semantic-based loss regularizers is referred to as the joint multilabel attention network (JMAN). We conducted a comprehensive evaluation study and compared JMAN to state-of-the-art baseline models on four large, real-world social media data sets. In terms of F1, JMAN significantly outperformed the bidirectional gated recurrent unit (Bi-GRU) by a relative margin of around 12.8%–78.6% and the hierarchical attention network (HAN) by around 3.9%–23.8%. The JMAN model also demonstrates advantages in convergence and training speed. Further performance improvements were observed relative to latent Dirichlet allocation (LDA) and support vector machine (SVM). When the semantic-based loss regularizers were applied to HAN and Bi-GRU, their F1 performance was also boosted. We also found that dynamically updating the label semantic matrices (JMANd) has the potential to further improve the performance of JMAN, but at the cost of substantial memory, and warrants further study.
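To make the semantic-based regularization concrete, the sketch below illustrates one plausible way such similarity and subsumption penalties could be attached to a standard multilabel loss. It is a minimal, hypothetical sketch in PyTorch, not the paper's exact formulation: the function name, the weighting parameters alpha and beta, and the squared-difference and hinge-style penalty forms are assumptions made for illustration, assuming only that predictions for semantically similar labels should agree and that a narrower label should not be predicted with higher confidence than the broader label that subsumes it.

```python
# Hypothetical sketch of semantic-based loss regularizers for a
# multilabel classifier; not the exact formulation used in JMAN.
import torch
import torch.nn.functional as F


def semantic_regularized_loss(logits, targets, sim, sub, alpha=0.1, beta=0.1):
    """Binary cross-entropy plus two illustrative semantic regularizers.

    logits:  (batch, L) raw scores for L labels
    targets: (batch, L) binary ground-truth tags
    sim:     (L, L) label-similarity weights in [0, 1]
    sub:     (L, L) subsumption weights; sub[i, j] > 0 means label i
             is narrower than (subsumed by) label j
    alpha, beta: regularizer strengths (assumed values for illustration)
    """
    probs = torch.sigmoid(logits)                      # (batch, L)
    bce = F.binary_cross_entropy_with_logits(logits, targets)

    # Similarity: similar labels should receive similar probabilities.
    diff = probs.unsqueeze(2) - probs.unsqueeze(1)     # (batch, L, L)
    sim_reg = (sim * diff.pow(2)).mean()

    # Subsumption: penalize a narrower label being predicted with
    # higher confidence than its broader (subsuming) label.
    violation = torch.relu(probs.unsqueeze(2) - probs.unsqueeze(1))
    sub_reg = (sub * violation.pow(2)).mean()

    return bce + alpha * sim_reg + beta * sub_reg


if __name__ == "__main__":
    L = 5
    logits = torch.randn(4, L)
    targets = torch.randint(0, 2, (4, L)).float()
    sim = torch.rand(L, L)                 # toy similarity matrix
    sub = (torch.rand(L, L) > 0.8).float() # toy subsumption matrix
    print(semantic_regularized_loss(logits, targets, sim, sub))
```

Under this reading, the label semantic matrices sim and sub are fixed inputs to the loss; the dynamically updated variant mentioned above (JMANd) would instead treat them as learnable, which explains the additional memory cost noted in the abstract.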