Skip to main content

Semi-supervised text categorization by active search

Publication ,  Conference
Xu, Z; Jin, R; Huang, K; Lyu, MR; King, I
Published in: International Conference on Information and Knowledge Management, Proceedings
December 1, 2008

In automated text categorization, given a small number of labeled documents, it is very challenging, if not impossible, to build a reliable classifier that is able to achieve high clas- sification accuracy. To address this problem, a novel web-assisted text categorization framework is proposed in this paper. Important keywords are first automatically identified from the available labeled documents to form the queries. Search engines are then utilized to retrieve from the Web a multitude of relevant documents, which are then exploited by a semi-supervised framework. To our best knowledge, this work is the first study of this kind. Extensive experi-mental study shows the encouraging results of the proposed text categorization framework: using Google as the web search engine, the proposed framework is able to reduce the classification error by 30% when compared with the state- of-the-art supervised text categorization method.

Duke Scholars

Published In

International Conference on Information and Knowledge Management, Proceedings

DOI

Publication Date

December 1, 2008

Start / End Page

1517 / 1518
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Xu, Z., Jin, R., Huang, K., Lyu, M. R., & King, I. (2008). Semi-supervised text categorization by active search. In International Conference on Information and Knowledge Management, Proceedings (pp. 1517–1518). https://doi.org/10.1145/1458082.1458364
Xu, Z., R. Jin, K. Huang, M. R. Lyu, and I. King. “Semi-supervised text categorization by active search.” In International Conference on Information and Knowledge Management, Proceedings, 1517–18, 2008. https://doi.org/10.1145/1458082.1458364.
Xu Z, Jin R, Huang K, Lyu MR, King I. Semi-supervised text categorization by active search. In: International Conference on Information and Knowledge Management, Proceedings. 2008. p. 1517–8.
Xu, Z., et al. “Semi-supervised text categorization by active search.” International Conference on Information and Knowledge Management, Proceedings, 2008, pp. 1517–18. Scopus, doi:10.1145/1458082.1458364.
Xu Z, Jin R, Huang K, Lyu MR, King I. Semi-supervised text categorization by active search. International Conference on Information and Knowledge Management, Proceedings. 2008. p. 1517–1518.

Published In

International Conference on Information and Knowledge Management, Proceedings

DOI

Publication Date

December 1, 2008

Start / End Page

1517 / 1518