Skip to main content
Journal cover image

Maximum margin semi-supervised learning with irrelevant data.

Publication ,  Journal Article
Yang, H; Huang, K; King, I; Lyu, MR
Published in: Neural networks : the official journal of the International Neural Network Society
October 2015

Semi-supervised learning (SSL) is a typical learning paradigms training a model from both labeled and unlabeled data. The traditional SSL models usually assume unlabeled data are relevant to the labeled data, i.e., following the same distributions of the targeted labeled data. In this paper, we address a different, yet formidable scenario in semi-supervised classification, where the unlabeled data may contain irrelevant data to the labeled data. To tackle this problem, we develop a maximum margin model, named tri-class support vector machine (3C-SVM), to utilize the available training data, while seeking a hyperplane for separating the targeted data well. Our 3C-SVM exhibits several characteristics and advantages. First, it does not need any prior knowledge and explicit assumption on the data relatedness. On the contrary, it can relieve the effect of irrelevant unlabeled data based on the logistic principle and maximum entropy principle. That is, 3C-SVM approaches an ideal classifier. This classifier relies heavily on labeled data and is confident on the relevant data lying far away from the decision hyperplane, while maximally ignoring the irrelevant data, which are hardly distinguished. Second, theoretical analysis is provided to prove that in what condition, the irrelevant data can help to seek the hyperplane. Third, 3C-SVM is a generalized model that unifies several popular maximum margin models, including standard SVMs, Semi-supervised SVMs (S(3)VMs), and SVMs learned from the universum (U-SVMs) as its special cases. More importantly, we deploy a concave-convex produce to solve the proposed 3C-SVM, transforming the original mixed integer programming, to a semi-definite programming relaxation, and finally to a sequence of quadratic programming subproblems, which yields the same worst case time complexity as that of S(3)VMs. Finally, we demonstrate the effectiveness and efficiency of our proposed 3C-SVM through systematical experimental comparisons.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Neural networks : the official journal of the International Neural Network Society

DOI

EISSN

1879-2782

ISSN

0893-6080

Publication Date

October 2015

Volume

70

Start / End Page

90 / 102

Related Subject Headings

  • Support Vector Machine
  • Supervised Machine Learning
  • Models, Statistical
  • Models, Neurological
  • Image Processing, Computer-Assisted
  • Entropy
  • Data Interpretation, Statistical
  • Artificial Intelligence & Image Processing
  • Artificial Intelligence
  • Algorithms
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Yang, H., Huang, K., King, I., & Lyu, M. R. (2015). Maximum margin semi-supervised learning with irrelevant data. Neural Networks : The Official Journal of the International Neural Network Society, 70, 90–102. https://doi.org/10.1016/j.neunet.2015.06.004
Yang, Haiqin, Kaizhu Huang, Irwin King, and Michael R. Lyu. “Maximum margin semi-supervised learning with irrelevant data.Neural Networks : The Official Journal of the International Neural Network Society 70 (October 2015): 90–102. https://doi.org/10.1016/j.neunet.2015.06.004.
Yang H, Huang K, King I, Lyu MR. Maximum margin semi-supervised learning with irrelevant data. Neural networks : the official journal of the International Neural Network Society. 2015 Oct;70:90–102.
Yang, Haiqin, et al. “Maximum margin semi-supervised learning with irrelevant data.Neural Networks : The Official Journal of the International Neural Network Society, vol. 70, Oct. 2015, pp. 90–102. Epmc, doi:10.1016/j.neunet.2015.06.004.
Yang H, Huang K, King I, Lyu MR. Maximum margin semi-supervised learning with irrelevant data. Neural networks : the official journal of the International Neural Network Society. 2015 Oct;70:90–102.
Journal cover image

Published In

Neural networks : the official journal of the International Neural Network Society

DOI

EISSN

1879-2782

ISSN

0893-6080

Publication Date

October 2015

Volume

70

Start / End Page

90 / 102

Related Subject Headings

  • Support Vector Machine
  • Supervised Machine Learning
  • Models, Statistical
  • Models, Neurological
  • Image Processing, Computer-Assisted
  • Entropy
  • Data Interpretation, Statistical
  • Artificial Intelligence & Image Processing
  • Artificial Intelligence
  • Algorithms