Learning classifiers from imbalanced data based on biased minimax probability machine
We consider the problem of binary classification on imbalanced data, in which nearly all the instances carry one label, while far fewer instances carry the other label, usually that of the more important class. Traditional machine learning methods that seek accurate performance over the full range of instances are not suitable for this problem, since they tend to classify all the data into the majority, and usually less important, class. Moreover, some current methods try to utilize intermediate factors, e.g., the distribution of the training set, the decision thresholds, or the cost matrices, to influence the bias of the classification. However, it remains uncertain whether these methods can improve performance in a systematic way. In this paper, we propose a novel model named the Biased Minimax Probability Machine. Different from previous methods, this model directly controls the worst-case real accuracy of classification on future data in order to build biased classifiers. Hence, it provides a rigorous treatment of imbalanced data. Experimental results comparing the novel model with three competitive methods, i.e., the Naive Bayesian classifier, the k-Nearest Neighbor method, and the decision tree method C4.5, demonstrate the superiority of our novel model.
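To make the "worst-case accuracy" idea concrete, the following sketch illustrates the distribution-free bound underlying minimax probability machines: for any distribution with a given mean and covariance, the probability that a class falls on the correct side of a hyperplane is at least kappa^2 / (1 + kappa^2), where kappa is the signed margin of the class mean normalized by the projected standard deviation. The hyperplane, class statistics, and threshold below are illustrative assumptions, not the paper's learned solution; a biased variant would maximize the minority-class bound alpha subject to a floor on the majority-class bound beta.

```python
import numpy as np

def worst_case_accuracy(w, b, mean, cov):
    """Distribution-free lower bound on P(w . x >= b) over all
    distributions with the given mean and covariance:
        alpha = kappa^2 / (1 + kappa^2),
        kappa = (w . mean - b) / sqrt(w . cov . w).
    Returns 0 when the class mean lies on the wrong side."""
    margin = w @ mean - b
    if margin <= 0:
        return 0.0
    kappa = margin / np.sqrt(w @ cov @ w)
    return kappa ** 2 / (1.0 + kappa ** 2)

# Toy imbalanced setting (hypothetical statistics): x is the rare,
# important class; y is the abundant majority class.
mean_x, cov_x = np.array([2.0, 2.0]), 0.5 * np.eye(2)
mean_y, cov_y = np.array([-1.0, -1.0]), np.eye(2)

# A hand-picked candidate hyperplane w . z = b for illustration.
w, b = np.array([1.0, 1.0]), 0.5

alpha = worst_case_accuracy(w, b, mean_x, cov_x)    # minority-side bound
beta = worst_case_accuracy(-w, -b, mean_y, cov_y)   # majority-side bound

# A biased classifier trades beta down to push alpha up, giving a
# guaranteed accuracy on the important class rather than an implicit bias.
print(alpha > beta)
```

Note that the bound holds for every distribution sharing those first two moments, which is what makes the treatment of future data "rigorous" in the worst-case sense described above.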