Skip to main content

Impact of low class prevalence on the performance evaluation of neural network based classifiers: Experimental study in the context of computer-assisted medical diagnosis

Publication ,  Journal Article
Mazurowski, MA; Habas, PA; Zurada, JM; Tourassi, GD
Published in: IEEE International Conference on Neural Networks - Conference Proceedings
December 1, 2007

This paper presents an experimental study on the impact of low class prevalence on the neural network based classifier performance as measured using Receiver Operator Characteristic (ROC) analysis. Two methods of dealing with the problem are investigated: oversampling and undersampling in the context of varying the class prevalence and the size of training datasets with uncorrelated and correlated features. The results show that the class imbalance can significantly decrease the classifier performance especially in the case of small training datasets. Furthermore, the oversampling method is shown to be more effective than the undersampling method in compensating the class imbalance. Statistically significant differences, however, are observed only in the cases with large total number of samples and very low prevalence. ©2007 IEEE.

Duke Scholars

Published In

IEEE International Conference on Neural Networks - Conference Proceedings

DOI

ISSN

1098-7576

Publication Date

December 1, 2007

Start / End Page

2005 / 2009
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Mazurowski, M. A., Habas, P. A., Zurada, J. M., & Tourassi, G. D. (2007). Impact of low class prevalence on the performance evaluation of neural network based classifiers: Experimental study in the context of computer-assisted medical diagnosis. IEEE International Conference on Neural Networks - Conference Proceedings, 2005–2009. https://doi.org/10.1109/IJCNN.2007.4371266
Mazurowski, M. A., P. A. Habas, J. M. Zurada, and G. D. Tourassi. “Impact of low class prevalence on the performance evaluation of neural network based classifiers: Experimental study in the context of computer-assisted medical diagnosis.” IEEE International Conference on Neural Networks - Conference Proceedings, December 1, 2007, 2005–9. https://doi.org/10.1109/IJCNN.2007.4371266.
Mazurowski MA, Habas PA, Zurada JM, Tourassi GD. Impact of low class prevalence on the performance evaluation of neural network based classifiers: Experimental study in the context of computer-assisted medical diagnosis. IEEE International Conference on Neural Networks - Conference Proceedings. 2007 Dec 1;2005–9.
Mazurowski, M. A., et al. “Impact of low class prevalence on the performance evaluation of neural network based classifiers: Experimental study in the context of computer-assisted medical diagnosis.” IEEE International Conference on Neural Networks - Conference Proceedings, Dec. 2007, pp. 2005–09. Scopus, doi:10.1109/IJCNN.2007.4371266.
Mazurowski MA, Habas PA, Zurada JM, Tourassi GD. Impact of low class prevalence on the performance evaluation of neural network based classifiers: Experimental study in the context of computer-assisted medical diagnosis. IEEE International Conference on Neural Networks - Conference Proceedings. 2007 Dec 1;2005–2009.

Published In

IEEE International Conference on Neural Networks - Conference Proceedings

DOI

ISSN

1098-7576

Publication Date

December 1, 2007

Start / End Page

2005 / 2009