Selection of examples in case-based computer-aided decision systems.

Journal Article (Journal Article)

Case-based computer-aided decision (CB-CAD) systems rely on a database of previously stored, known examples when classifying new, incoming queries. Such systems can be particularly useful since they do not need retraining every time a new example is deposited in the case base. The adaptive nature of case-based systems is well suited to the current trend of continuously expanding digital databases in the medical domain. To maintain efficiency, however, such systems need sophisticated strategies to effectively manage the available evidence database. In this paper, we discuss the general problem of building an evidence database by selecting the most useful examples to store while satisfying existing storage requirements. We evaluate three intelligent techniques for this purpose: genetic algorithm-based selection, greedy selection and random mutation hill climbing. These techniques are compared to a random selection strategy used as the baseline. The study is performed with a previously presented CB-CAD system applied for false positive reduction in screening mammograms. The experimental evaluation shows that when the development goal is to maximize the system's diagnostic performance, the intelligent techniques are able to reduce the size of the evidence database to 37% of the original database by eliminating superfluous and/or detrimental examples while at the same time significantly improving the CAD system's performance. Furthermore, if the case-base size is a main concern, the total number of examples stored in the system can be reduced to only 2-4% of the original database without a decrease in the diagnostic performance. Comparison of the techniques shows that random mutation hill climbing provides the best balance between the diagnostic performance and computational efficiency when building the evidence database of the CB-CAD system.

Full Text

Duke Authors

Cited Authors

  • Mazurowski, MA; Zurada, JM; Tourassi, GD

Published Date

  • November 7, 2008

Published In

Volume / Issue

  • 53 / 21

Start / End Page

  • 6079 - 6096

PubMed ID

  • 18854606

Pubmed Central ID

  • PMC3835388

International Standard Serial Number (ISSN)

  • 0031-9155

Digital Object Identifier (DOI)

  • 10.1088/0031-9155/53/21/013


  • eng

Conference Location

  • England