Combining classification trees using MLE

Journal Article

We propose a probability distribution for an equivalence class of classification trees (that is, those that ignore the value of the cutpoints but retain tree structure). This distribution is parameterized by a central tree structure representing the true model, and a precision or concentration coefficient representing the variability around the central tree. We use this distribution to model an observed set of classification trees exhibiting variability in tree structure. We propose the maximum likelihood estimate of the central tree as the best tree to represent the set. This MLE retains the interpretability of a single tree model and has excellent generalizability. We implement an ascent search for the MLE tree structure using a data set of 13 classification trees that predict the presence or absence of cancer based on immune system parameters.

Full Text

Duke Authors

Cited Authors

  • Shannon, WD; Banks, D

Published Date

  • 1999

Published In

Volume / Issue

  • 18 / 6

Start / End Page

  • 727 - 740

PubMed ID

  • 10204200

International Standard Serial Number (ISSN)

  • 0277-6715

Digital Object Identifier (DOI)

  • 10.1002/(SICI)1097-0258(19990330)18:6<727::AID-SIM61>3.0.CO;2-2