Scholars@Duke publication: Practical outcomes of applying ensemble machine learning classifiers to High-Throughput Screening (HTS) data analysis and screening.

Practical outcomes of applying ensemble machine learning classifiers to High-Throughput Screening (HTS) data analysis and screening.

Publication , Journal Article

Simmons, K; Kinney, J; Owens, A; Kleier, DA; Bloch, K; Argentar, D; Walsh, A; Vaidyanathan, G

Published in: J Chem Inf Model

November 2008

Over the years numerous papers have presented the effectiveness of various machine learning methods in analyzing drug discovery biological screening data. The predictive performance of models developed using these methods has traditionally been evaluated by assessing performance of the developed models against a portion of the data randomly selected for holdout. It has been our experience that such assessments, while widely practiced, result in an optimistic assessment. This paper describes the development of a series of ensemble-based decision tree models, shares our experience at various stages in the model development process, and presents the impact of such models when they are applied to vendor offerings and the forecasted compounds are acquired and screened in the relevant assays. We have seen that well developed models can significantly increase the hit-rates observed in HTS campaigns.

Duke Scholars

Author Ganesan Vaidyanathan Radiology

Published In

J Chem Inf Model

DOI

10.1021/ci800164u

ISSN

1549-9596

Publication Date

November 2008

Volume

Issue

Start / End Page

2196 / 2206

Location

United States

Related Subject Headings

Neural Networks, Computer
Molecular Structure
Medicinal & Biomolecular Chemistry
Informatics
Drug Evaluation, Preclinical
Drug Discovery
Decision Trees
Data Interpretation, Statistical
Artificial Intelligence
3407 Theoretical and computational chemistry

Citation

APA

Chicago

ICMJE

MLA

NLM

Simmons, K., Kinney, J., Owens, A., Kleier, D. A., Bloch, K., Argentar, D., … Vaidyanathan, G. (2008). Practical outcomes of applying ensemble machine learning classifiers to High-Throughput Screening (HTS) data analysis and screening. J Chem Inf Model, 48(11), 2196–2206. https://doi.org/10.1021/ci800164u

Simmons, Kirk, John Kinney, Aaron Owens, Daniel A. Kleier, Karen Bloch, Dave Argentar, Alicia Walsh, and Ganesh Vaidyanathan. “Practical outcomes of applying ensemble machine learning classifiers to High-Throughput Screening (HTS) data analysis and screening.” J Chem Inf Model 48, no. 11 (November 2008): 2196–2206. https://doi.org/10.1021/ci800164u.

Simmons K, Kinney J, Owens A, Kleier DA, Bloch K, Argentar D, et al. Practical outcomes of applying ensemble machine learning classifiers to High-Throughput Screening (HTS) data analysis and screening. J Chem Inf Model. 2008 Nov;48(11):2196–206.

Simmons, Kirk, et al. “Practical outcomes of applying ensemble machine learning classifiers to High-Throughput Screening (HTS) data analysis and screening.” J Chem Inf Model, vol. 48, no. 11, Nov. 2008, pp. 2196–206. Pubmed, doi:10.1021/ci800164u.

Simmons K, Kinney J, Owens A, Kleier DA, Bloch K, Argentar D, Walsh A, Vaidyanathan G. Practical outcomes of applying ensemble machine learning classifiers to High-Throughput Screening (HTS) data analysis and screening. J Chem Inf Model. 2008 Nov;48(11):2196–2206.

Published In

J Chem Inf Model

DOI

10.1021/ci800164u

ISSN

1549-9596

Publication Date

November 2008

Volume

Issue

Start / End Page

2196 / 2206

Location

United States

Related Subject Headings

Neural Networks, Computer
Molecular Structure
Medicinal & Biomolecular Chemistry
Informatics
Drug Evaluation, Preclinical
Drug Discovery
Decision Trees
Data Interpretation, Statistical
Artificial Intelligence
3407 Theoretical and computational chemistry