Evaluation of preprocessing techniques for chief complaint classification.
To determine whether preprocessing chief complaints before automatically classifying them into syndromic categories improves classification performance.We preprocessed chief complaints using two preprocessors (CCP and EMT-P) and evaluated whether classification performance increased for a probabilistic classifier (CoCo) or for a keyword-based classifier (modification of the NYC Department of Health and Mental Hygiene chief complaint coder (KC)).CCP exhibited high accuracy (85%) in preprocessing chief complaints but only slightly improved CoCo's classification performance for a few syndromes. EMT-P, which splits chief complaints into multiple problems, substantially increased CoCo's sensitivity for all syndromes. Preprocessing with CCP or EMT-P only improved KC's sensitivity for the Constitutional syndrome.Evaluation of preprocessing systems should not be limited to accuracy of the preprocessor but should include the effect of preprocessing on syndromic classification. Splitting chief complaints into multiple problems before classification is important for CoCo, but other preprocessing steps only slightly improved classification performance for CoCo and a keyword-based classifier.
Duke Scholars
Published In
DOI
EISSN
ISSN
Publication Date
Volume
Issue
Start / End Page
Related Subject Headings
- Terminology as Topic
- Syndrome
- Population Surveillance
- Pattern Recognition, Automated
- Natural Language Processing
- Medical Informatics
- Diagnosis, Computer-Assisted
- Biomedical Engineering
- Artificial Intelligence
- 4601 Applied computing
Citation
Published In
DOI
EISSN
ISSN
Publication Date
Volume
Issue
Start / End Page
Related Subject Headings
- Terminology as Topic
- Syndrome
- Population Surveillance
- Pattern Recognition, Automated
- Natural Language Processing
- Medical Informatics
- Diagnosis, Computer-Assisted
- Biomedical Engineering
- Artificial Intelligence
- 4601 Applied computing