Scholars@Duke publication: Development and Validation of a Natural Language Processing Tool to Generate the CONSORT Reporting Checklist for Randomized Clinical Trials.

Publication, Journal Article

Wang, F; Schilsky, RL; Page, D; Califf, RM; Cheung, K; Wang, X; Pang, H

Published in: JAMA Netw Open

October 1, 2020

IMPORTANCE: Adherence to the Consolidated Standards of Reporting Trials (CONSORT) for randomized clinical trials is associated with improving quality because inadequate reporting in randomized clinical trials may complicate the interpretation and the application of findings to clinical care.

OBJECTIVE: To evaluate an automated reporting checklist generation tool that uses natural language processing (NLP), called CONSORT-NLP.

DESIGN, SETTING, AND PARTICIPANTS: This study used published journal articles as training, testing, and validation sets to develop, refine, and evaluate the CONSORT-NLP tool. Articles reporting randomized clinical trials were selected from 25 high-impact-factor journals under the following categories: (1) general and internal medicine, (2) oncology, and (3) cardiac and cardiovascular systems.

MAIN OUTCOMES AND MEASURES: For an evaluation of the performance of this tool, an accuracy metric defined as the number of correct assessments divided by all assessments was calculated.

RESULTS: The CONSORT-NLP tool uses the widely used Portable Document Format as an input file. Of the 37 CONSORT reporting items, 34 (92%) were included in the tool. Of these 34 reporting items, 30 were fully implemented; 28 (93%) of the fully implemented CONSORT reporting items had an accuracy of more than 90% for the validation set. The remaining 2 (7%) had an accuracy between 80% and 90% for the validation set. Two to 5 articles were selected from each of these journals for a total of 158 articles to establish a training set of 111 articles to train CONSORT-NLP for CONSORT reporting items, a testing set of 25 articles to refine CONSORT-NLP, and a validation set of 22 articles to assess the performance of CONSORT-NLP. The CONSORT-NLP tool used the Portable Document Format of the articles as input files. A CONSORT-NLP graphical user interface was built using Java in 2019. The time required to complete the CONSORT checklist manually vs using the CONSORT-NLP tool was compared for 30 articles. Two case studies for randomized clinical trials are provided as an illustration for the CONSORT-NLP tool. For the 30 articles investigated, CONSORT-NLP required a mean (SD) of 23.0 (4.1) seconds, whereas the 3 manual reviewers required a mean (SD) of 11.9 (2.2), 22.6 (4.6), and 57.6 (7.1) minutes, respectively.

CONCLUSIONS AND RELEVANCE: The CONSORT-NLP tool is designed to assist in the reporting of randomized clinical trials. Potential users of CONSORT-NLP include clinicians, researchers, and scientists who plan to publish a randomized trial study in a peer-reviewed journal. The use of CONSORT-NLP may help them save substantial time when generating the CONSORT checklist. This tool may also be useful for manuscript reviewers and journal editors who review these articles.
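
The accuracy metric described in the abstract (correct assessments divided by all assessments) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `accuracy` and `pct` and the sample assessments are hypothetical, and only the counts passed to `pct` and the set sizes come from the abstract.

```python
def accuracy(predicted, reference):
    """Return the share of assessments on which the tool matches the reference."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not predicted:
        raise ValueError("at least one assessment is required")
    correct = sum(p == r for p, r in zip(predicted, reference))
    return correct / len(predicted)


def pct(numerator, denominator):
    """Return a rounded whole-number percentage."""
    return round(100 * numerator / denominator)


# Hypothetical example: one reporting item assessed across five articles
# (True = item reported). These values are invented for illustration.
tool_assessments = [True, True, False, True, True]
manual_assessments = [True, True, False, False, True]
print(accuracy(tool_assessments, manual_assessments))  # 0.8

# Counts reported in the abstract, rechecked as percentages.
print(pct(34, 37))  # 92: items included in the tool
print(pct(28, 30))  # 93: fully implemented items with accuracy above 90%
print(pct(2, 30))   # 7: fully implemented items with accuracy of 80% to 90%

# Training, testing, and validation sets should sum to the 158 articles.
print(111 + 25 + 22)  # 158
```

An item-level accuracy of this kind treats every assessment equally, so it does not distinguish between missed items and items flagged in error.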

Duke Scholars

Author: Xiaofei Wang, Biostatistics & Bioinformatics, Division of Biostatistics

Author: Herbert Pang, Biostatistics & Bioinformatics, Division of Biostatistics

Author: David Page, Biostatistics & Bioinformatics, Division of Biostatistics

Published In

JAMA Netw Open

DOI

10.1001/jamanetworkopen.2020.14661

EISSN

2574-3805

Publication Date

October 1, 2020

Volume

3

Issue

10

Start / End Page

e2014661

Location

United States

Related Subject Headings

Research Report
Reproducibility of Results
Randomized Controlled Trials as Topic
Natural Language Processing
Humans
Cross-Sectional Studies
Checklist
Biomedical Research
42 Health sciences
32 Biomedical and clinical sciences

Citation

APA: Wang, F., Schilsky, R. L., Page, D., Califf, R. M., Cheung, K., Wang, X., & Pang, H. (2020). Development and Validation of a Natural Language Processing Tool to Generate the CONSORT Reporting Checklist for Randomized Clinical Trials. JAMA Netw Open, 3(10), e2014661. https://doi.org/10.1001/jamanetworkopen.2020.14661

Chicago: Wang, Fan, Richard L. Schilsky, David Page, Robert M. Califf, Kei Cheung, Xiaofei Wang, and Herbert Pang. “Development and Validation of a Natural Language Processing Tool to Generate the CONSORT Reporting Checklist for Randomized Clinical Trials.” JAMA Netw Open 3, no. 10 (October 1, 2020): e2014661. https://doi.org/10.1001/jamanetworkopen.2020.14661.

ICMJE: Wang F, Schilsky RL, Page D, Califf RM, Cheung K, Wang X, et al. Development and Validation of a Natural Language Processing Tool to Generate the CONSORT Reporting Checklist for Randomized Clinical Trials. JAMA Netw Open. 2020 Oct 1;3(10):e2014661.

MLA: Wang, Fan, et al. “Development and Validation of a Natural Language Processing Tool to Generate the CONSORT Reporting Checklist for Randomized Clinical Trials.” JAMA Netw Open, vol. 3, no. 10, Oct. 2020, p. e2014661. Pubmed, doi:10.1001/jamanetworkopen.2020.14661.

NLM: Wang F, Schilsky RL, Page D, Califf RM, Cheung K, Wang X, Pang H. Development and Validation of a Natural Language Processing Tool to Generate the CONSORT Reporting Checklist for Randomized Clinical Trials. JAMA Netw Open. 2020 Oct 1;3(10):e2014661.
