Review of current methods, applications, and data management for the bioinformatics analysis of whole exome sequencing.

Publication, Journal Article
Bao, R; Huang, L; Andrade, J; Tan, W; Kibbe, WA; Jiang, H; Feng, G
Published in: Cancer Inform
2014

The advent of next-generation sequencing technologies has greatly promoted advances in the study of human diseases at the genomic, transcriptomic, and epigenetic levels. Exome sequencing, where the coding region of the genome is captured and sequenced at a deep level, has proven to be a cost-effective method to detect disease-causing variants and discover gene targets. In this review, we outline the general framework of whole exome sequence data analysis. We focus on established bioinformatics tools and applications that support five analytical steps: raw data quality assessment, pre-processing, alignment, post-processing, and variant analysis (detection, annotation, and prioritization). We evaluate the performance of open-source alignment programs and variant calling tools using simulated and benchmark datasets, and highlight the challenges posed by the lack of concordance among variant detection tools. Based on these results, we recommend adopting multiple tools and resources to reduce false positives and increase the sensitivity of variant calling. In addition, we briefly discuss the current status and solutions for big data management, analysis, and summarization in the field of bioinformatics.
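
The recommendation to combine several variant callers can be illustrated with a minimal sketch. It assumes each caller's output has already been reduced to a set of (chromosome, position, ref, alt) keys. The function name `summarize_concordance`, the caller labels, and the toy variants are hypothetical and do not come from the review. Taking the union of call sets favors sensitivity, while requiring support from at least `min_callers` tools is one simple way to reduce false positives.

```python
from collections import Counter
from typing import Dict, Set, Tuple

# A variant is keyed by (chromosome, 1-based position, ref allele, alt allele).
Variant = Tuple[str, int, str, str]


def summarize_concordance(call_sets: Dict[str, Set[Variant]], min_callers: int = 2):
    """Return (union, consensus, jaccard) for call sets from several callers.

    union     : variants reported by at least one caller (most sensitive)
    consensus : variants reported by at least `min_callers` callers
    jaccard   : fraction of the union that every caller agrees on
    """
    # Each call set holds unique keys, so the count equals the number of supporting callers.
    support = Counter(v for calls in call_sets.values() for v in calls)
    union = set(support)
    consensus = {v for v, n in support.items() if n >= min_callers}
    shared_by_all = {v for v, n in support.items() if n == len(call_sets)}
    jaccard = len(shared_by_all) / len(union) if union else 1.0
    return union, consensus, jaccard


# Toy, made-up call sets for three hypothetical callers.
calls = {
    "caller_a": {("chr1", 100, "A", "G"), ("chr1", 250, "C", "T"), ("chr2", 40, "G", "A")},
    "caller_b": {("chr1", 100, "A", "G"), ("chr1", 250, "C", "T")},
    "caller_c": {("chr1", 100, "A", "G"), ("chr3", 7, "T", "C")},
}
union, consensus, jaccard = summarize_concordance(calls)
print(len(union), len(consensus), round(jaccard, 2))  # prints: 4 2 0.25
```

In practice, variants from different tools would need normalization (for example, consistent indel representation) before keys can be compared this way; the sketch leaves that step out.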

Published In

Cancer Inform

DOI

10.4137/CIN.S13779

ISSN

1176-9351

Publication Date

2014

Volume

13

Issue

Suppl 2

Start / End Page

67 / 82

Location

United States

Related Subject Headings

  • Bioinformatics
  • 4003 Biomedical engineering
  • 3211 Oncology and carcinogenesis
  • 1112 Oncology and Carcinogenesis
 

Citation

APA
Bao, R., Huang, L., Andrade, J., Tan, W., Kibbe, W. A., Jiang, H., & Feng, G. (2014). Review of current methods, applications, and data management for the bioinformatics analysis of whole exome sequencing. Cancer Inform, 13(Suppl 2), 67–82. https://doi.org/10.4137/CIN.S13779

Chicago
Bao, Riyue, Lei Huang, Jorge Andrade, Wei Tan, Warren A. Kibbe, Hongmei Jiang, and Gang Feng. “Review of current methods, applications, and data management for the bioinformatics analysis of whole exome sequencing.” Cancer Inform 13, no. Suppl 2 (2014): 67–82. https://doi.org/10.4137/CIN.S13779.

ICMJE
Bao R, Huang L, Andrade J, Tan W, Kibbe WA, Jiang H, et al. Review of current methods, applications, and data management for the bioinformatics analysis of whole exome sequencing. Cancer Inform. 2014;13(Suppl 2):67–82.

MLA
Bao, Riyue, et al. “Review of current methods, applications, and data management for the bioinformatics analysis of whole exome sequencing.” Cancer Inform, vol. 13, no. Suppl 2, 2014, pp. 67–82. PubMed, doi:10.4137/CIN.S13779.

NLM
Bao R, Huang L, Andrade J, Tan W, Kibbe WA, Jiang H, Feng G. Review of current methods, applications, and data management for the bioinformatics analysis of whole exome sequencing. Cancer Inform. 2014;13(Suppl 2):67–82.
