Skip to main content

Genotypic data in relational databases: Efficient storage and rapid retrieval

Publication ,  Conference
Lichtenwalter, RN; Zorina-Lichtenwalter, K; Diatchenko, L
Published in: Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics
January 1, 2017

As technologies to produce genotypic data have become less expensive, the widths and depths of such data have sharply increased. Relational databases have performed poorly in this domain. Data storage and retrieval is now mostly conducted by highly coupled and specialized software packages and file formats, but relational databases offer advantages if the domain challenges can be overcome. We revisit their feasibility as a tool for efficiently storing and querying extremely large genotypic data sets. We describe a technique for managing genotypic data in the PostgreSQL relational database, compare it to common existing techniques for storing and querying genotypic data, and demonstrate that it can greatly reduce both query times and storage requirements.

Duke Scholars

Published In

Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics

DOI

EISSN

1611-3349

ISSN

0302-9743

Publication Date

January 1, 2017

Volume

10509 LNCS

Start / End Page

408 / 421

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Lichtenwalter, R. N., Zorina-Lichtenwalter, K., & Diatchenko, L. (2017). Genotypic data in relational databases: Efficient storage and rapid retrieval. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics (Vol. 10509 LNCS, pp. 408–421). https://doi.org/10.1007/978-3-319-66917-5_27
Lichtenwalter, R. N., K. Zorina-Lichtenwalter, and L. Diatchenko. “Genotypic data in relational databases: Efficient storage and rapid retrieval.” In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 10509 LNCS:408–21, 2017. https://doi.org/10.1007/978-3-319-66917-5_27.
Lichtenwalter RN, Zorina-Lichtenwalter K, Diatchenko L. Genotypic data in relational databases: Efficient storage and rapid retrieval. In: Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics. 2017. p. 408–21.
Lichtenwalter, R. N., et al. “Genotypic data in relational databases: Efficient storage and rapid retrieval.” Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, vol. 10509 LNCS, 2017, pp. 408–21. Scopus, doi:10.1007/978-3-319-66917-5_27.
Lichtenwalter RN, Zorina-Lichtenwalter K, Diatchenko L. Genotypic data in relational databases: Efficient storage and rapid retrieval. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics. 2017. p. 408–421.

Published In

Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics

DOI

EISSN

1611-3349

ISSN

0302-9743

Publication Date

January 1, 2017

Volume

10509 LNCS

Start / End Page

408 / 421

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences