The Chado Natural Diversity module: a new generic database schema for large-scale phenotyping and genotyping data.

Journal Article (Journal Article)

Linking phenotypic with genotypic diversity has become a major requirement for basic and applied genome-centric biological research. To meet this need, a comprehensive database backend for efficiently storing, querying and analyzing large experimental data sets is necessary. Chado, a generic, modular, community-based database schema is widely used in the biological community to store information associated with genome sequence data. To meet the need to also accommodate large-scale phenotyping and genotyping projects, a new Chado module called Natural Diversity has been developed. The module strictly adheres to the Chado remit of being generic and ontology driven. The flexibility of the new module is demonstrated in its capacity to store any type of experiment that either uses or generates specimens or stock organisms. Experiments may be grouped or structured hierarchically, whereas any kind of biological entity can be stored as the observed unit, from a specimen to be used in genotyping or phenotyping experiments, to a group of species collected in the field that will undergo further lab analysis. We describe details of the Natural Diversity module, including the design approach, the relational schema and use cases implemented in several databases.

Full Text

Duke Authors

Cited Authors

  • Jung, S; Menda, N; Redmond, S; Buels, RM; Friesen, M; Bendana, Y; Sanderson, L-A; Lapp, H; Lee, T; MacCallum, B; Bett, KE; Cain, S; Clements, D; Mueller, LA; Main, D

Published Date

  • January 2011

Published In

Volume / Issue

  • 2011 /

Start / End Page

  • bar051 -

PubMed ID

  • 22120662

Pubmed Central ID

  • PMC3225077

Electronic International Standard Serial Number (EISSN)

  • 1758-0463

International Standard Serial Number (ISSN)

  • 1758-0463

Digital Object Identifier (DOI)

  • 10.1093/database/bar051

Language

  • eng