SNPpy--database management for SNP data from genome wide association studies.

Publication, Journal Article
Mitha, F; Herodotou, H; Borisov, N; Jiang, C; Yoder, J; Owzar, K
Published in: PLoS One
2011

BACKGROUND: We describe SNPpy, a hybrid script-database system that uses the Python SQLAlchemy library coupled with the PostgreSQL database to manage genotype data from Genome-Wide Association Studies (GWAS). The system makes it possible to merge study data with HapMap data and to merge across studies for meta-analyses, with data filtering based on the values of phenotype and Single-Nucleotide Polymorphism (SNP) data. SNPpy and its dependencies are open source software.

RESULTS: The current version of SNPpy offers utility functions to import genotype and annotation data from two commercial platforms. We use these to import data from two GWAS and the HapMap Project. We then export these individual datasets to standard data format files that can be imported into statistical software for downstream analyses.

CONCLUSIONS: By leveraging the power of relational databases, SNPpy offers integrated management and manipulation of genotype and phenotype data from GWAS. The analysis of these studies requires merging across GWAS datasets as well as patient and marker selection. To this end, SNPpy enables the user to filter the data and output the results in standardized GWAS file formats. It performs low-level, flexible data validation, including validation of patient data. SNPpy is a practical and extensible solution for investigators who seek to deploy central management of their GWAS data.
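The relational approach summarized in the abstract (merging across studies, selecting patients and markers, validating patient data, and exporting the result) can be illustrated with a minimal sketch. This is not SNPpy's schema or API: the table names, column names, toy values, and the functions `build_demo_db` and `export_filtered` are all hypothetical, and Python's built-in `sqlite3` module stands in for the SQLAlchemy and PostgreSQL stack so that the example stays self-contained.

```python
import sqlite3

# Hypothetical schema, for illustration only (not SNPpy's actual tables).
SCHEMA = """
PRAGMA foreign_keys = ON;
CREATE TABLE patient (
    id       TEXT PRIMARY KEY,
    study    TEXT NOT NULL,
    sex      TEXT CHECK (sex IN ('M', 'F')),   -- simple patient-data validation
    affected INTEGER CHECK (affected IN (0, 1))
);
CREATE TABLE snp (
    rsid  TEXT PRIMARY KEY,
    chrom TEXT NOT NULL,
    pos   INTEGER NOT NULL
);
CREATE TABLE genotype (
    patient_id  TEXT NOT NULL REFERENCES patient(id),
    rsid        TEXT NOT NULL REFERENCES snp(rsid),
    allele_call TEXT NOT NULL CHECK (allele_call IN ('AA', 'AB', 'BB', 'NC')),
    PRIMARY KEY (patient_id, rsid)
);
"""


def build_demo_db():
    """Create an in-memory database holding two made-up studies."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    # Toy values invented for this sketch; they are not real study data.
    conn.executemany(
        "INSERT INTO patient VALUES (?, ?, ?, ?)",
        [("A1", "study_a", "F", 1), ("A2", "study_a", "M", 0),
         ("B1", "study_b", "M", 1)],
    )
    conn.executemany(
        "INSERT INTO snp VALUES (?, ?, ?)",
        [("rs_demo1", "1", 1000), ("rs_demo2", "1", 2000),
         ("rs_demo3", "2", 500)],
    )
    conn.executemany(
        "INSERT INTO genotype VALUES (?, ?, ?)",
        [("A1", "rs_demo1", "AA"), ("A1", "rs_demo2", "AB"),
         ("A1", "rs_demo3", "BB"), ("A2", "rs_demo1", "AB"),
         ("A2", "rs_demo2", "NC"), ("B1", "rs_demo1", "BB"),
         ("B1", "rs_demo2", "NC"), ("B1", "rs_demo3", "AA")],
    )
    conn.commit()
    return conn


def export_filtered(conn, chrom, affected):
    """Merge all studies, filter by marker and phenotype, return TSV lines."""
    rows = conn.execute(
        """
        SELECT p.study, p.id, g.rsid, g.allele_call
        FROM genotype AS g
        JOIN patient AS p ON p.id = g.patient_id
        JOIN snp AS s ON s.rsid = g.rsid
        WHERE s.chrom = ? AND p.affected = ? AND g.allele_call <> 'NC'
        ORDER BY p.study, p.id, s.pos
        """,
        (chrom, affected),
    ).fetchall()
    return ["\t".join(row) for row in rows]


if __name__ == "__main__":
    demo = build_demo_db()
    # Affected patients from both studies, chromosome 1 markers only.
    for line in export_filtered(demo, chrom="1", affected=1):
        print(line)
```

In this sketch the `CHECK` and `REFERENCES` constraints play the role of low-level validation: an insert with an unrecognized `sex` code or a genotype for an unknown patient is rejected by the database itself rather than by application code. A real deployment along the lines the abstract describes would map such tables through SQLAlchemy onto PostgreSQL and write one of the standard GWAS file formats instead of plain tab-separated lines.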

Published In

PLoS One

DOI

10.1371/journal.pone.0024982

EISSN

1932-6203

Publication Date

2011

Volume

6

Issue

10

Start / End Page

e24982

Location

United States

Related Subject Headings

  • Software
  • Polymorphism, Single Nucleotide
  • Humans
  • Genome-Wide Association Study
  • General Science & Technology
  • Databases, Genetic
 

Citation

APA: Mitha, F., Herodotou, H., Borisov, N., Jiang, C., Yoder, J., & Owzar, K. (2011). SNPpy--database management for SNP data from genome wide association studies. PLoS One, 6(10), e24982. https://doi.org/10.1371/journal.pone.0024982
Chicago: Mitha, Faheem, Herodotos Herodotou, Nedyalko Borisov, Chen Jiang, Josh Yoder, and Kouros Owzar. “SNPpy--database management for SNP data from genome wide association studies.” PLoS One 6, no. 10 (2011): e24982. https://doi.org/10.1371/journal.pone.0024982.
ICMJE: Mitha F, Herodotou H, Borisov N, Jiang C, Yoder J, Owzar K. SNPpy--database management for SNP data from genome wide association studies. PLoS One. 2011;6(10):e24982.
MLA: Mitha, Faheem, et al. “SNPpy--database management for SNP data from genome wide association studies.” PLoS One, vol. 6, no. 10, 2011, p. e24982. Pubmed, doi:10.1371/journal.pone.0024982.
NLM: Mitha F, Herodotou H, Borisov N, Jiang C, Yoder J, Owzar K. SNPpy--database management for SNP data from genome wide association studies. PLoS One. 2011;6(10):e24982.
