TFBSshape: a motif database for DNA shape features of transcription factor binding sites.

Published

Journal Article

Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein-DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone.

Full Text

Duke Authors

Cited Authors

  • Yang, L; Zhou, T; Dror, I; Mathelier, A; Wasserman, WW; Gordân, R; Rohs, R

Published Date

  • January 2014

Published In

Volume / Issue

  • 42 / Database issue

Start / End Page

  • D148 - D155

PubMed ID

  • 24214955

Pubmed Central ID

  • 24214955

Electronic International Standard Serial Number (EISSN)

  • 1362-4962

International Standard Serial Number (ISSN)

  • 0305-1048

Digital Object Identifier (DOI)

  • 10.1093/nar/gkt1087

Language

  • eng