Comprehensive genome-wide transcription factor analysis reveals that a combination of high affinity and low affinity DNA binding is needed for human gene regulation

BMC Genomics. 2015;16 Suppl 7(Suppl 7):S12. doi: 10.1186/1471-2164-16-S7-S12. Epub 2015 Jun 11.

Abstract

Background: High-throughput in vivo protein-DNA interaction experiments are currently widely used in gene regulation studies. Hitherto, comprehensive data analysis remains a challenge and for that reason most computational methods only consider the top few hundred or thousand strongest protein binding sites whereas weak protein binding sites are completely ignored.

Results: A new biophysical model of protein-DNA interactions, BayesPI2+, was developed to address the above-mentioned challenges. BayesPI2+ can be run in either a serial computation model or a parallel ensemble learning framework. BayesPI2+ allowed us to analyze all binding sites of the transcription factors, including weak binding that cannot be analyzed by other models. It is evaluated in both synthetic and real in vivo protein-DNA binding experiments. Analysing ESR1 and SPIB in breast carcinoma and activated B cell-like diffuse large B-cell lymphoma cell lines, respectively, revealed that the concerted binding to high and low affinity sites correlates best with gene expression.

Conclusions: BayesPI2+ allows us to analyze transcription factor binding on a larger scale than hitherto achieved. By this analysis, we were able to demonstrate that genes are regulated by concerted binding to high and low affinity binding sites. The program and output results are publicly available at: http://folk.uio.no/junbaiw/BayesPI2Plus.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Binding Sites
  • Breast Neoplasms / metabolism*
  • Cell Line, Tumor
  • Computational Biology / methods
  • DNA / metabolism*
  • DNA-Binding Proteins / chemistry
  • DNA-Binding Proteins / genetics
  • DNA-Binding Proteins / metabolism
  • Estrogen Receptor alpha / chemistry
  • Estrogen Receptor alpha / genetics
  • Estrogen Receptor alpha / metabolism
  • Female
  • Gene Expression Regulation, Neoplastic
  • Humans
  • Lymphoma, Large B-Cell, Diffuse / metabolism*
  • Models, Theoretical
  • Transcription Factors / chemistry*
  • Transcription Factors / genetics
  • Transcription Factors / metabolism*

Substances

  • DNA-Binding Proteins
  • ESR1 protein, human
  • Estrogen Receptor alpha
  • Transcription Factors
  • SPIB protein, human
  • DNA