Functional annotation of HOT regions in the human genome: implications for human disease and cancer

Sci Rep. 2015 Jun 26:5:11633. doi: 10.1038/srep11633.

Abstract

Advances in genome-wide association studies (GWAS) and large-scale sequencing studies have resulted in an impressive and growing list of disease- and trait-associated genetic variants. Most studies have emphasised the discovery of genetic variation in coding sequences, however, the noncoding regulatory effects responsible for human disease and cancer biology have been substantially understudied. To better characterise the cis-regulatory effects of noncoding variation, we performed a comprehensive analysis of the genetic variants in HOT (high-occupancy target) regions, which are considered to be one of the most intriguing findings of recent large-scale sequencing studies. We observed that GWAS variants that map to HOT regions undergo a substantial net decrease and illustrate development-specific localisation during haematopoiesis. Additionally, genetic risk variants are disproportionally enriched in HOT regions compared with LOT (low-occupancy target) regions in both disease-relevant and cancer cells. Importantly, this enrichment is biased toward disease- or cancer-specific cell types. Furthermore, we observed that cancer cells generally acquire cancer-specific HOT regions at oncogenes through diverse mechanisms of cancer pathogenesis. Collectively, our findings demonstrate the key roles of HOT regions in human disease and cancer and represent a critical step toward further understanding disease biology, diagnosis, and therapy.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Line, Tumor
  • Disease / genetics*
  • Genetic Predisposition to Disease / genetics*
  • Genetic Variation*
  • Genome, Human / genetics*
  • Genome-Wide Association Study / methods*
  • Hematopoiesis / genetics
  • Hematopoietic Stem Cells / metabolism
  • Humans
  • Molecular Sequence Annotation
  • Neoplasms / genetics*
  • Neoplasms / pathology
  • Oncogenes / genetics
  • Polymorphism, Single Nucleotide
  • Risk Factors

Associated data

  • GEO/GSE54296