U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

LOC112997542 Sharpr-MPRA regulatory region 5990 [ Homo sapiens (human) ]

Gene ID: 112997542, updated on 10-Oct-2023

Summary

Gene symbol
LOC112997542
Gene description
Sharpr-MPRA regulatory region 5990
Gene type
biological region
Feature type(s)
regulatory: enhancer
RefSeq status
REVIEWED
Organism
Homo sapiens
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Summary
This genomic sequence was predicted to be a transcriptional regulatory region based on chromatin state analysis from the ENCODE (ENCyclopedia Of DNA Elements) project. The major subregion was validated as an active enhancer by the ChIP-STARR-seq massively parallel reporter assay (MPRA) in naive human embryonic stem cells, where it associates with the NANOG transcription factor and is marked by the H3K27ac and H3K4me1 histone modifications. Another subregion was validated as an enhancer by Sharpr-MPRA (Systematic high-resolution activation and repression profiling with reporter tiling using massively parallel reporter assays) in both HepG2 liver carcinoma cells (group: HepG2 Activating DNase unmatched - State 1:Tss, active promoter, TSS/CpG island region) and K562 erythroleukemia cells (group: K562 Activating DNase unmatched - State 1:Tss). This locus also includes two accessible chromatin subregions that were validated as enhancers based on their ability to activate an origin of replication minimal core promoter by the ATAC-STARR-seq (assay for transposase-accessible chromatin with self-transcribing active regulatory region sequencing) MPRA in GM12878 lymphoblastoid cells. [provided by RefSeq, May 2023]
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See LOC112997542 in Genome Data Viewer
Location:
4q
Annotation release Status Assembly Chr Location
RS_2023_10 current GRCh38.p14 (GCF_000001405.40) 4 NC_000004.12 (83455071..83456110)
RS_2023_10 current T2T-CHM13v2.0 (GCF_009914755.1) 4 NC_060928.1 (86785138..86786177)
105.20220307 previous assembly GRCh37.p13 (GCF_000001405.25) 4 NC_000004.11 (84376969..84377263)

Chromosome 4 - NC_000004.12Genomic Context describing neighboring genes Neighboring gene uncharacterized LOC105377313 Neighboring gene heparanase Neighboring gene CDK7 strongly-dependent group 2 enhancer GRCh37_chr4:84237230-84238429 Neighboring gene ReSE screen-validated silencer GRCh37_chr4:84242552-84242745 Neighboring gene Sharpr-MPRA regulatory region 10996 Neighboring gene chromosome 18 open reading frame 21 pseudogene Neighboring gene CDK7 strongly-dependent group 2 enhancer GRCh37_chr4:84309555-84310754 Neighboring gene OCT4-NANOG hESC enhancer GRCh37_chr4:84311240-84312057 Neighboring gene helicase, POLQ like Neighboring gene mitochondrial ribosomal protein S18C Neighboring gene abraxas 1, BRCA1 A complex subunit Neighboring gene solute carrier family 25 member 14 pseudogene 1 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 15542 Neighboring gene ribosomal protein L30 pseudogene 5

Genomic regions, transcripts, and products

General gene information

Other Names

  • ATAC-STARR-seq lymphoblastoid active region 21687
  • ATAC-STARR-seq lymphoblastoid active region 21688
  • NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr4:84376224-84377054

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

Genomic

  1. NG_061535.2 

    Range
    101..1140
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2023_10

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCh38.p14 Primary Assembly

Genomic

  1. NC_000004.12 Reference GRCh38.p14 Primary Assembly

    Range
    83455071..83456110
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

Alternate T2T-CHM13v2.0

Genomic

  1. NC_060928.1 Alternate T2T-CHM13v2.0

    Range
    86785138..86786177
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)