U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from GEO Profiles

    • Showing Current items.

    Cenpu centromere protein U [ Mus musculus (house mouse) ]

    Gene ID: 71876, updated on 5-Mar-2024

    Summary

    Official Symbol
    Cenpuprovided by MGI
    Official Full Name
    centromere protein Uprovided by MGI
    Primary source
    MGI:MGI:1919126
    See related
    Ensembl:ENSMUSG00000031629 AllianceGenome:MGI:1919126
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    Mlf1ip; 1700029A22Rik
    Summary
    Acts upstream of or within chordate embryonic development. Located in cytoplasm and nucleus. Is expressed in several structures, including alimentary system; central nervous system; gonad; hemolymphoid system; and sensory organ. Orthologous to human CENPU (centromere protein U). [provided by Alliance of Genome Resources, Apr 2022]
    Expression
    Broad expression in testis adult (RPKM 4.2), liver E14 (RPKM 3.4) and 21 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    Location:
    8 B1.1; 8 26.38 cM
    Exon count:
    14
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) 8 NC_000074.7 (47005054..47033603)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 8 NC_000074.6 (46552019..46580568)

    Chromosome 8 - NC_000074.7Genomic Context describing neighboring genes Neighboring gene predicted gene, 30931 Neighboring gene STARR-seq mESC enhancer starr_21383 Neighboring gene STARR-seq mESC enhancer starr_21384 Neighboring gene STARR-positive B cell enhancer ABC_E2261 Neighboring gene acyl-CoA synthetase long-chain family member 1 Neighboring gene STARR-positive B cell enhancer ABC_E6635 Neighboring gene CapStarr-seq enhancer MGSCv37_chr8:47637172-47637355 Neighboring gene primase and polymerase (DNA-directed) Neighboring gene predicted gene 45607 Neighboring gene STARR-positive B cell enhancer ABC_E1365 Neighboring gene caspase 3

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Clone Names

    • MGC143675, MGC143676

    Gene Ontology Provided by MGI

    Function Evidence Code Pubs
    enables protein binding IPI
    Inferred from Physical Interaction
    more info
    PubMed 
    Process Evidence Code Pubs
    acts_upstream_of_or_within chordate embryonic development IMP
    Inferred from Mutant Phenotype
    more info
    PubMed 
    involved_in chromosome segregation NAS
    Non-traceable Author Statement
    more info
    PubMed 
    Component Evidence Code Pubs
    located_in centriolar satellite ISO
    Inferred from Sequence Orthology
    more info
     
    located_in chromosome IEA
    Inferred from Electronic Annotation
    more info
     
    located_in chromosome, centromeric region IEA
    Inferred from Electronic Annotation
    more info
     
    located_in cytoplasm IDA
    Inferred from Direct Assay
    more info
    PubMed 
    part_of inner kinetochore ISO
    Inferred from Sequence Orthology
    more info
     
    located_in kinetochore IEA
    Inferred from Electronic Annotation
    more info
     
    located_in nucleoplasm ISO
    Inferred from Sequence Orthology
    more info
     
    is_active_in nucleus IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    located_in nucleus IDA
    Inferred from Direct Assay
    more info
    PubMed 
    located_in nucleus NAS
    Non-traceable Author Statement
    more info
    PubMed 

    General protein information

    Preferred Names
    centromere protein U
    Names
    CENP-U
    MLF1-interacting protein
    myeloid leukemia factor 1 interacting protein

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001368403.1NP_001355332.1  centromere protein U isoform 2

      Status: VALIDATED

      Source sequence(s)
      AC119267
      UniProtKB/TrEMBL
      Q149H7
      Conserved Domains (2) summary
      pfam13097
      Location:139307
      CENP-U; CENP-A nucleosome associated complex (NAC) subunit
      pfam01496
      Location:242384
      V_ATPase_I; V-type ATPase 116kDa subunit family
    2. NM_027973.4NP_082249.1  centromere protein U isoform 1

      See identical proteins and their annotated locations for NP_082249.1

      Status: VALIDATED

      Source sequence(s)
      AC119267
      Consensus CDS
      CCDS22292.1
      UniProtKB/Swiss-Prot
      Q6UNA2, Q8C4M7, Q9D9U1
      UniProtKB/TrEMBL
      Q149H7
      Related
      ENSMUSP00000034045.8, ENSMUST00000034045.15
      Conserved Domains (1) summary
      pfam13097
      Location:144312
      CENP-U; CENP-A nucleosome associated complex (NAC) subunit

    RNA

    1. NR_160797.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC119267
      Related
      ENSMUST00000135432.8
    2. NR_160798.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC119267

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000074.7 Reference GRCm39 C57BL/6J

      Range
      47005054..47033603
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    RNA

    1. XR_004934878.1 RNA Sequence

    2. XR_004934879.1 RNA Sequence