    ZBTB8OS zinc finger and BTB domain containing 8 opposite strand [ Homo sapiens (human) ]

    Gene ID: 339487, updated on 3-Apr-2024

    Summary

    Official Symbol
    ZBTB8OS (provided by HGNC)
    Official Full Name
    zinc finger and BTB domain containing 8 opposite strand (provided by HGNC)
    Primary source
    HGNC:HGNC:24094
    See related
    Ensembl:ENSG00000176261; MIM:615891; AllianceGenome:HGNC:24094
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Homo sapiens
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
    Also known as
    ARCH; ARCH2
    Summary
    Predicted to enable metal ion binding activity. Involved in tRNA splicing, via endonucleolytic cleavage and ligation. Part of tRNA-splicing ligase complex. [provided by Alliance of Genome Resources, Apr 2022]
    Expression
    Ubiquitous expression in colon (RPKM 4.0), lymph node (RPKM 3.5) and 25 other tissues

    Genomic context

    Location:
    1p35.1
    Exon count:
    10
    Annotation release  Status             Assembly                         Chr  Location
    RS_2023_10          current            GRCh38.p14 (GCF_000001405.40)    1    NC_000001.11 (32620820..32650932, complement)
    RS_2023_10          current            T2T-CHM13v2.0 (GCF_009914755.1)  1    NC_060925.1 (32479596..32510712, complement)
    105.20220307        previous assembly  GRCh37.p13 (GCF_000001405.25)    1    NC_000001.10 (33086421..33116533, complement)
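
    The Location ranges above can be compared directly. As a minimal sketch (the helper names parse_range and span_length are illustrative and not part of any NCBI tool), the following Python computes the length of each range, assuming the coordinates are 1-based and inclusive, as is conventional for RefSeq/GenBank ranges:

```python
def parse_range(text):
    """Split a 'start..end' range string into two integers."""
    start, end = text.split("..")
    return int(start), int(end)


def span_length(text):
    """Length of a 1-based, inclusive range (assumed convention)."""
    start, end = parse_range(text)
    return end - start + 1


# Ranges copied from the Location column above.
locations = {
    "GRCh38.p14": "32620820..32650932",
    "T2T-CHM13v2.0": "32479596..32510712",
    "GRCh37.p13": "33086421..33116533",
}

for assembly, rng in locations.items():
    print(assembly, span_length(rng))
```

    Under that assumption, the GRCh38.p14 and GRCh37.p13 ranges have the same length (30,113 bp), while the T2T-CHM13v2.0 range is about 1 kb longer (31,117 bp).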

    Chromosome 1 - NC_000001.11. Neighboring genes:

    • uncharacterized LOC124903952
    • ReSE screen-validated silencer GRCh37_chr1:33013203-33013387
    • MPRA-validated peak165 silencer
    • zinc finger and BTB domain containing 8A
    • MPRA-validated peak166 silencer
    • H3K27ac hESC enhancer GRCh37_chr1:33072181-33072681
    • uncharacterized LOC102723870
    • H3K27ac hESC enhancer GRCh37_chr1:33077607-33078108
    • ATAC-STARR-seq lymphoblastoid active region 694
    • OCT4-NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33107487-33108246
    • OCT4-NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33108247-33109005
    • ATAC-STARR-seq lymphoblastoid silent region 603
    • ATAC-STARR-seq lymphoblastoid active region 695
    • hESC enhancers GRCh37_chr1:33116021-33116608 and GRCh37_chr1:33116609-33117197
    • ATAC-STARR-seq lymphoblastoid active region 696
    • RB binding protein 4, chromatin remodeling factor
    • H3K4me1 hESC enhancer GRCh37_chr1:33154443-33154943
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33160284-33161022
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33167839-33168604
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33168605-33169370
    • syncoilin, intermediate filament protein
    • ATAC-STARR-seq lymphoblastoid silent region 607
    • Sharpr-MPRA regulatory region 2185
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33183271-33184026
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33189892-33190720
    • Sharpr-MPRA regulatory region 8469
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33200787-33201312
    • OCT4-NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33201837-33202361
    • OCT4-NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33202362-33202885
    • ATAC-STARR-seq lymphoblastoid silent region 608
    • H3K27ac hESC enhancer GRCh37_chr1:33219766-33220279
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33225004-33225886
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33225887-33226770
    • ReSE screen-validated silencer GRCh37_chr1:33227405-33227625
    • NHS like 3
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33231243-33232135
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33232136-33233029
    • H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:33235076-33235991
    • Sharpr-MPRA regulatory region 15696

    Genomic regions, transcripts, and products

    Expression

    • Project title: HPA RNA-seq normal tissues
    • Description: RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes
    • BioProject: PRJEB4337
    • Publication: PMID 24309898
    • Analysis date: Wed Apr 4 07:08:55 2018

    General protein information

    Preferred Names
    protein archease
    Names
    archease (ARCH)
    archease-like protein
    zinc finger and BTB domain-containing opposite strand protein 8

    NCBI Reference Sequences (RefSeq)

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds.

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.
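
    The comparison described above amounts to splitting each identifier into accession and version and checking whether the versions differ. A minimal sketch in Python, assuming the annotated-genome versions have been collected by hand (this page text does not list them, so the `annotated` values below are hypothetical placeholders); the function names are illustrative only:

```python
import re

# RefSeq-style identifier: two-letter prefix, underscore, digits, dot, version.
ACCESSION_RE = re.compile(r"([A-Z]{2}_\d+)\.(\d+)")


def split_accession(identifier):
    """Return (accession, version) for an identifier such as 'NM_178547.5'."""
    match = ACCESSION_RE.fullmatch(identifier)
    if match is None:
        raise ValueError(f"not an accession.version identifier: {identifier!r}")
    return match.group(1), int(match.group(2))


def version_mismatches(curated, annotated):
    """List (accession, curated_version, annotated_version) where versions differ."""
    annotated_versions = dict(split_accession(a) for a in annotated)
    mismatches = []
    for identifier in curated:
        accession, version = split_accession(identifier)
        other = annotated_versions.get(accession)
        if other is not None and other != version:
            mismatches.append((accession, version, other))
    return mismatches


# Curated identifiers taken from this section.
curated = ["NM_178547.5", "NM_001308135.2"]
# Hypothetical annotated-build versions, for illustration only.
annotated = ["NM_178547.4", "NM_001308135.2"]

print(version_mismatches(curated, annotated))
```

    Accessions present in only one of the two lists are ignored here; whether to report them is a design choice left open in this sketch.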

    mRNA and Protein(s)

    1. NM_001308135.2 → NP_001295064.1  protein archease isoform 2

      Status: VALIDATED

      Description
      Transcript Variant: This variant (2) lacks an alternate in-frame exon in the 5' coding region compared to variant 1. It encodes isoform 2 which has the same N- and C- termini, but lacks a short internal segment compared to isoform 1.
      Source sequence(s)
      AC114489, AL033529
      UniProtKB/TrEMBL
      A8K0B5
      Related
      ENSP00000483675.2, ENST00000373506.8
      Conserved Domains (1) summary
      pfam01951
      Location: 47 → 172
      Archease; Archease protein family (MTH1598/TM1083)
    2. NM_001308136.2 → NP_001295065.1  protein archease isoform 3

      Status: VALIDATED

      Description
      Transcript Variant: This variant (3) lacks an exon in the 3' coding region, which results in a frameshift and an early stop codon, compared to variant 1. The encoded isoform (3) is shorter and has a distinct C-terminus, compared to isoform 1.
      Source sequence(s)
      AC114489, AL033529
      UniProtKB/TrEMBL
      H7C3R6
      Related
      ENSP00000413485.2, ENST00000436661.6
      Conserved Domains (1) summary
      pfam01951
      Location: 43 → 121
      Archease; Archease protein family (MTH1598/TM1083)
    3. NM_001308137.2 → NP_001295066.1  protein archease isoform 4

      Status: VALIDATED

      Description
      Transcript Variant: This variant (4) uses an alternate splice donor site in the 5' coding region, and lacks exons in the 5' and 3' coding regions, with the latter resulting in a frameshift and an early stop codon, compared to variant 1. The encoded isoform (4) contains two distinct amino acids near the N-terminus, lacks an internal segment, is shorter, and has a distinct C-terminus, compared to isoform 1.
      Source sequence(s)
      AC114489, AL033529
      UniProtKB/TrEMBL
      H7C3R6
      Conserved Domains (1) summary
      pfam01951
      Location: 47 → 114
      Archease; Archease protein family (MTH1598/TM1083)
    4. NM_001308138.2 → NP_001295067.1  protein archease isoform 5

      Status: VALIDATED

      Description
      Transcript Variant: This variant (5) has multiple differences in the coding region compared to variant 1. This variant represents translation initiation at a downstream start codon compared to variant 1; the 5'-most initiation codon, as used in variant 1, is associated with a truncated ORF that would render the transcript a candidate for nonsense-mediated decay (NMD). Leaky scanning may allow translation initiation at the downstream start codon to encode an isoform (5) that has a shorter N-terminus, compared to isoform 1. Variants 5, 6, 7, and 8 all encode the same isoform (5).
      Source sequence(s)
      AC114489, AL033529
      Consensus CDS
      CCDS76134.1
      UniProtKB/TrEMBL
      A0A087WXH6, A0A9K3Y7L1, D3DPQ2
      Related
      ENSP00000481039.1, ENST00000465588.2
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 110
      Archease; Archease protein family (MTH1598/TM1083)
    5. NM_001308139.2 → NP_001295068.1  protein archease isoform 5

      Status: VALIDATED

      Description
      Transcript Variant: This variant (6) uses an alternate splice acceptor site in the 5' coding region compared to variant 1. This variant represents translation initiation at a downstream start codon compared to variant 1; the 5'-most initiation codon, as used in variant 1, is associated with a truncated ORF that would render the transcript a candidate for nonsense-mediated decay (NMD). Leaky scanning may allow translation initiation at the downstream start codon to encode an isoform (5) that has a shorter N-terminus, compared to isoform 1. Variants 5, 6, 7, and 8 all encode the same isoform (5).
      Source sequence(s)
      AC114489, AL033529
      Consensus CDS
      CCDS76134.1
      UniProtKB/TrEMBL
      A0A087WXH6, A0A9K3Y7L1, D3DPQ2
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 110
      Archease; Archease protein family (MTH1598/TM1083)
    6. NM_001308140.2 → NP_001295069.1  protein archease isoform 5

      Status: VALIDATED

      Description
      Transcript Variant: This variant (7) uses an alternate splice donor site in the 5' coding region compared to variant 1. This variant represents translation initiation at a downstream start codon compared to variant 1; the 5'-most initiation codon, as used in variant 1, is associated with a truncated ORF that would render the transcript a candidate for nonsense-mediated decay (NMD). Leaky scanning may allow translation initiation at the downstream start codon to encode an isoform (5) that has a shorter N-terminus, compared to isoform 1. Variants 5, 6, 7, and 8 all encode the same isoform (5).
      Source sequence(s)
      AC114489, AL033529
      Consensus CDS
      CCDS76134.1
      UniProtKB/TrEMBL
      A0A087WXH6, A0A9K3Y7L1, D3DPQ2
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 110
      Archease; Archease protein family (MTH1598/TM1083)
    7. NM_001308141.2 → NP_001295070.1  protein archease isoform 5

      Status: VALIDATED

      Description
      Transcript Variant: This variant (8) uses an alternate splice donor site in the 5' coding region compared to variant 1. This variant represents translation initiation at a downstream start codon compared to variant 1; the 5'-most initiation codon, as used in variant 1, is associated with a truncated ORF that would render the transcript a candidate for nonsense-mediated decay (NMD). Leaky scanning may allow translation initiation at the downstream start codon to encode an isoform (5) that has a shorter N-terminus, compared to isoform 1. Variants 5, 6, 7, and 8 all encode the same isoform (5).
      Source sequence(s)
      AC114489, AL033529
      Consensus CDS
      CCDS76134.1
      UniProtKB/TrEMBL
      A0A087WXH6, A0A9K3Y7L1, D3DPQ2
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 110
      Archease; Archease protein family (MTH1598/TM1083)
    8. NM_001330475.2 → NP_001317404.1  protein archease isoform 6

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      Consensus CDS
      CCDS81292.1
      UniProtKB/TrEMBL
      A0A8C8MQ05
      Related
      ENSP00000362600.3, ENST00000373501.6
      Conserved Domains (1) summary
      pfam01951
      Location: 6 → 131
      Archease; Archease protein family (MTH1598/TM1083)
    9. NM_001366255.1 → NP_001353184.1  protein archease isoform 6

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      Consensus CDS
      CCDS81292.1
      UniProtKB/TrEMBL
      A0A8C8MQ05
      Conserved Domains (1) summary
      pfam01951
      Location: 6 → 131
      Archease; Archease protein family (MTH1598/TM1083)
    10. NM_001366256.1 → NP_001353185.1  protein archease isoform 5

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      Consensus CDS
      CCDS76134.1
      UniProtKB/TrEMBL
      A0A087WXH6, A0A9K3Y7L1, D3DPQ2
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 110
      Archease; Archease protein family (MTH1598/TM1083)
    11. NM_001366257.1 → NP_001353186.1  protein archease isoform 5

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529, HY228209
      Consensus CDS
      CCDS76134.1
      UniProtKB/TrEMBL
      A0A087WXH6, A0A9K3Y7L1, D3DPQ2
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 110
      Archease; Archease protein family (MTH1598/TM1083)
    12. NM_001366258.1 → NP_001353187.1  protein archease isoform 5

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      Consensus CDS
      CCDS76134.1
      UniProtKB/TrEMBL
      A0A087WXH6, A0A9K3Y7L1, D3DPQ2
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 110
      Archease; Archease protein family (MTH1598/TM1083)
    13. NM_001366259.1 → NP_001353188.1  protein archease isoform 5

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      Consensus CDS
      CCDS76134.1
      UniProtKB/TrEMBL
      A0A087WXH6, A0A9K3Y7L1, D3DPQ2
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 110
      Archease; Archease protein family (MTH1598/TM1083)
    14. NM_001366260.1 → NP_001353189.1  protein archease isoform 5

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      Consensus CDS
      CCDS76134.1
      UniProtKB/TrEMBL
      A0A087WXH6, A0A9K3Y7L1, D3DPQ2
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 110
      Archease; Archease protein family (MTH1598/TM1083)
    15. NM_001366263.1 → NP_001353192.1  protein archease isoform 5

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529, CN344730
      Consensus CDS
      CCDS76134.1
      UniProtKB/TrEMBL
      A0A087WXH6, A0A9K3Y7L1, D3DPQ2
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 110
      Archease; Archease protein family (MTH1598/TM1083)
    16. NM_001366264.1 → NP_001353193.1  protein archease isoform 7

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529, HY108987
      UniProtKB/TrEMBL
      A8K0B5
      Conserved Domains (1) summary
      pfam01951
      Location: 43 → 191
      Archease; Archease protein family (MTH1598/TM1083)
    17. NM_001366265.1 → NP_001353194.1  protein archease isoform 8

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529, CB992352
      UniProtKB/TrEMBL
      A8K0B5
      Conserved Domains (1) summary
      pfam01951
      Location: 43 → 152
      Archease; Archease protein family (MTH1598/TM1083)
    18. NM_001366266.1 → NP_001353195.1  protein archease isoform 9

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529, HY067040
      UniProtKB/TrEMBL
      A8K0B5
      Conserved Domains (1) summary
      pfam01951
      Location: 43 → 149
      Archease; Archease protein family (MTH1598/TM1083)
    19. NM_001366267.1 → NP_001353196.1  protein archease isoform 10

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 127
      Archease; Archease protein family (MTH1598/TM1083)
    20. NM_001366268.1 → NP_001353197.1  protein archease isoform 11

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      Conserved Domains (1) summary
      pfam01951
      Location: 1 → 122
      Archease; Archease protein family (MTH1598/TM1083)
    21. NM_001366269.1 → NP_001353198.1  protein archease isoform 12

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      Conserved Domains (1) summary
      pfam01951
      Location: 6 → 88
      Archease; Archease protein family (MTH1598/TM1083)
    22. NM_001366270.1 → NP_001353199.1  protein archease isoform 13

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      UniProtKB/TrEMBL
      A0A087X1H4
      Related
      ENSP00000484207.2, ENST00000492007.6
      Conserved Domains (1) summary
      cl00606
      Location: 54 → 93
      Archease; Archease protein family (MTH1598/TM1083)
    23. NM_001366271.1 → NP_001353200.1  protein archease isoform 14

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
      Conserved Domains (1) summary
      cl00606
      Location: 67 → 92
      Archease; Archease protein family (MTH1598/TM1083)
    24. NM_178547.5 → NP_848642.2  protein archease isoform 1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (1) encodes the longest isoform (1).
      Source sequence(s)
      AL033529, AY151084
      Consensus CDS
      CCDS365.2
      UniProtKB/Swiss-Prot
      Q5TGK5, Q6PDA1, Q8IWS9, Q8IWT0, Q8NEV6, Q8NEV7
      UniProtKB/TrEMBL
      A0A087X0V4
      Related
      ENSP00000417677.2, ENST00000468695.6
      Conserved Domains (1) summary
      pfam01951
      Location: 31 → 167
      Archease; Archease protein family (MTH1598/TM1083)

    RNA

    1. NR_158772.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
    2. NR_158773.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
    3. NR_158774.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
    4. NR_158775.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
    5. NR_158776.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
    6. NR_158777.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
    7. NR_158778.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
    8. NR_158779.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
    9. NR_158780.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529
    10. NR_158781.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AA927666, AC114489, AL033529
    11. NR_158782.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC114489, AL033529, CK819207

    RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2023_10

    The following sections contain reference sequences that belong to a specific genome build.

    Reference GRCh38.p14 Primary Assembly

    Genomic

    1. NC_000001.11 Reference GRCh38.p14 Primary Assembly

      Range
      32620820..32650932 complement

    mRNA and Protein(s)

    1. XM_017001136.3 → XP_016856625.1  protein archease isoform X4

    2. XM_047419295.1 → XP_047275251.1  protein archease isoform X5

    3. XM_047419291.1 → XP_047275247.1  protein archease isoform X2

      UniProtKB/TrEMBL
      A0A8C8MQ05
    4. XM_047419294.1 → XP_047275250.1  protein archease isoform X3

    5. XM_011541327.3 → XP_011539629.1  protein archease isoform X1

      UniProtKB/TrEMBL
      H7C3R6
      Conserved Domains (1) summary
      pfam01951
      Location: 43 → 122
      Archease; Archease protein family (MTH1598/TM1083)

    RNA

    1. XR_007059335.1 RNA Sequence

    Alternate T2T-CHM13v2.0

    Genomic

    1. NC_060925.1 Alternate T2T-CHM13v2.0

      Range
      32479596..32510712 complement

    mRNA and Protein(s)

    1. XM_054336276.1 → XP_054192251.1  protein archease isoform X4

    2. XM_054336277.1 → XP_054192252.1  protein archease isoform X5

    3. XM_054336274.1 → XP_054192249.1  protein archease isoform X2

      UniProtKB/TrEMBL
      A0A8C8MQ05
    4. XM_054336275.1 → XP_054192250.1  protein archease isoform X3

    5. XM_054336273.1 → XP_054192248.1  protein archease isoform X1

    RNA

    1. XR_008486016.1 RNA Sequence

    Suppressed Reference Sequence(s)

    The following Reference Sequences have been suppressed.

    1. NM_001366278.1: Suppressed sequence

      Description
      NM_001366278.1: This RefSeq was removed because it is redundant with an existing RefSeq.