U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

CTSH cathepsin H [ Homo sapiens (human) ]

Gene ID: 1512, updated on 3-Apr-2024

Summary

Official Symbol
CTSHprovided by HGNC
Official Full Name
cathepsin Hprovided by HGNC
Primary source
HGNC:HGNC:2535
See related
Ensembl:ENSG00000103811 MIM:116820; AllianceGenome:HGNC:2535
Gene type
protein coding
RefSeq status
REVIEWED
Organism
Homo sapiens
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Also known as
ACC4; ACC5; CPSB; ACC-4; ACC-5
Summary
The protein encoded by this gene is a lysosomal cysteine proteinase important in the overall degradation of lysosomal proteins. It is composed of a dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. The encoded protein, which belongs to the peptidase C1 protein family, can act both as an aminopeptidase and as an endopeptidase. Increased expression of this gene has been correlated with malignant progression of prostate tumors. Alternate splicing of this gene results in multiple transcript variants encoding different isoforms. [provided by RefSeq, Jan 2016]
Expression
Broad expression in lung (RPKM 144.1), kidney (RPKM 97.7) and 24 other tissues See more
Orthologs
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

Location:
15q25.1
Exon count:
13
Annotation release Status Assembly Chr Location
RS_2023_10 current GRCh38.p14 (GCF_000001405.40) 15 NC_000015.10 (78921058..78945046, complement)
RS_2023_10 current T2T-CHM13v2.0 (GCF_009914755.1) 15 NC_060939.1 (76784722..76808716, complement)
105.20220307 previous assembly GRCh37.p13 (GCF_000001405.25) 15 NC_000015.9 (79213400..79237388, complement)

Chromosome 15 - NC_000015.10Genomic Context describing neighboring genes Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 6717 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 6718 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr15:79165819-79166450 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 6719 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr15:79168309-79168910 Neighboring gene mortality factor 4 like 1 Neighboring gene RNA, U6 small nuclear 415, pseudogene Neighboring gene ATAC-STARR-seq lymphoblastoid active region 9923 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 9924 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 9925 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 9926 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 9927 Neighboring gene Sharpr-MPRA regulatory region 2017 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr15:79237120-79237658 Neighboring gene Ras protein specific guanine nucleotide releasing factor 1 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 9928 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 9929 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 9930 Neighboring gene uncharacterized LOC100129540 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr15:79288791-79289564 Neighboring gene uncharacterized LOC105370917 Neighboring gene MED14-independent group 3 enhancer GRCh37_chr15:79296069-79297268 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr15:79298197-79299032 Neighboring gene Sharpr-MPRA regulatory region 5581

Genomic regions, transcripts, and products

Expression

  • Project title: HPA RNA-seq normal tissues
  • Description: RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes
  • BioProject: PRJEB4337
  • Publication: PMID 24309898
  • Analysis date: Wed Apr 4 07:08:55 2018

Bibliography

GeneRIFs: Gene References Into Functions

What's a GeneRIF?

Phenotypes

EBI GWAS Catalog

Description
Genome-wide association study and meta-analysis find that over 40 loci affect risk of type 1 diabetes.
EBI GWAS Catalog
Genome-wide association study on bipolar disorder in the Bulgarian population.
EBI GWAS Catalog
Meta-analysis of genome-wide association study data identifies additional type 1 diabetes risk loci.
EBI GWAS Catalog

HIV-1 interactions

Protein interactions

Protein Gene Interaction Pubs
Vpr vpr A stable-isotope labeling by amino acids in cell culture coupled with mass spectrometry-based proteomics identifies upregulation of cathepsin H (CTSH) expression by HIV-1 Vpr in Vpr transduced macrophages PubMed

Go to the HIV-1, Human Interaction Database

Interactions

Products Interactant Other Gene Complex Source Pubs Description

General gene information

Markers

Clone Names

  • MGC1519, DKFZp686B24257

Gene Ontology Provided by GOA

Process Evidence Code Pubs
involved_in ERK1 and ERK2 cascade IDA
Inferred from Direct Assay
more info
PubMed 
involved_in T cell mediated cytotoxicity ISS
Inferred from Sequence or Structural Similarity
more info
 
involved_in adaptive immune response IEP
Inferred from Expression Pattern
more info
PubMed 
involved_in antigen processing and presentation TAS
Traceable Author Statement
more info
PubMed 
involved_in apoptotic process IEA
Inferred from Electronic Annotation
more info
 
involved_in bradykinin catabolic process IDA
Inferred from Direct Assay
more info
PubMed 
involved_in cellular response to thyroid hormone stimulus IEP
Inferred from Expression Pattern
more info
PubMed 
involved_in dichotomous subdivision of terminal units involved in lung branching ISS
Inferred from Sequence or Structural Similarity
more info
 
involved_in immune response IBA
Inferred from Biological aspect of Ancestor
more info
 
involved_in immune response-regulating signaling pathway IDA
Inferred from Direct Assay
more info
PubMed 
involved_in membrane protein proteolysis IDA
Inferred from Direct Assay
more info
PubMed 
involved_in metanephros development ISS
Inferred from Sequence or Structural Similarity
more info
 
involved_in negative regulation of apoptotic process ISS
Inferred from Sequence or Structural Similarity
more info
 
involved_in neuropeptide catabolic process IDA
Inferred from Direct Assay
more info
PubMed 
involved_in positive regulation of angiogenesis ISS
Inferred from Sequence or Structural Similarity
more info
 
involved_in positive regulation of apoptotic signaling pathway IBA
Inferred from Biological aspect of Ancestor
more info
 
involved_in positive regulation of cell migration IDA
Inferred from Direct Assay
more info
PubMed 
involved_in positive regulation of cell population proliferation ISS
Inferred from Sequence or Structural Similarity
more info
 
involved_in positive regulation of epithelial cell migration ISS
Inferred from Sequence or Structural Similarity
more info
 
involved_in positive regulation of gene expression IDA
Inferred from Direct Assay
more info
PubMed 
involved_in positive regulation of peptidase activity IBA
Inferred from Biological aspect of Ancestor
more info
 
involved_in positive regulation of peptidase activity IDA
Inferred from Direct Assay
more info
PubMed 
involved_in protein destabilization IMP
Inferred from Mutant Phenotype
more info
PubMed 
involved_in proteolysis IDA
Inferred from Direct Assay
more info
PubMed 
involved_in proteolysis TAS
Traceable Author Statement
more info
PubMed 
involved_in proteolysis involved in protein catabolic process IBA
Inferred from Biological aspect of Ancestor
more info
 
involved_in response to retinoic acid ISS
Inferred from Sequence or Structural Similarity
more info
 
involved_in surfactant homeostasis IDA
Inferred from Direct Assay
more info
PubMed 
involved_in zymogen activation IDA
Inferred from Direct Assay
more info
PubMed 

General protein information

Preferred Names
pro-cathepsin H
Names
N-benzoylarginine-beta-naphthylamide hydrolase
aleurain
cathepsin B3
cathepsin BA
NP_001306066.1
NP_001398024.1
NP_004381.2
XP_016877440.1
XP_054233346.1

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

Genomic

  1. NG_009614.2 RefSeqGene

    Range
    5000..28988
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

mRNA and Protein(s)

  1. NM_001319137.2NP_001306066.1  pro-cathepsin H isoform c

    Status: REVIEWED

    Description
    Transcript Variant: This variant (3) differs in the 5' UTR, lacks a portion of the 5' coding region and initiates translation at an alternate start codon, compared to variant 1. It encodes isoform c which is shorter and has a distinct N-terminus, compared to isoform a.
    Source sequence(s)
    AC011944
    Conserved Domains (1) summary
    pfam00112
    Location:32198
    Peptidase_C1; Papain family cysteine protease
  2. NM_001411095.1NP_001398024.1  pro-cathepsin H isoform d

    Status: REVIEWED

    Source sequence(s)
    AC011944
    Consensus CDS
    CCDS92047.1
    UniProtKB/TrEMBL
    E9PKT6
    Related
    ENSP00000504051.1, ENST00000677316.1
  3. NM_004390.5NP_004381.2  pro-cathepsin H isoform a preproprotein

    See identical proteins and their annotated locations for NP_004381.2

    Status: REVIEWED

    Description
    Transcript Variant: This variant (1) represents the shorter transcript and encodes the longer isoform (a).
    Source sequence(s)
    AC011944
    Consensus CDS
    CCDS10308.1
    UniProtKB/Swiss-Prot
    B2RBK0, P09668, Q96NY6, Q9BUM7
    UniProtKB/TrEMBL
    Q6IBC3
    Related
    ENSP00000220166.6, ENST00000220166.10
    Conserved Domains (2) summary
    pfam00112
    Location:117332
    Peptidase_C1; Papain family cysteine protease
    pfam08246
    Location:3590
    Inhibitor_I29; Cathepsin propeptide inhibitor domain (I29)

RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2023_10

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCh38.p14 Primary Assembly

Genomic

  1. NC_000015.10 Reference GRCh38.p14 Primary Assembly

    Range
    78921058..78945046 complement
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

mRNA and Protein(s)

  1. XM_017021951.2XP_016877440.1  pro-cathepsin H isoform X1

    UniProtKB/TrEMBL
    Q6IBC3

Alternate T2T-CHM13v2.0

Genomic

  1. NC_060939.1 Alternate T2T-CHM13v2.0

    Range
    76784722..76808716 complement
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

mRNA and Protein(s)

  1. XM_054377371.1XP_054233346.1  pro-cathepsin H isoform X1

Suppressed Reference Sequence(s)

The following Reference Sequences have been suppressed. Explain

  1. NM_148979.2: Suppressed sequence

    Description
    NM_148979.2: This RefSeq was permanently suppressed because it represents a poorly supported variant with non-consensus splice sites.