Molecular cloning and sequence analysis of factor C cDNA from the Singapore horseshoe crab, Carcinoscorpius rotundicauda

Mol Mar Biol Biotechnol. 1995 Mar;4(1):90-103.

Abstract

Two forms of Factor C cDNAs: CrFC21 (3448 bp) and CrFC26 (4182 bp) have been cloned into lambda gt22. CrFC26 includes 568 nucleotides of 5' untranslated region (5' UTR) containing seven ATGs before the real initiation site, an open reading frame (ORF) of 3249 nucleotides, a stop codon, and 365 nucleotides of 3' untranslated sequence. There are four polyadenylation signals and six potential glycosylation sites. The ORF codes for a signal peptide of 24 amino acids and a Factor C zymogen of 1059 residues. The CrFC21 lacks most of the 5' UTR, and has some base changes in its ORF. The predicted secondary mRNA structures of the 5' end of CrFC26 showed numerous stem-and-loop structures, thus obscuring its real start codon. In contrast, CrFC21 has a well-exposed AUG start site, and expresses Factor C in transcription-translation reactions in vitro. There is a typical serine protease catalytic triad of Asp-His-Ser, which is structurally like prothrombin, but catalytically more similar to trypsin. Although an overall homology of 97.7% was observed in comparison with the Tachypleus tridentatus Factor C (TtFC) cDNA, there were notable differences in the restriction sites and subtle base substitutions in the CrFC cDNA. The high degree of homology between Factor C from T. tridentatus and C. rotundicauda substantiates, at the molecular level, the proximity of these two species in the course of evolution. This finding contravenes the apparent disparities with respect to their morphology, ecological habitat, and taxonomical classification.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Arthropod Proteins
  • Base Sequence
  • Biological Evolution
  • Blotting, Northern
  • Cloning, Molecular
  • DNA, Complementary
  • Enzyme Precursors / genetics*
  • Horseshoe Crabs / genetics*
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • RNA / chemistry
  • RNA / genetics
  • Recombinant Proteins / genetics
  • Restriction Mapping
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Serine Endopeptidases / genetics*

Substances

  • Arthropod Proteins
  • DNA, Complementary
  • Enzyme Precursors
  • Recombinant Proteins
  • RNA
  • Serine Endopeptidases
  • Limulus clotting factor C

Associated data

  • GENBANK/S77063
  • GENBANK/S77064