A novel strategy for the identification of toxinlike structures in spider venom

Proteins. 2005 Apr 1;59(1):131-40. doi: 10.1002/prot.20390.

Abstract

We compared two different approaches to sequence information analysis from the expressed sequence tag (EST) library constructed for the venom glands of the spider Agelena orientalis. Some results were more illustrative and reliable by the contig analysis technique, whereas our novel method, with specific structural markers introduced for protein structure detection, allowed us to overcome some limitations of the contig analysis. A novel technique was suggested for the identification in data banks of the spider's ion channel inhibitor toxins using primary structure features common to all spiders. Analysis of about 150 polypeptides made it possible to introduce 3 primary structure motifs for spider toxins: the Principal Structural Motif (PSM), which postulates the existence of 6 amino acid residues between the first and second cysteine residue and the Cys-Cys sequence at a distance of 5-10 amino acid residues from the second cysteine; the Extra Structural Motif (ESM), which postulates the existence of a pair of CXC fragments in the C-region; and the Processing Quadruplet Motif (PQM), which specifies the Arg residue at position -1 and Glu residues at positions -2, -3, and/or -4 in the precursor sequences just before the postprocessing site. In the processed data bank we found 48 toxinlike structures with ion channel inhibitor motifs. These include agelenin earlier isolated from Agelena opulenta and 25 more homologous sequences, 15 homologs of mu-agatoxin 2 from the spider Agelenopsis aperta, 3 structures with low homology to omega-agatoxin-IIIA, and 4 new structures. Also we showed that toxinlike structures exceed two thirds of the overall database sequences.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Consensus Sequence
  • DNA Replication
  • DNA, Complementary
  • Expressed Sequence Tags
  • Molecular Sequence Data
  • Peptides / chemistry
  • Recombinant Proteins / chemistry
  • Spider Venoms / chemistry*
  • Spider Venoms / genetics
  • Spider Venoms / isolation & purification
  • Spiders

Substances

  • DNA, Complementary
  • Peptides
  • Recombinant Proteins
  • Spider Venoms

Associated data

  • GENBANK/AY681297
  • GENBANK/AY681298
  • GENBANK/AY681299
  • GENBANK/AY681300
  • GENBANK/AY681301
  • GENBANK/AY681302
  • GENBANK/AY681303
  • GENBANK/AY681304
  • GENBANK/AY681305
  • GENBANK/AY681306
  • GENBANK/AY681307
  • GENBANK/AY681308
  • GENBANK/AY681309
  • GENBANK/AY681310
  • GENBANK/AY681311
  • GENBANK/AY681312
  • GENBANK/AY681313
  • GENBANK/AY681314
  • GENBANK/AY681315
  • GENBANK/AY681316
  • GENBANK/AY681317
  • GENBANK/AY681318
  • GENBANK/AY681319
  • GENBANK/AY681320
  • GENBANK/AY681321
  • GENBANK/AY681322
  • GENBANK/AY681323
  • GENBANK/AY681324
  • GENBANK/AY681325
  • GENBANK/AY681326
  • GENBANK/AY681327
  • GENBANK/AY681328
  • GENBANK/AY681329
  • GENBANK/AY681330
  • GENBANK/AY681331
  • GENBANK/AY681332
  • GENBANK/AY681333
  • GENBANK/AY681334
  • GENBANK/AY681335
  • GENBANK/AY681336
  • GENBANK/AY681337
  • GENBANK/AY681338
  • GENBANK/AY681339
  • GENBANK/AY681340
  • GENBANK/AY681341
  • GENBANK/AY681342
  • GENBANK/AY681343
  • GENBANK/AY681344