Determination of the entire sequence of turtle CR1: the first open reading frame of the turtle CR1 element encodes a protein with a novel zinc finger motif

Mol Biol Evol. 1997 Dec;14(12):1206-17. doi: 10.1093/oxfordjournals.molbev.a025730.

Abstract

CR1 elements are a family of retroposons. They are classified as long interspersed elements (LINEs) or non-long-terminal-repeat (non-LTR) retrotransposons, and they have been found in the genomes of many vertebrates. However, they have been only partially characterized, and only a 2-kb region of the 3' end of chicken CR1 has been sequenced. In the present study, we determined the entire consensus sequence of CR1 elements in the turtle genome, designated PsCR1. The first open reading frame (ORF1) of PsCR1 has two unusual arrangements of Cys residues. One of them includes a zinc finger motif, CX2CX14CX2C. The putative zinc finger has cysteine residues with identical spacing and a similar amino acid composition to those found in the species-specific transcription initiation factors SL1 and TIF-IB. The 5' untranslated region (5' UTR) of PsCR1 contains a sequence similar to part of the human L1 promoter, L1 site A, and several cis elements of the type found in eukaryotic genes. Within a region of about 500 bp, there are nine "E boxes," cis elements that are recognized by the basic helix-loop-helix (bHLH) family of proteins. This observation raises the possibility that cellular transcription factors that bind to these sequences might act in concert to regulate the expression of PsCR1. The extent of the sequence divergence of the 3' UTR of CR1 between species was found to be lower than the rate of nonsynonymous substitutions per site in ORF2, suggesting that a strict functional constraint must exist for this region. This result strongly suggests that the conserved 3'-end sequence of CR1 is the recognition site for the reverse transcriptase of CR1. A discussion is presented of a possible mechanism for the integration of CR1 elements and also of the intriguing possible recruitment of the reverse transcriptase for the retroposition of SINEs.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Consensus Sequence
  • Conserved Sequence
  • DNA / genetics
  • Evolution, Molecular
  • Humans
  • Molecular Sequence Data
  • Open Reading Frames
  • Phylogeny
  • Retroelements / genetics*
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Turtles / genetics*
  • Zinc Fingers / genetics

Substances

  • Retroelements
  • DNA