Unusual features of fibrillarin cDNA and gene structure in Euglena gracilis: evolutionary conservation of core proteins and structural predictions for methylation-guide box C/D snoRNPs throughout the domain Eucarya

Nucleic Acids Res. 2005 May 13;33(9):2781-91. doi: 10.1093/nar/gki574. Print 2005.

Abstract

Box C/D ribonucleoprotein (RNP) particles mediate O2'-methylation of rRNA and other cellular RNA species. In higher eukaryotic taxa, these RNPs are more complex than their archaeal counterparts, containing four core protein components (Snu13p, Nop56p, Nop58p and fibrillarin) compared with three in Archaea. This increase in complexity raises questions about the evolutionary emergence of the eukaryote-specific proteins and structural conservation in these RNPs throughout the eukaryotic domain. In protists, the primarily unicellular organisms comprising the bulk of eukaryotic diversity, the protein composition of box C/D RNPs has not yet been extensively explored. This study describes the complete gene, cDNA and protein sequences of the fibrillarin homolog from the protozoon Euglena gracilis, the first such information to be obtained for a nucleolus-localized protein in this organism. The E.gracilis fibrillarin gene contains a mixture of intron types exhibiting markedly different sizes. In contrast to most other E.gracilis mRNAs characterized to date, the fibrillarin mRNA lacks a spliced leader (SL) sequence. The predicted fibrillarin protein sequence itself is unusual in that it contains a glycine-lysine (GK)-rich domain at its N-terminus rather than the glycine-arginine-rich (GAR) domain found in most other eukaryotic fibrillarins. In an evolutionarily diverse collection of protists that includes E.gracilis, we have also identified putative homologs of the other core protein components of box C/D RNPs, thereby providing evidence that the protein composition seen in the higher eukaryotic complexes was established very early in eukaryotic cell evolution.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Chromosomal Proteins, Non-Histone / chemistry
  • Chromosomal Proteins, Non-Histone / genetics*
  • DNA, Complementary / chemistry
  • Euglena gracilis / genetics*
  • Eukaryotic Cells / chemistry
  • Evolution, Molecular*
  • Gene Components
  • Introns
  • Molecular Sequence Data
  • RNA, Small Nuclear / chemistry
  • Ribonucleoproteins / chemistry
  • Ribonucleoproteins / genetics*
  • Ribonucleoproteins, Small Nucleolar / chemistry*
  • Sequence Alignment

Substances

  • Chromosomal Proteins, Non-Histone
  • DNA, Complementary
  • RNA, Small Nuclear
  • Ribonucleoproteins
  • Ribonucleoproteins, Small Nucleolar
  • U4 small nuclear RNA
  • fibrillarin

Associated data

  • GENBANK/AF110181
  • GENBANK/AY925002
  • GENBANK/AY950656
  • GENBANK/AY950657
  • GENBANK/AY950658
  • GENBANK/AY950659
  • GENBANK/AY950660
  • GENBANK/AY950661
  • GENBANK/AY950662