The human ribosomal protein genes: sequencing and comparative analysis of 73 genes

Genome Res. 2002 Mar;12(3):379-90. doi: 10.1101/gr.214202.

Abstract

The ribosome, as a catalyst for protein synthesis, is universal and essential for all organisms. Here we describe the structure of the genes encoding human ribosomal proteins (RPs) and compare this class of genes among several eukaryotes. Using genomic and full-length cDNA sequences, we characterized 73 RP genes and found that (1) transcription starts at a C residue within a characteristic oligopyrimidine tract; (2) the promoter region is GC rich, but often has a TATA box or similar sequence element; (3) the genes are small (4.4 kb), but have as many as 5.6 exons on average; (4) the initiator ATG is in the first or second exon and is within plus minus 5 bp of the first intron boundaries in about half of cases; and (5) 5'- and 3'-UTRs are significantly smaller (42 bp and 56 bp, respectively) than the genome average. Comparison of RP genes from humans, Drosophila melanogaster, Caenorhabditis elegans, and Saccharomyces cerevisiae revealed the coding sequences to be highly conserved (63% homology on average), although gene size and the number of exons vary. The positions of the introns are also conserved among these species as follows: 44% of human introns are present at the same position in either D. melanogaster or C. elegans, suggesting RP genes are highly suitable for studying the evolution of introns.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence / genetics
  • Base Sequence / physiology
  • Caenorhabditis elegans / genetics
  • Drosophila melanogaster / genetics
  • Exons / genetics
  • Gene Expression Regulation / genetics
  • Gene Expression Regulation / physiology
  • Genes, Fungal / genetics
  • Genes, Helminth / genetics
  • Genes, Insect / genetics
  • Genetic Variation / genetics
  • Genetic Variation / physiology
  • Humans
  • Interspersed Repetitive Sequences / genetics
  • Interspersed Repetitive Sequences / physiology
  • Introns / genetics
  • Molecular Sequence Data
  • Promoter Regions, Genetic / genetics
  • Promoter Regions, Genetic / physiology
  • Ribosomal Proteins / chemistry
  • Ribosomal Proteins / genetics*
  • Ribosomal Proteins / physiology
  • Saccharomyces cerevisiae / genetics
  • Sequence Analysis, DNA / methods*

Substances

  • Ribosomal Proteins

Associated data

  • GENBANK/AB055762
  • GENBANK/AB055763
  • GENBANK/AB055764
  • GENBANK/AB055765
  • GENBANK/AB055766
  • GENBANK/AB055767
  • GENBANK/AB055768
  • GENBANK/AB055769
  • GENBANK/AB055770
  • GENBANK/AB055771
  • GENBANK/AB055772
  • GENBANK/AB055773
  • GENBANK/AB055774
  • GENBANK/AB055775
  • GENBANK/AB055776
  • GENBANK/AB055777
  • GENBANK/AB055778
  • GENBANK/AB055779
  • GENBANK/AB055780
  • GENBANK/AB056456
  • GENBANK/AB061820
  • GENBANK/AB061821
  • GENBANK/AB061822
  • GENBANK/AB061823
  • GENBANK/AB061824
  • GENBANK/AB061825
  • GENBANK/AB061826
  • GENBANK/AB061827
  • GENBANK/AB061828
  • GENBANK/AB061829
  • GENBANK/AB061830
  • GENBANK/AB061831
  • GENBANK/AB061832
  • GENBANK/AB061833
  • GENBANK/AB061834
  • GENBANK/AB061835
  • GENBANK/AB061836
  • GENBANK/AB061837
  • GENBANK/AB061838
  • GENBANK/AB061839
  • GENBANK/AB061840
  • GENBANK/AB061841
  • GENBANK/AB061842
  • GENBANK/AB061843
  • GENBANK/AB061844
  • GENBANK/AB061845
  • GENBANK/AB061846
  • GENBANK/AB061847
  • GENBANK/AB061848
  • GENBANK/AB061849
  • GENBANK/AB061850
  • GENBANK/AB061851
  • GENBANK/AB061852
  • GENBANK/AB061853
  • GENBANK/AB061854
  • GENBANK/AB061855
  • GENBANK/AB061856
  • GENBANK/AB061857
  • GENBANK/AB061858
  • GENBANK/AB061859
  • GENBANK/AB062066
  • GENBANK/AB062067
  • GENBANK/AB062068
  • GENBANK/AB062069
  • GENBANK/AB062070
  • GENBANK/AB062071
  • GENBANK/AB070559