Multiple variable first exons: a mechanism for cell- and tissue-specific gene regulation

Genome Res. 2004 Jan;14(1):79-89. doi: 10.1101/gr.1225204. Epub 2003 Dec 12.

Abstract

A large family of neural protocadherin (Pcdh) proteins is encoded by three closely linked mammalian gene clusters (alpha, beta, and gamma). Pcdh alpha and gamma clusters have a striking genomic organization. Specifically, each "variable" exon is spliced to a common set of downstream "constant" exons within each cluster. Recent studies demonstrated that the cell-specific expression of each Pcdh gene is determined bya combination of variable-exon promoter activation and cis-splicing of the corresponding variable exon to the first constant exon. To determine whether there are other similarly organized gene clusters in mammalian genomes, we performed a genome-wide search and identified a large number of mammalian genes containing multiple variable first exons. Here we describe several clusters that contain about a dozen variable exons arrayed in tandem, including UDP glucuronosyltransferase (UGT1), plectin, neuronal nitric oxide synthase (NOS1), and glucocorticoid receptor (GR) genes. In all these cases, multiple variable first exons are each spliced to a common set of downstream constant exons to generate diverse functional mRNAs. As an example, we analyzed the tissue-specific expression profile of the mouse UGT1 repertoire and found that multiple isoforms are expressed in a tissue-specific manner. Therefore, this variable and constant genomic organization provides a genetic mechanism for directing distinct cell- and tissue-specific patterns of gene expression.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cloning, Molecular
  • Evolution, Molecular
  • Exons / genetics*
  • Gene Expression Profiling / methods
  • Gene Expression Regulation / genetics*
  • Gene Expression Regulation, Enzymologic / genetics
  • Humans
  • Intermediate Filament Proteins / genetics
  • Mice
  • Molecular Sequence Data
  • Monosaccharide Transport Proteins / genetics
  • Multigene Family / genetics
  • N-Acetylglucosaminyltransferases / genetics
  • Nitric Oxide Synthase / genetics
  • Nitric Oxide Synthase Type I
  • Organ Specificity / genetics
  • Phylogeny
  • Plectin
  • Rats
  • Receptors, Glucocorticoid / genetics

Substances

  • Intermediate Filament Proteins
  • Monosaccharide Transport Proteins
  • PLEC protein, human
  • Plec protein, mouse
  • Plec protein, rat
  • Plectin
  • Receptors, Glucocorticoid
  • UDP-galactose translocator
  • NOS1 protein, human
  • Nitric Oxide Synthase
  • Nitric Oxide Synthase Type I
  • Nos1 protein, mouse
  • Nos1 protein, rat
  • N-acetylglucosaminyltransferase IGnT
  • N-Acetylglucosaminyltransferases

Associated data

  • GENBANK/AY227194
  • GENBANK/AY227195
  • GENBANK/AY227196
  • GENBANK/AY227197
  • GENBANK/AY227198
  • GENBANK/AY227199
  • GENBANK/AY227200
  • GENBANK/AY227201
  • GENBANK/AY435128
  • GENBANK/AY435129
  • GENBANK/AY435130
  • GENBANK/AY435131
  • GENBANK/AY435132
  • GENBANK/AY435133
  • GENBANK/AY435134
  • GENBANK/AY435135
  • GENBANK/AY435136
  • GENBANK/AY435137
  • GENBANK/AY435138
  • GENBANK/AY435139
  • GENBANK/AY435140
  • GENBANK/AY435141
  • GENBANK/AY435142
  • GENBANK/AY435143
  • GENBANK/AY435144
  • GENBANK/AY435145
  • GENBANK/AY435146
  • GENBANK/AY435147
  • GENBANK/AY435148
  • GENBANK/AY435149
  • GENBANK/AY435150
  • GENBANK/AY435151
  • GENBANK/AY435152
  • GENBANK/AY435153
  • GENBANK/AY480022
  • GENBANK/AY480023
  • GENBANK/AY480024
  • GENBANK/AY480025
  • GENBANK/AY480026
  • GENBANK/AY480027
  • GENBANK/AY480028
  • GENBANK/AY480029
  • GENBANK/AY480030
  • GENBANK/AY480031
  • GENBANK/AY480032
  • GENBANK/AY480033
  • GENBANK/AY480034
  • GENBANK/AY480035
  • GENBANK/AY480036
  • GENBANK/AY480037
  • GENBANK/AY480038
  • GENBANK/AY480039
  • GENBANK/AY480040
  • GENBANK/AY480041
  • GENBANK/AY480042
  • GENBANK/AY480043
  • GENBANK/AY480044
  • GENBANK/AY480045
  • GENBANK/AY480046
  • GENBANK/AY480047
  • GENBANK/AY480048
  • GENBANK/AY480049
  • GENBANK/AY480050
  • GENBANK/AY480051