Nucleotide variation in the Egfr locus of Drosophila melanogaster

Genetics. 2004 Jul;167(3):1199-212. doi: 10.1534/genetics.104.026252.

Abstract

The Epidermal growth factor receptor is an essential gene with diverse pleiotropic roles in development throughout the animal kingdom. Analysis of sequence diversity in 10.9 kb covering the complete coding region and 6.4 kb of potential regulatory regions in a sample of 250 alleles from three populations of Drosophila melanogaster suggests that the intensity of different population genetic forces varies along the locus. A total of 238 independent common SNPs and 20 indel polymorphisms were detected, with just six common replacements affecting >1475 amino acids, four of which are in the short alternate first exon. Sequence diversity is lowest in a 2-kb portion of intron 2, which is also highly conserved in comparison with D. simulans and D. pseudoobscura. Linkage disequilibrium decays to background levels within 500 bp of most sites, so haplotypes are generally restricted to up to 5 polymorphisms. The two North American samples from North Carolina and California have diverged in allele frequency at a handful of individual SNPs, but a Kenyan sample is both more divergent and more polymorphic. The effect of sample size on inference of the roles of population structure, uneven recombination, and weak selection in patterning nucleotide variation in the locus is discussed.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Analysis of Variance
  • Animals
  • Base Sequence
  • California
  • Drosophila Proteins / genetics*
  • Drosophila melanogaster / genetics*
  • ErbB Receptors / genetics*
  • Gene Frequency
  • Genetic Variation*
  • Genetics, Population*
  • Haplotypes / genetics
  • Kenya
  • Linkage Disequilibrium
  • Molecular Sequence Data
  • North Carolina
  • Polymorphism, Genetic
  • Protein Kinases / genetics*
  • Receptors, Invertebrate Peptide / genetics*
  • Sequence Analysis, DNA

Substances

  • Drosophila Proteins
  • Receptors, Invertebrate Peptide
  • Protein Kinases
  • Egfr protein, Drosophila
  • ErbB Receptors