Origin and diversity of the SOX transcription factor gene family: genome-wide analysis in Fugu rubripes

Gene. 2004 Mar 17;328:177-86. doi: 10.1016/j.gene.2003.12.008.

Abstract

The SOX family of transcription factors is found throughout the animal kingdom and is important in a variety of developmental contexts. Genome analysis has identified 20 Sox genes in human and mouse, which can be subdivided into 8 groups based on sequence comparison and intron-exon structure. Most of the SOX groups identified in mammals are represented by a single SOX sequence in invertebrate model organisms, suggesting that a duplication and divergence mechanism has operated during vertebrate evolution. We have now analysed the Sox gene complement in the pufferfish, Fugu rubripes, in order to shed further light on the diversity and origins of the Sox gene family. Major differences were found between the Sox family in Fugu and those in humans and mice. In particular, Fugu does not have orthologues of Sry, Sox15 and Sox30, which appear to be specific to mammals, while Sox19, found in Fugu and zebrafish but absent in mammals, seems to be specific to fishes. Six mammalian Sox genes are represented by two copies each in Fugu, indicating a large-scale gene duplication in the fish lineage. These findings point to recent Sox gene loss, duplication and divergence occurring during the evolution of tetrapod and teleost lineages, and provide further evidence for a large-scale segmental or whole-genome duplication occurring early in the radiation of teleosts.
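
The counts given in the abstract can be combined into a rough tally of the Fugu Sox complement. The sketch below is illustrative arithmetic only, not a result reported in the abstract. It assumes that every mammalian Sox gene other than Sry, Sox15 and Sox30 has at least one Fugu orthologue, and that Sox19 is the only fish-specific addition; the abstract states neither explicitly. All variable names are hypothetical.

```python
# Counts stated in the abstract.
mammalian_sox_genes = 20                      # Sox genes in human and mouse
mammal_specific = ["Sry", "Sox15", "Sox30"]   # no orthologue in Fugu
fish_specific = ["Sox19"]                     # in Fugu and zebrafish, absent in mammals
duplicated_in_fugu = 6                        # mammalian genes with two Fugu copies

# ASSUMPTION (not stated in the abstract): each remaining mammalian gene
# has at least one Fugu orthologue, and Sox19 is the only fish-specific gene.
shared_with_mammals = mammalian_sox_genes - len(mammal_specific)
implied_fugu_total = shared_with_mammals + duplicated_in_fugu + len(fish_specific)

# GenBank accessions listed under "Associated data": AY277950 to AY277973.
accessions = [f"AY{n}" for n in range(277950, 277973 + 1)]

print("Implied Fugu Sox genes:", implied_fugu_total)
print("Listed GenBank accessions:", len(accessions))
```

Under these assumptions the tally comes to 24, which equals the number of GenBank accessions listed for this record. The agreement is suggestive only, since the abstract does not say that each accession corresponds to one Fugu Sox gene.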

Publication types

  • Comparative Study

MeSH terms

  • Animals
  • Base Sequence
  • DNA / chemistry
  • DNA / genetics
  • Evolution, Molecular
  • Genetic Variation / genetics*
  • Genome
  • HMG-Box Domains / genetics
  • High Mobility Group Proteins / genetics*
  • Molecular Sequence Data
  • Multigene Family / genetics*
  • Phylogeny*
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Sequence Homology, Amino Acid
  • Takifugu / genetics*
  • Terminology as Topic
  • Transcription Factors / genetics*

Substances

  • High Mobility Group Proteins
  • Transcription Factors
  • DNA

Associated data

  • GENBANK/AY277950
  • GENBANK/AY277951
  • GENBANK/AY277952
  • GENBANK/AY277953
  • GENBANK/AY277954
  • GENBANK/AY277955
  • GENBANK/AY277956
  • GENBANK/AY277957
  • GENBANK/AY277958
  • GENBANK/AY277959
  • GENBANK/AY277960
  • GENBANK/AY277961
  • GENBANK/AY277962
  • GENBANK/AY277963
  • GENBANK/AY277964
  • GENBANK/AY277965
  • GENBANK/AY277966
  • GENBANK/AY277967
  • GENBANK/AY277968
  • GENBANK/AY277969
  • GENBANK/AY277970
  • GENBANK/AY277971
  • GENBANK/AY277972
  • GENBANK/AY277973