Repetitive sequences in plant nuclear DNA: types, distribution, evolution and function

Genomics Proteomics Bioinformatics. 2014 Aug;12(4):164-71. doi: 10.1016/j.gpb.2014.07.003. Epub 2014 Aug 15.

Abstract

Repetitive DNA sequences are a major component of eukaryotic genomes and may account for up to 90% of the genome size. They can be divided into minisatellite, microsatellite and satellite sequences. Satellite DNA sequences are considered to be a fast-evolving component of eukaryotic genomes, comprising tandemly-arrayed, highly-repetitive and highly-conserved monomer sequences. The monomer unit of satellite DNA is 150-400 base pairs (bp) in length. Repetitive sequences may be species- or genus-specific, and may be centromeric or subtelomeric in nature. They exhibit cohesive and concerted evolution caused by molecular drive, leading to high sequence homogeneity. Repetitive sequences accumulate variations in sequence and copy number during evolution, hence they are important tools for taxonomic and phylogenetic studies, and are known as "tuning knobs" in the evolution. Therefore, knowledge of repetitive sequences assists our understanding of the organization, evolution and behavior of eukaryotic genomes. Repetitive sequences have cytoplasmic, cellular and developmental effects and play a role in chromosomal recombination. In the post-genomics era, with the introduction of next-generation sequencing technology, it is possible to evaluate complex genomes for analyzing repetitive sequences and deciphering the yet unknown functional potential of repetitive sequences.

Keywords: Concerted evolution; Dispersed; Next-generation sequencing; Repetitive sequences; Satellites; Tandem.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Biological Evolution*
  • Cell Nucleus / genetics*
  • Chromosomes, Plant / genetics*
  • DNA, Plant / genetics*
  • Plants / genetics*
  • Repetitive Sequences, Nucleic Acid / genetics*

Substances

  • DNA, Plant