Independent origin of MIRNA genes controlling homologous target genes by partial inverted duplication of antisense-transcribed sequences

Plant J. 2020 Jan;101(2):401-419. doi: 10.1111/tpj.14550. Epub 2019 Nov 26.

Abstract

Some microRNAs (miRNAs) are key regulators of developmental processes, mainly by controlling the accumulation of transcripts encoding transcription factors that are important for morphogenesis. MADS-box genes encode a family of transcription factors which control diverse developmental processes in flowering plants. Here we study the convergent evolution of two MIRNA (MIR) gene families, named MIR444 and MIR824, targeting members of the same clade of MIKC^C-group MADS-box genes. We show that these two MIR genes most likely originated independently in monocots (MIR444) and in Brassicales (eudicots, MIR824). We provide evidence that, in both cases, the future target gene was transcribed in antisense prior to the evolution of the MIR genes. Both MIR genes then likely originated by a partial inverted duplication of their target genes, resulting in natural antisense organization of the newly evolved MIR gene and its target gene at birth. We thus propose a model for the origin of MIR genes, MEPIDAS (MicroRNA Evolution by Partial Inverted Duplication of Antisense-transcribed Sequences). MEPIDAS is a refinement of the inverted duplication hypothesis. According to MEPIDAS, a MIR gene evolves at a genomic locus at which the future target gene is also transcribed in the antisense direction. A partial inverted duplication at this locus causes the antisense transcript to fold into a stem-loop structure that is recognized by the miRNA biogenesis machinery to produce a miRNA that regulates the gene at this locus. Our analyses exemplify how to elucidate the origin of conserved miRNAs by comparative genomics and will guide future studies. OPEN RESEARCH BADGE: This article has earned an Open Data Badge for making publicly available the digitally-shareable data necessary to reproduce the reported results. The data is available at https://www.ncbi.nlm.nih.gov/genbank/.
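The core step of the MEPIDAS model (an inverted duplication making an antisense transcript self-complementary) can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the sequence, the coordinates, the loop, and the function names are invented and are not data or methods from the article. Real miRNA precursors contain mismatches and bulges, so the perfect pairing below is a deliberate simplification.

```python
# Toy illustration only: sequence, coordinates, and loop are invented,
# not taken from the article.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    """Return the reverse complement of an RNA string (A, C, G, U only)."""
    return rna.translate(COMPLEMENT)[::-1]


def antisense_with_inverted_duplication(sense, start, end, loop):
    """Model an antisense transcript whose segment [start:end) is followed
    by a loop and an inverted (reverse-complement) copy of that segment."""
    antisense = reverse_complement(sense)
    arm = antisense[start:end]
    folded = antisense[:end] + loop + reverse_complement(arm)
    return arm, folded


# Hypothetical sense mRNA of the future target gene.
sense_mrna = "AUGGCUAGCUUACGGAUCCGUAACUG"
arm, transcript = antisense_with_inverted_duplication(sense_mrna, 4, 16, "GAAA")

# The inverted copy can base-pair with the original arm, forming a stem.
stem_forms = reverse_complement(transcript[-len(arm):]) == arm

# A small RNA taken from the antisense arm is complementary to the sense mRNA.
targets_sense = reverse_complement(arm) in sense_mrna

print(stem_forms, targets_sense)
```

In this simplified picture, the same duplication event both creates the stem-loop and guarantees complementarity to the gene at the locus, which is the property the model relies on.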

Keywords: Dioscorea bulbifera; Tarenaya hassleriana; AGL17-like gene; MADS-box gene; gene birth; plant development; transcription factor.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics
  • Arabidopsis Proteins / genetics
  • Evolution, Molecular
  • Gene Duplication
  • Gene Expression Regulation, Plant
  • Genes, Plant / genetics*
  • Genomics
  • MADS Domain Proteins / genetics
  • Magnoliopsida / genetics
  • MicroRNAs / genetics*
  • Phylogeny
  • Plant Development
  • Transcription Factors / genetics*

Substances

  • AGL17 protein, Arabidopsis
  • Arabidopsis Proteins
  • MADS Domain Proteins
  • MicroRNAs
  • Transcription Factors

Associated data

  • GENBANK/KY094495
  • GENBANK/KY172122
  • GENBANK/KY319139
  • GENBANK/KY742743
  • GENBANK/KY742744
  • GENBANK/KY742745
  • GENBANK/KY742746
  • GENBANK/KY742747
  • GENBANK/KY742748
  • GENBANK/KY742749
  • GENBANK/KY742750
  • GENBANK/KY742751
  • GENBANK/KY742752
  • GENBANK/KY742753
  • GENBANK/KY742754
  • GENBANK/KY742755
  • GENBANK/KY774629
  • GENBANK/KY774630
  • GENBANK/KY774631
  • GENBANK/KY774632
  • GENBANK/KY774633
  • GENBANK/KY774634
  • GENBANK/KY774635
  • GENBANK/KY774636
  • GENBANK/KY774637
  • GENBANK/KY774638
  • GENBANK/KY774639
  • GENBANK/KY774640
  • GENBANK/KY774641
  • GENBANK/KY774642
  • GENBANK/KY774643
  • GENBANK/KY774644
  • GENBANK/KY774645
  • GENBANK/KY774646
  • GENBANK/KY774647
  • GENBANK/KY774648
  • GENBANK/KY774649
  • GENBANK/KY774650
  • GENBANK/KY774651
  • GENBANK/KY774652