Deep sequencing of small RNAs identifies canonical and non-canonical miRNA and endogenous siRNAs in mammalian somatic tissues

Nucleic Acids Res. 2013 Mar 1;41(5):3339-51. doi: 10.1093/nar/gks1474. Epub 2013 Jan 15.

Abstract

MicroRNAs (miRNAs) are small RNA molecules that regulate gene expression. They are characterized by specific maturation processes defined by canonical and non-canonical biogenic pathways. Analysis of ∼0.5 billion sequences from mouse data sets derived from different tissues, developmental stages and cell types, partly characterized by either ablation or mutation of the main proteins belonging to miRNA processor complexes, reveals 66 high-confidence new genomic loci coding for miRNAs that could be processed in a canonical or non-canonical manner. A proportion of the newly discovered miRNAs comprises mirtrons, for which we define a new sub-class. Notably, some of these newly discovered miRNAs are generated from untranslated and open reading frames of coding genes, and we experimentally validate these. We also show that many annotated miRNAs do not present miRNA-like features, as they are neither processed by known processing complexes nor loaded on AGO2; this indicates that the current miRNA miRBase database list should be refined and re-defined. Accordingly, a group of them map on ribosomal RNA molecules, whereas others cannot undergo genuine miRNA biogenesis. Notably, a group of annotated miRNAs are Dgcr8 independent and DICER dependent endogenous small interfering RNAs that derive from a unique hairpin formed from a short interspersed nuclear element.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Pairing
  • Base Sequence
  • Conserved Sequence
  • DEAD-box RNA Helicases / genetics
  • Embryonic Stem Cells / metabolism
  • Gene Expression
  • HEK293 Cells
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • Mice
  • Mice, Knockout
  • MicroRNAs / classification
  • MicroRNAs / genetics*
  • MicroRNAs / metabolism
  • Molecular Sequence Annotation
  • Molecular Sequence Data
  • NIH 3T3 Cells
  • Nucleic Acid Conformation
  • Organ Specificity
  • Proteins / genetics
  • RNA, Small Interfering / genetics*
  • RNA, Small Interfering / metabolism
  • RNA-Binding Proteins
  • Ribonuclease III / genetics
  • Sequence Analysis, RNA
  • Transcriptome

Substances

  • Dgcr8 protein, mouse
  • MicroRNAs
  • Proteins
  • RNA, Small Interfering
  • RNA-Binding Proteins
  • Dicer1 protein, mouse
  • Ribonuclease III
  • DEAD-box RNA Helicases