MitoFish and MitoAnnotator: a mitochondrial genome database of fish with an accurate and automatic annotation pipeline

Mol Biol Evol. 2013 Nov;30(11):2531-40. doi: 10.1093/molbev/mst141. Epub 2013 Aug 16.

Abstract

Mitofish is a database of fish mitochondrial genomes (mitogenomes) that includes powerful and precise de novo annotations for mitogenome sequences. Fish occupy an important position in the evolution of vertebrates and the ecology of the hydrosphere, and mitogenomic sequence data have served as a rich source of information for resolving fish phylogenies and identifying new fish species. The importance of a mitogenomic database continues to grow at a rapid pace as massive amounts of mitogenomic data are generated with the advent of new sequencing technologies. A severe bottleneck seems likely to occur with regard to mitogenome annotation because of the overwhelming pace of data accumulation and the intrinsic difficulties in annotating sequences with degenerating transfer RNA structures, divergent start/stop codons of the coding elements, and the overlapping of adjacent elements. To ease this data backlog, we developed an annotation pipeline named MitoAnnotator. MitoAnnotator automatically annotates a fish mitogenome with a high degree of accuracy in approximately 5 min; thus, it is readily applicable to data sets of dozens of sequences. MitoFish also contains re-annotations of previously sequenced fish mitogenomes, enabling researchers to refer to them when they find annotations that are likely to be erroneous or while conducting comparative mitogenomic analyses. For users who need more information on the taxonomy, habitats, phenotypes, or life cycles of fish, MitoFish provides links to related databases. MitoFish and MitoAnnotator are freely available at http://mitofish.aori.u-tokyo.ac.jp/ (last accessed August 28, 2013); all of the data can be batch downloaded, and the annotation pipeline can be used via a web interface.

Keywords: database; fish; genome annotator; high-throughput sequencing; mitochondrion; phylogenetics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Databases, Genetic*
  • Evolution, Molecular
  • Fishes / genetics*
  • Genome, Mitochondrial*
  • Genomics
  • High-Throughput Nucleotide Sequencing
  • Molecular Sequence Annotation / methods*
  • Phylogeny
  • RNA, Transfer / genetics
  • Software

Substances

  • RNA, Transfer