What is an archaeon and are the Archaea really unique?

PeerJ. 2018 Oct 18:6:e5770. doi: 10.7717/peerj.5770. eCollection 2018.

Abstract

The recognition of the group Archaea as a major branch of the tree of life (ToL) prompted a new view of the evolution of biodiversity. The genomic representation of archaeal biodiversity has since significantly increased. In addition, advances in phylogenetic modeling of multi-locus datasets have resolved many recalcitrant branches of the ToL. Despite the technical advances and an expanded taxonomic representation, two important aspects of the origins and evolution of the Archaea remain controversial, even as we celebrate the 40th anniversary of the monumental discovery. These issues concern (i) the uniqueness (monophyly) of the Archaea, and (ii) the evolutionary relationships of the Archaea to the Bacteria and the Eukarya; both of these are relevant to the deep structure of the ToL. To explore the causes for this persistent ambiguity, I examine multiple datasets and different phylogenetic approaches that support contradicting conclusions. I find that the uncertainty is primarily due to a scarcity of information in standard datasets-universal core-genes datasets-to reliably resolve the conflicts. These conflicts can be resolved efficiently by comparing patterns of variation in the distribution of functional genomic signatures, which are less diffused unlike patterns of primary sequence variation. Relatively lower heterogeneity in distribution patterns minimizes uncertainties and supports statistically robust phylogenetic inferences, especially of the earliest divergences of life. This case study further highlights the limitations of primary sequence data in resolving difficult phylogenetic problems, and raises questions about evolutionary inferences drawn from the analyses of sequence alignments of a small set of core genes. In particular, the findings of this study corroborate the growing consensus that reversible substitution mutations may not be optimal phylogenetic markers for resolving early divergences in the ToL, nor for determining the polarity of evolutionary transitions across the ToL.

Keywords: Archaea; Asgard; Chimeric genome; Clade; Directional evolution; Genome fusion; Non-stationary; Phylogenomics; Rooting; Tree of life.

Grants and funding

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors. Work by this author was partially supported by the Swedish Research Council (to Måns Ehrenberg) and the Knut and Alice Wallenberg Foundation, RiboCORE (to Måns Ehrenberg and Dan Andersson). Article processing charge was supported by research grants from the Swedish Research Council, Research Environment Grant dnr: 2016-06264 and Knut and Alice Wallenberg Foundation, KAW 2017.0055 (to Suparna Sanyal). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.