Bayes estimators for phylogenetic reconstruction

P M Huggins; W Li; D Haws; T Friedrich; J Liu; R Yoshida

doi:10.1093/sysbio/syr021

Syst Biol. 2011 Jul;60(4):528-40. doi: 10.1093/sysbio/syr021. Epub 2011 Apr 6.

Authors

P M Huggins¹, W Li, D Haws, T Friedrich, J Liu, R Yoshida

Affiliation

¹ Lane Center for Computational Biology, Carnegie Mellon University, Mellon Institute Building, 4400 Fifth Avenue, Pittsburgh, PA 15213, USA.

Abstract

Tree reconstruction methods are often judged by their accuracy, measured by how close they get to the true tree. Yet most reconstruction methods, such as maximum likelihood (ML), do not explicitly maximize this accuracy. To address this problem, we propose a Bayesian solution. Given tree samples, we propose finding the tree estimate that is closest on average to the samples. This "median" tree is known as the Bayes estimator (BE). The BE literally maximizes posterior expected accuracy, measured in terms of closeness (distance) to the true tree. We discuss a unified framework of BE trees, focusing especially on tree distances that are expressible as squared Euclidean distances. Notable examples include Robinson-Foulds (RF) distance, quartet distance, and squared path difference. Using both simulated and real data, we show that BEs can be estimated in practice by hill-climbing. In our simulation, we find that BEs tend to be closer to the true tree, compared with ML and neighbor joining. In particular, the BE under squared path difference tends to perform well in terms of both path difference and RF distances.
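
As an illustration of the idea in the abstract, the following minimal Python sketch treats each tree as its set of nontrivial splits, computes RF distance as the size of the symmetric difference of two split sets (which equals the squared Euclidean distance between the 0/1 split indicator vectors), and picks the sampled tree with the smallest mean distance to all samples. All function names and the toy five-taxon samples are hypothetical, and restricting the candidates to the sampled trees is a simplifying assumption: the abstract describes hill-climbing, which this sketch does not implement.

```python
# Each tree is a frozenset of nontrivial splits; each split is stored as the
# frozenset of taxa on the side that does not contain the reference taxon "E".

def rf_distance(splits_a, splits_b):
    """Unnormalized RF distance: number of splits found in exactly one tree."""
    return len(splits_a ^ splits_b)

def squared_euclidean(splits_a, splits_b):
    """Squared Euclidean distance between 0/1 split indicator vectors."""
    all_splits = splits_a | splits_b  # coordinates where either vector is nonzero
    return sum((int(s in splits_a) - int(s in splits_b)) ** 2 for s in all_splits)

def mean_distance(candidate, samples):
    """Average RF distance from a candidate tree to the sampled trees."""
    return sum(rf_distance(candidate, t) for t in samples) / len(samples)

def closest_sampled_tree(samples):
    """Assumption: search only among the samples, not all of tree space."""
    return min(samples, key=lambda t: mean_distance(t, samples))

# Hypothetical toy trees on taxa {A, B, C, D, E}.
T1 = frozenset({frozenset("AB"), frozenset("ABC")})  # ((A,B),C,(D,E))
T2 = frozenset({frozenset("AB"), frozenset("ABD")})  # ((A,B),D,(C,E))
T3 = frozenset({frozenset("AC"), frozenset("ABC")})  # ((A,C),B,(D,E))
samples = [T1, T1, T2, T3]

best = closest_sampled_tree(samples)
print(sorted("".join(sorted(s)) for s in best), mean_distance(best, samples))
```

In this toy sample T1 has mean distance 1.0 against 2.0 for T2 and T3, so it is selected. A search over all of tree space could in principle find a tree with a lower mean distance than any sampled tree.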

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Animals
Bayes Theorem
Computer Simulation
Models, Genetic*
Phylogeny*
Software
Urodela / classification
Urodela / genetics
