IMPUTOR: Phylogenetically Aware Software for Imputation of Errors in Next-Generation Sequencing

Genome Biol Evol. 2018 Apr 1;10(5):1248-1254. doi: 10.1093/gbe/evy088.

Abstract

We introduce IMPUTOR, software for phylogenetically aware imputation of missing haploid nonrecombining genomic data. Targeted for next-generation sequencing data, IMPUTOR uses the principle of parsimony to impute data marked as missing due to low coverage. Along with efficiently imputing missing variant genotypes, IMPUTOR is capable of reliably and accurately correcting many nonmissing sites that represent spurious sequencing errors. Tests on simulated data show that IMPUTOR is capable of detecting many induced mutations without making erroneous imputations/corrections, with as many as 95% of missing sites imputed and 81% of errors corrected under optimal conditions. We tested IMPUTOR with human Y-chromosomes from pairs of close relatives and demonstrate IMPUTOR's efficacy in imputing missing and correcting erroneous calls.

Publication types

  • Research Support, Non-U.S. Gov't
  • Technical Report

MeSH terms

  • Algorithms*
  • Chromosomes, Human, Y / classification
  • Chromosomes, Human, Y / genetics
  • Gene Frequency
  • Genome-Wide Association Study
  • Genomics / methods
  • Haplotypes
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Male
  • Phylogeny*
  • Polymorphism, Single Nucleotide
  • Software*