Accurate whole-genome sequencing and haplotyping from 10 to 20 human cells

Nature. 2012 Jul 11;487(7406):190-5. doi: 10.1038/nature11236.

Abstract

Recent advances in whole-genome sequencing have brought the vision of personal genomics and genomic medicine closer to reality. However, current methods lack clinical accuracy and the ability to describe the context (haplotypes) in which genome variants co-occur in a cost-effective manner. Here we describe a low-cost DNA sequencing and haplotyping process, long fragment read (LFR) technology, which is similar to sequencing long single DNA molecules without cloning or separation of metaphase chromosomes. In this study, ten LFR libraries were made using only ∼100 picograms of human DNA per sample. Up to 97% of the heterozygous single nucleotide variants were assembled into long haplotype contigs. Removal of false positive single nucleotide variants not phased by multiple LFR haplotypes resulted in a final genome error rate of 1 in 10 megabases. Cost-effective and accurate genome sequencing and haplotyping from 10-20 human cells, as demonstrated here, will enable comprehensive genetic studies and diverse clinical applications.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Alleles
  • Cell Line
  • Female
  • Gene Silencing
  • Genetic Variation
  • Genome, Human*
  • Genomics / methods*
  • Haplotypes
  • Humans
  • Mutation
  • Reproducibility of Results
  • Sequence Analysis, DNA / economics
  • Sequence Analysis, DNA / methods*
  • Sequence Analysis, DNA / standards