U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



NA19240_prelim_2.0

Description:
MGI Reference Genomes Project NA19240 preliminary assembly version 2
Organism name:
Homo sapiens (human)
Isolate:
NA19240
Sex:
female
BioSample:
SAMN03838746
BioProject:
PRJNA288807
Submitter:
The Genome Institute at Washington University School of Medicine
Date:
2017/05/19
Assembly level:
Contig
Genome representation:
full
GenBank assembly accession:
GCA_001524155.2 (replaced)
RefSeq assembly accession:
n/a
RefSeq assembly and GenBank assembly identical:
n/a
WGS Project:
LKPB02
Assembly method:
Falcon v. November 2016
Genome coverage:
73x
Sequencing technology:
PacBio

IDs: 1013781 [UID] 4162648 [GenBank]

See Genome Information for Homo sapiens

There are 1094 assemblies for this organism

See more

History (Show revision history)

Comment

This chromosome-level assembly of the NA19420 genome, NA19240_prelim_3.0, is a draft and represents a work in progress. It will subsequently be re-submitted with BACs incorporated into regions of the genome that are difficult to assemble. Also, single allelic representations ... of specific regions will be added when available.
Sequence Assembly Release Notes for Homo sapiens NA19240_prelim_3.
 Background:
 DNA used for shotgun sequencing is derived from the blood, b-lymphocytes cells, of an adult female, identified as NA19240 (Coriell Institute for Medical Research). The NA19420 genome is diploid and from a Yoruban family trio Y117. Sequence from this project will be used to improve the contiguity of the human reference sequence and add diverse allelic variation.
 Total sequence (subreads) input coverage on the PacBio RS II instrument was 70x prior to error correction using a genome size estimate of 3Gb. The combined sequence reads were assembled using the Falcon software, and then error corrected using the Quiver and Pilon algorithms. Contigs of 200 bp or less have been excluded from NA19240_prelim_3.0.
 This work was supported by the NHGRI 'Improving The Human Reference Genome Resource' grant no. 5U41HG007635 to Richard K. Wilson, at the McDonnell Genome Institute, Washington University School of Medicine.
 DNA Source Contact: Dr. Fedik Rahimov, at the Coriell Institute for Medical Research.  more

Global statistics

Total sequence length2,874,719,792
Total ungapped length2,874,719,792
Number of contigs2,951
Contig N5025,740,126
Contig L5034
Total number of chromosomes and plasmids0
Number of component sequences (WGS or clone)2,951

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
The primary assembly unit does not have any assembled chromosomes or linkage groups.
Please download the full sequence report for information on the scaffolds.

Assembly statistics

MoleculeTotal
Length
Contig
Count
Ungapped
Length
Contig
N50
Spanned
Gaps
Unspanned
Gaps
unplaced2,874,719,7922,9512,874,719,79225,740,12600