U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



Hs_NA19240-1.0

Organism name:
Homo sapiens (human)
Isolate:
NA19240
Sex:
female
BioSample:
SAMN03838746
BioProject:
PRJNA288807
Submitter:
The Genome Institute at Washington University School of Medicine
Date:
2016/08/22
Assembly level:
Chromosome
Genome representation:
full
GenBank assembly accession:
GCA_001524155.1 (replaced)
RefSeq assembly accession:
n/a
RefSeq assembly and GenBank assembly identical:
n/a
WGS Project:
LKPB01
Assembly method:
Falcon v. September 2015
Genome coverage:
30x
Sequencing technology:
PacBio

IDs: 632211 [UID] 2842968 [GenBank]

See Genome Information for Homo sapiens

There are 1094 assemblies for this organism

See more

History (Show revision history)

Comment

Sequence Assembly Release Notes - Homo sapiens NA19240
 Background:
 The NA19240 DNA for shotgun sequencing is derived from the blood, b-lymphocytes cells, of an adult female from the Coriell Institute for Medical Research. The NA19240 genome is diploid and ... from a Yoruban family trio Y117. Sequence from this project will be used to improve the contiguity of the human reference sequence and add diverse allelic variation.
 Total sequence genome input coverage on the PacBio RS II instrument was 30x after error correction (74x prior to error correction) using a genome size estimate of 3Gb. The combined sequence reads were assembled using the Falcon software, and then error corrected with the Quiver algorithm. This NA19240 1.0 version has been cleaned of contaminating contigs, and contigs 200bp and less were removed. The assembly is made up of a total of 3600 contigs, with an N50 contig length of 5.83Mb. The assembly spans 2.74Gb.
 This work was supported by the NHGRI 'Improving The Human Reference Genome Resource' grant no. 5U41HG007635 to Richard K. Wilson, at the McDonnell Genome Institute, Washington University School of Medicine.
 DNA Source Contact: Dr. Fedik Rahimov, at the Coriell Institute for Medical Research, 403 Haddon Ave, Camden, NJ 08103 or request DNA through this link: https://catalog.coriell.org/0/Sections/Search/Sample_Detail.aspx?Ref=GM19240
 H. sapiens NA19240 1.0 Sequence and Assembly Credits:
 DNA source - Dr. Fedik Rahimov, at the Coriell Institute for Medical Research. Genome Sequence - McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO. Sequence Assembly - McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO.

 Citation upon use of this assembly in a manuscript:
 It is requested that users of this H. sapiens NA19240 1.0 sequence assembly acknowledge Richard K. Wilson and McDonnell Genome Institute, Washington University School of Medicine in any publications that result from use of this sequence assembly. 

 H. sapiens NA19240 1.0 assembly statistics: (scaffold = supercontig)
 *** Contiguity: PacBio Contigs *** Total contig number: 3600 Total contig bases: 2744417834 bp Average contig length: 762338 bp Maximum contig length: 20393869 bp N50 contig length: 5831088 bp N50 contig number: 139
 *** PacBio Contig Distribution*** Contigs > 1M: 546 Contigs 250K--1M: 378 Contigs 100K--250K: 300 Contigs 10--100K: 1743 Contigs 5--10K: 293 Contigs 2--5K: 191 Contigs 0--2K: 101  more

Global statistics

Total sequence length3,078,993,553
Total ungapped length2,806,503,622
Gaps between scaffolds204
Number of scaffolds1,588
Scaffold N5078,547,065
Scaffold L5015
Number of contigs3,189
Contig N507,249,122
Contig L50111
Total number of chromosomes and plasmids23
Number of component sequences (WGS or clone)3,603

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
Assembly Unit: Primary Assembly (GCA_001524165.1)
Molecule nameGenBank sequenceRefSeq sequenceUnlocalized
sequences count
Chromosome 1CM003683.1n/an/a0
Chromosome 2CM003684.1n/an/a0
Chromosome 3CM003685.1n/an/a0
Chromosome 4CM003686.1n/an/a0
Chromosome 5CM003687.1n/an/a0
Chromosome 6CM003688.1n/an/a0
Chromosome 7CM003689.1n/an/a0
Chromosome 8CM003690.1n/an/a0
Chromosome 9CM003691.1n/an/a0
Chromosome 10CM003692.1n/an/a0
Chromosome 11CM003693.1n/an/a0
Chromosome 12CM003694.1n/an/a0
Chromosome 13CM003695.1n/an/a0
Chromosome 14CM003696.1n/an/a0
Chromosome 15CM003697.1n/an/a0
Chromosome 16CM003698.1n/an/a0
Chromosome 17CM003699.1n/an/a0
Chromosome 18CM003700.1n/an/a0
Chromosome 19CM003701.1n/an/a0
Chromosome 20CM003702.1n/an/a0
Chromosome 21CM003703.1n/an/a0
Chromosome 22CM003704.1n/an/a0
Chromosome XCM003705.1n/an/a0
unplacedn/an/an/a1,468

Assembly statistics

MoleculeTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
All3,078,993,5531,5882,806,503,62278,547,0651,601204
Chromosome 1248,231,5146219,578,764121,017,9601329
Chromosome 2242,558,7216233,409,538147,607,3661159
Chromosome 3198,197,2273194,194,252104,422,452466
Chromosome 4190,258,8686184,829,363121,342,101729
Chromosome 5180,829,5884174,589,37798,154,018607
Chromosome 6171,554,2962167,033,562110,140,878545
Chromosome 7159,573,1364149,116,86996,864,56813010
Chromosome 8142,975,3024140,602,38271,342,244477
Chromosome 9138,660,40314110,269,50370,054,6279818
Chromosome 10134,428,0912123,368,32491,951,482735
Chromosome 11134,506,9782129,075,10880,617,519596
Chromosome 12133,893,0583129,348,89995,004,741636
Chromosome 13115,312,707595,128,47679,840,5992710
Chromosome 14107,765,308187,100,71488,705,308265
Chromosome 15102,068,414378,357,91378,547,065557
Chromosome 1690,996,388474,623,66143,806,0031058
Chromosome 1781,885,629575,769,19254,773,290778
Chromosome 1877,906,405261,822,99658,109,100316
Chromosome 1958,975,477253,415,32831,354,8581037
Chromosome 2064,593,076759,294,69833,017,9403710
Chromosome 2147,834,4911734,302,91333,779,0653322
Chromosome 2251,669,9061133,840,56731,266,7845614
Chromosome X154,646,2757147,758,92839,075,81310210
unplaced49,672,2951,46849,672,29574,21000