Pan_troglodytes-2.1.3

Organism name: Pan troglodytes (chimpanzee)
Isolate: Yerkes chimp pedigree #C0471 (Clint)
Sex: male
BioSample: SAMN02981217
BioProject: PRJNA13184
Submitter: Chimpanzee Sequencing and Analysis Consortium
Date: 2010/11/15
Assembly level: Chromosome
Genome representation: full
GenBank assembly accession: GCA_000001515.2 (replaced)
RefSeq assembly accession: n/a
RefSeq assembly and GenBank assembly identical: n/a
WGS Project: AACZ03
Assembly method: PCAP
Genome coverage: 6x
Sequencing technology: Sanger

IDs: 202158 [UID] 202158 [GenBank]

There are 10 assemblies for this organism

Comment

This assembly, version 2.1.4, has an updated chromosome Y compared to version 2.1.3. Assembly 2.1.3 improved on the 2.1 chimp assembly by adding over 300,000 finishing reads and merging in 640 finished BACs. There were approximately ... 49,000 additional merges made in that assembly as compared to the 2.1 assembly.

Sequencing/Assembly: The whole genome shotgun sequence data were assembled and organized by the Washington University Genome Center. The underlying whole genome shotgun data were generated at the Washington University School of Medicine and the Broad Institute. An approximately 5 megabase region of chromosome 7 was finished at the Washington University Genome Sequencing Center (chr7:84674857-89461887). The chromosome Y sequence was finished at the Washington University Genome Sequencing Center with detailed mapping and extensive collaboration with David Page's group at the Whitehead Institute (The DNA Sequence of Chimpanzee Chromosome Y, unpublished; Hughes et al., Conservation of Y-linked genes during human evolution revealed by comparative sequencing in chimpanzee. Nature. 2005;437:100-3. PMID: 16136134). The chromosome 21 sequence data were kindly provided by Todd Taylor and the RIKEN Genome Sciences Center (Watanabe et al., DNA sequence and comparative analysis of chimpanzee chromosome 22. Nature. 2004 May 27;429(6990):382-8. PMID: 15164055).
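The finished chromosome 7 region is given as a coordinate string. As a minimal sketch, the span it covers can be computed as below. The helper names `parse_region` and `region_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_region(region: str) -> tuple[str, int, int]:
    """Split a 'chrom:start-end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)-(\d+)", region)
    if match is None:
        raise ValueError(f"not a chrom:start-end region: {region!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {region!r}")
    return chrom, start, end


def region_length(region: str) -> int:
    """Length in bases, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_region(region)
    return end - start + 1


print(region_length("chr7:84674857-89461887"))  # 4787031
```

Under that assumption the region spans about 4.79 Mb, which is consistent with the rounded "5 megabase" description.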

This assembly covers about 97 percent of the genome and is based on 6X sequence coverage. It is composed of 192,898 contigs with an N50 length of 44 kb, and 33,990 supercontigs with an N50 length of 8.4 Mb.
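For readers unfamiliar with the N50 statistic quoted here, the sketch below uses the common definition: sort the sequences from longest to shortest and report the length at which the running total first reaches half of the summed length. L50 is the number of sequences needed to get there. The function name `n50_l50` and the toy lengths are illustrative only; they are not taken from the assembly.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("at least one length is required")
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            # 'length' is the N50; 'count' sequences were needed (L50).
            return length, count


# Toy data: total 300, so half is 150; 100 + 80 = 180 crosses it.
toy_lengths = [40, 100, 20, 80, 60]
print(n50_l50(toy_lengths))  # (80, 2)
```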

The whole genome shotgun data from primary donor-derived reads (Clint, a captive-born male chimpanzee from the Yerkes Primate Research Center, Atlanta, USA) were assembled with PCAP (Huang 2006). Stringent parameters were derived by eliminating detectable global mis-assemblies larger than 50 kb (interchromosomal cross-overs determined by aligning the chimpanzee genome against the human genome).

The assembly data were aligned against the human genome at UCSC (B. Raney) utilizing BLASTZ (Schwartz 2003) to align and score non-repetitive chimpanzee regions against repeat-masked human sequence. Alignment chains differentiated between orthologous and paralogous alignments (Kent 2003) and only 'reciprocal best' alignments were retained in the alignment set. The chimpanzee AGP files were generated from these alignments in a manner similar to that already described (The Chimpanzee Genome Sequencing Consortium 2005). Centromeres were introduced into the chimpanzee sequence at the positions of the centromeres in the human chromosomes. Ten documented human inversions (Yunis 1982) supported by the assembly were introduced into the ordering, as was the separation of alignments to human chromosome 2 into chimpanzee chromosomes 2A and 2B. We removed the contigs from the WGS project that corresponded to the finished chromosome 21 and chromosome Y sequences, as well as the contig corresponding to chromosome 7, because they are represented by the corresponding finished sequences. The chromosome 21 sequence is GenBank Accession Number BA000046 and the chromosome Y sequence is GenBank Accession Number DP000054.
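The 'reciprocal best' idea can be illustrated with a simplified sketch: a pairing is kept only when the chimpanzee sequence's highest-scoring human match also has that chimpanzee sequence as its own highest-scoring match. This is not the UCSC chain-based pipeline described above. The function `reciprocal_best`, the identifiers, and the scores are all made up for illustration, and ties are resolved by keeping the first hit seen.

```python
def reciprocal_best(hits):
    """Return the set of (query, target) pairs that are mutual best hits.

    hits: iterable of (query, target, score) tuples.
    """
    best_for_query = {}   # query  -> (target, score)
    best_for_target = {}  # target -> (query, score)
    for query, target, score in hits:
        if query not in best_for_query or score > best_for_query[query][1]:
            best_for_query[query] = (target, score)
        if target not in best_for_target or score > best_for_target[target][1]:
            best_for_target[target] = (query, score)
    return {
        (query, target)
        for query, (target, _) in best_for_query.items()
        if best_for_target[target][0] == query
    }


# Hypothetical alignment scores (not real data).
toy_hits = [
    ("contig_A", "hs_region_1", 900),
    ("contig_A", "hs_region_2", 400),
    ("contig_B", "hs_region_1", 700),
    ("contig_B", "hs_region_3", 650),
    ("contig_C", "hs_region_3", 800),
]
print(sorted(reciprocal_best(toy_hits)))
```

In the toy data, contig_B is dropped: its best match is hs_region_1, but that region's best match is contig_A.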

The Nov. 2010 version of chromosome 7, CM000321.3, is assembled from BACs plus contigs of the previous version of the WGS project, AACZ02000000. With the release of CM000321.3, nearly 700 unlocalized chromosome 7 scaffolds that were redundant with the new chr7 were suppressed, and some AACZ03000000 contigs are not part of assembly Pan_troglodytes-2.1.3 or Pan_troglodytes-2.1.4.

Global statistics

Total sequence length: 3,307,943,878
Total ungapped length: 2,900,544,493
Gaps between scaffolds: 2,927
Number of scaffolds: 26,996
Scaffold N50: 9,141,361
Scaffold L50: 84
Number of contigs: 183,905
Contig N50: 50,595
Contig L50: 13,683
Total number of chromosomes and plasmids: 25
Number of component sequences (WGS or clone): 185,379
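As a small worked example, the total and ungapped lengths above can be combined to estimate how much of the assembly consists of gap placeholders. This assumes that the difference between the two figures is entirely gap sequence; the variable names are illustrative.

```python
# Values copied from the global statistics above.
total_length = 3_307_943_878
ungapped_length = 2_900_544_493

# Assumption: total minus ungapped length equals the bases in gaps.
gap_bases = total_length - ungapped_length
ungapped_fraction = ungapped_length / total_length

print(f"{gap_bases:,} gap bases")              # 407,399,385 gap bases
print(f"{ungapped_fraction:.1%} ungapped")     # 87.7% ungapped
```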

Global assembly definition

Assembly Unit Name: Primary Assembly
Assembly Unit: Primary Assembly (GCA_000000075.2)

Molecule name | GenBank sequence | (unlabeled) | RefSeq sequence | Unlocalized sequences count
Chromosome 1 | CM000314.2 | n/a | n/a | 1,554
Chromosome 2A | CM000315.2 | n/a | n/a | 693
Chromosome 2B | CM000316.2 | n/a | n/a | 684
Chromosome 3 | CM000317.2 | n/a | n/a | 1,156
Chromosome 4 | CM000318.2 | n/a | n/a | 1,260
Chromosome 5 | CM000319.2 | n/a | n/a | 1,049
Chromosome 6 | CM000320.2 | n/a | n/a | 1,326
Chromosome 7 | CM000321.3 | n/a | n/a | 505
Chromosome 8 | CM000322.3 | n/a | n/a | 1,031
Chromosome 9 | CM000323.2 | n/a | n/a | 968
Chromosome 10 | CM000324.2 | n/a | n/a | 1,485
Chromosome 11 | CM000325.2 | n/a | n/a | 825
Chromosome 12 | CM000326.2 | n/a | n/a | 785
Chromosome 13 | CM000327.2 | n/a | n/a | 539
Chromosome 14 | CM000328.2 | n/a | n/a | 537
Chromosome 15 | CM000329.2 | n/a | n/a | 552
Chromosome 16 | CM000330.2 | n/a | n/a | 811
Chromosome 17 | CM000331.2 | n/a | n/a | 572
Chromosome 18 | CM000332.2 | n/a | n/a | 495
Chromosome 19 | CM000333.2 | n/a | n/a | 428
Chromosome 20 | CM000334.3 | n/a | n/a | 381
Chromosome 21 | BA000046.3 | n/a | n/a | 0
Chromosome 22 | CM000335.2 | n/a | n/a | 223
Chromosome X | CM000336.2 | n/a | n/a | 703
Chromosome Y | DP000054.1 | n/a | n/a | 3
unplaced | n/a | n/a | n/a | 5,541

Assembly statistics

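Molecule | Sequence Role | Total Length | Scaffold Count | Ungapped Length | Scaffold N50 | Spanned Gaps | Unspanned Gaps
All | Assembled molecule | 3,307,943,878 | 26,996 | 2,900,544,493 | 9,141,361 | 156,909 | 2,927
Chromosome 1 | All | 237,734,144 | 1,794 | 225,089,433 | 6,423,688 | 12,180 | 242
Chromosome 1 | Assembled molecule | 228,333,871 | 240 | 216,475,348 | 6,937,076 | 11,233 | 242
Chromosome 1 | Unlocalized scaffolds | 9,400,273 | 1,554 | 8,614,085 | 15,953 | 947 | 0
Chromosome 2A | All | 116,634,913 | 836 | 108,191,507 | 10,151,232 | 5,008 | 144
Chromosome 2A | Assembled molecule | 113,622,374 | 143 | 105,406,716 | 10,151,232 | 4,766 | 144
Chromosome 2A | Unlocalized scaffolds | 3,012,539 | 693 | 2,784,791 | 8,889 | 242 | 0
Chromosome 2B | All | 249,660,250 | 743 | 129,207,671 | 17,110,498 | 5,299 | 60
Chromosome 2B | Assembled molecule | 247,518,478 | 59 | 127,265,818 | 17,110,498 | 5,124 | 60
Chromosome 2B | Unlocalized scaffolds | 2,141,772 | 684 | 1,941,853 | 3,712 | 175 | 0
Chromosome 3 | All | 205,796,835 | 1,215 | 197,140,879 | 17,615,695 | 8,262 | 62
Chromosome 3 | Assembled molecule | 202,329,955 | 59 | 193,960,211 | 17,615,695 | 7,994 | 62
Chromosome 3 | Unlocalized scaffolds | 3,466,880 | 1,156 | 3,180,668 | 3,119 | 268 | 0
Chromosome 4 | All | 200,134,428 | 1,334 | 192,189,618 | 13,863,954 | 7,801 | 75
Chromosome 4 | Assembled molecule | 193,495,092 | 74 | 186,186,128 | 13,863,954 | 7,095 | 75
Chromosome 4 | Unlocalized scaffolds | 6,639,336 | 1,260 | 6,003,490 | 12,070 | 706 | 0
Chromosome 5 | All | 185,762,886 | 1,138 | 177,316,640 | 9,899,125 | 7,336 | 90
Chromosome 5 | Assembled molecule | 182,651,097 | 89 | 174,479,326 | 10,833,738 | 7,101 | 90
Chromosome 5 | Unlocalized scaffolds | 3,111,789 | 1,049 | 2,837,314 | 3,226 | 235 | 0
Chromosome 6 | All | 181,908,250 | 1,406 | 172,745,967 | 23,248,242 | 7,530 | 82
Chromosome 6 | Assembled molecule | 172,623,881 | 80 | 163,985,095 | 23,248,242 | 6,904 | 82
Chromosome 6 | Unlocalized scaffolds | 9,284,369 | 1,326 | 8,760,872 | 47,816 | 626 | 0
Chromosome 7 | All | 166,880,357 | 538 | 162,745,289 | 18,865,560 | 1,013 | 34
Chromosome 7 | Assembled molecule | 161,824,586 | 33 | 158,137,026 | 18,865,560 | 425 | 34
Chromosome 7 | Unlocalized scaffolds | 5,055,771 | 505 | 4,608,263 | 24,446 | 588 | 0
Chromosome 8 | All | 151,216,614 | 1,112 | 144,357,478 | 12,795,835 | 6,535 | 82
Chromosome 8 | Assembled molecule | 143,986,469 | 81 | 137,484,301 | 12,795,835 | 6,094 | 82
Chromosome 8 | Unlocalized scaffolds | 7,230,145 | 1,031 | 6,873,177 | 3,851,136 | 441 | 0
Chromosome 9 | All | 145,513,354 | 1,214 | 116,058,603 | 8,824,750 | 6,334 | 248
Chromosome 9 | Assembled molecule | 137,840,987 | 246 | 109,044,697 | 8,824,750 | 5,540 | 248
Chromosome 9 | Unlocalized scaffolds | 7,672,367 | 968 | 7,013,906 | 20,408 | 794 | 0
Chromosome 10 | All | 141,864,825 | 1,605 | 132,161,043 | 9,658,195 | 6,954 | 123
Chromosome 10 | Assembled molecule | 133,524,379 | 120 | 124,713,500 | 9,809,795 | 6,095 | 123
Chromosome 10 | Unlocalized scaffolds | 8,340,446 | 1,485 | 7,447,543 | 10,614 | 859 | 0
Chromosome 11 | All | 141,445,470 | 900 | 130,961,106 | 16,830,314 | 6,589 | 76
Chromosome 11 | Assembled molecule | 133,121,534 | 75 | 123,055,417 | 16,830,314 | 6,024 | 76
Chromosome 11 | Unlocalized scaffolds | 8,323,936 | 825 | 7,905,689 | 5,442,033 | 565 | 0
Chromosome 12 | All | 136,466,624 | 853 | 131,273,554 | 13,767,385 | 6,897 | 69
Chromosome 12 | Assembled molecule | 134,246,214 | 68 | 129,256,432 | 13,767,385 | 6,735 | 69
Chromosome 12 | Unlocalized scaffolds | 2,220,410 | 785 | 2,017,122 | 2,911 | 162 | 0
Chromosome 13 | All | 124,261,224 | 590 | 96,228,948 | 7,813,192 | 3,708 | 54
Chromosome 13 | Assembled molecule | 115,123,233 | 51 | 87,360,249 | 8,163,087 | 3,293 | 54
Chromosome 13 | Unlocalized scaffolds | 9,137,991 | 539 | 8,868,699 | 7,510,047 | 415 | 0
Chromosome 14 | All | 108,632,646 | 589 | 87,717,900 | 11,377,057 | 4,332 | 54
Chromosome 14 | Assembled molecule | 106,544,938 | 52 | 85,803,854 | 12,406,960 | 4,177 | 54
Chromosome 14 | Unlocalized scaffolds | 2,087,708 | 537 | 1,914,046 | 5,190 | 155 | 0
Chromosome 15 | All | 102,709,329 | 640 | 79,681,710 | 7,352,500 | 4,052 | 90
Chromosome 15 | Assembled molecule | 99,548,318 | 88 | 76,715,774 | 7,352,500 | 3,818 | 90
Chromosome 15 | Unlocalized scaffolds | 3,161,011 | 552 | 2,965,936 | 15,618 | 234 | 0
Chromosome 16 | All | 96,299,603 | 1,070 | 79,871,507 | 2,982,393 | 5,748 | 261
Chromosome 16 | Assembled molecule | 89,983,829 | 259 | 74,138,897 | 3,074,224 | 4,998 | 261
Chromosome 16 | Unlocalized scaffolds | 6,315,774 | 811 | 5,732,610 | 19,189 | 750 | 0
Chromosome 17 | All | 87,656,180 | 709 | 77,877,683 | 4,894,925 | 5,847 | 138
Chromosome 17 | Assembled molecule | 82,630,442 | 137 | 73,160,684 | 4,894,925 | 5,431 | 138
Chromosome 17 | Unlocalized scaffolds | 5,025,738 | 572 | 4,716,999 | 55,181 | 416 | 0
Chromosome 18 | All | 78,479,326 | 522 | 75,477,812 | 10,953,493 | 3,030 | 28
Chromosome 18 | Assembled molecule | 76,611,499 | 27 | 73,774,851 | 10,953,493 | 2,882 | 28
Chromosome 18 | Unlocalized scaffolds | 1,867,827 | 495 | 1,702,961 | 4,916 | 148 | 0
Chromosome 19 | All | 66,022,826 | 572 | 53,860,897 | 2,728,897 | 5,992 | 147
Chromosome 19 | Assembled molecule | 63,644,993 | 144 | 51,805,717 | 2,888,103 | 5,685 | 147
Chromosome 19 | Unlocalized scaffolds | 2,377,833 | 428 | 2,055,180 | 9,803 | 307 | 0
Chromosome 20 | All | 63,612,001 | 425 | 59,620,305 | 10,027,820 | 3,574 | 45
Chromosome 20 | Assembled molecule | 61,729,293 | 44 | 57,880,831 | 10,027,820 | 3,441 | 45
Chromosome 20 | Unlocalized scaffolds | 1,882,708 | 381 | 1,739,474 | 11,929 | 133 | 0
Chromosome 21 | Assembled molecule | 32,799,110 | 3 | 32,706,069 | 27,632,627 | 20 | 2
Chromosome 22 | All | 50,904,735 | 329 | 33,251,711 | 2,798,595 | 2,816 | 109
Chromosome 22 | Assembled molecule | 49,737,984 | 106 | 32,184,557 | 2,798,595 | 2,720 | 109
Chromosome 22 | Unlocalized scaffolds | 1,166,751 | 223 | 1,067,154 | 14,387 | 96 | 0
Chromosome X | All | 160,356,486 | 1,310 | 138,723,601 | 1,257,306 | 21,259 | 608
Chromosome X | Assembled molecule | 156,848,144 | 607 | 135,947,783 | 1,341,353 | 20,607 | 608
Chromosome X | Unlocalized scaffolds | 3,508,342 | 703 | 2,775,818 | 6,725 | 652 | 0
Chromosome Y | All | 24,725,381 | 8 | 23,463,209 | 4,037,060 | 54 | 4
Chromosome Y | Assembled molecule | 23,952,694 | 5 | 22,691,222 | 4,037,060 | 47 | 4
Chromosome Y | Unlocalized scaffolds | 772,687 | 3 | 771,987 | 276,070 | 7 | 0
unplaced | Assembled molecule | 50,466,081 | 5,541 | 42,624,353 | 14,991 | 8,739 | 0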
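
For chromosomes that report three sequence roles, the "All" total length appears to be the sum of the "Assembled molecule" and "Unlocalized scaffolds" lengths. The sketch below checks that relationship for three rows copied from the statistics above. The dictionary layout and the helper `is_consistent` are illustrative assumptions, not an NCBI data format.

```python
# Total Length values for three chromosomes, copied from the table above.
total_length_by_role = {
    "Chromosome 1": {
        "all": 237_734_144,
        "assembled": 228_333_871,
        "unlocalized": 9_400_273,
    },
    "Chromosome 7": {
        "all": 166_880_357,
        "assembled": 161_824_586,
        "unlocalized": 5_055_771,
    },
    "Chromosome Y": {
        "all": 24_725_381,
        "assembled": 23_952_694,
        "unlocalized": 772_687,
    },
}


def is_consistent(row):
    """True when 'all' equals assembled plus unlocalized length."""
    return row["all"] == row["assembled"] + row["unlocalized"]


for name, row in total_length_by_role.items():
    print(name, "consistent" if is_consistent(row) else "MISMATCH")
```

The same check can be applied to the scaffold count, ungapped length, and spanned gap columns when validating a parsed copy of the table.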