U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



AgamP3

Organism name:
Anopheles gambiae str. PEST (African malaria mosquito)
Infraspecific name:
Strain: PEST
BioSample:
SAMN02952903
BioProject:
PRJNA1438
Submitter:
The International Consortium for the Sequencing of Anopheles Genome
Date:
2006/10/16
Assembly level:
Chromosome
Genome representation:
full
Excluded from RefSeq:
  • superseded by newer assembly for species
GenBank assembly accession:
GCA_000005575.1 (latest)
RefSeq assembly accession:
GCF_000005575.2 (suppressed) see latest RefSeq assembly for this species
RefSeq assembly and GenBank assembly identical:
no (hide details)
  • Only in RefSeq: chromosome MT
  • Data displayed for GenBank version
WGS Project:
AAAB01

IDs: 305108 [UID] 304838 [GenBank] 305108 [RefSeq]

See Genome Information for Anopheles gambiae

There are 7 assemblies for this organism

See more

History (Show revision history)

Comment

Accessions AAAB01000001-AAAB01008987 constitute the genome assembly.
Some scaffolds are now marked in the Definition line as probably representing either contaminating bacterial DNA or alternative assemblies of a region also present in another larger scaffold; they have no annotation of gene ... features.
Accessions AAB01008988-AAAB01069724 are additional short contigs that we were unwilling to designate as unique in the genome and are unable to place within a larger scaffold in the assembly process. This happens because they may be repetitive, and we are very cautious about placing them incorrectly; and/or because they have very few Celera mate pairs to allow us to place them with confidence.
Strings of n's in a record represent gaps between contigs within a scaffold, and the length of each string corresponds to the approximate length of the gap.
VectorBase (www.vectorbase.org), a NIAID Bioinformatics Resource Center, is responsible for the assembled genome sequence and its annotation. For more information, contact info@vectorbase.org.

Annotation was updated on the contigs in May 2011.  more

Global statistics

Total sequence length265,011,681
Total ungapped length252,438,733
Gaps between scaffolds55
Number of scaffolds8,144
Scaffold N5012,309,988
Scaffold L509
Number of contigs16,824
Contig N5085,548
Contig L50696
Total number of chromosomes and plasmids6
Number of component sequences (WGS or clone)8,164

Supplemental Content

PubMed articles for this assembly

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
Assembly Unit: Primary Assembly (GCA_000005585.1)
Molecule nameGenBank sequenceRefSeq sequenceUnlocalized
sequences count
Chromosome XCM000360.1=NC_004818.20
Chromosome 2LCM000356.1=NT_078265.20
Chromosome 2RCM000357.1=NT_078266.20
Chromosome 3LCM000358.1=NT_078267.50
Chromosome 3RCM000359.1=NT_078268.40
Chromosome Yn/an/an/a55
unplacedn/an/an/a8,029

Assembly statistics

MoleculeSequence RoleTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
AllAssembled molecule265,011,6818,144252,438,73312,309,9888,68055
Chromosome XAssembled molecule24,393,1081323,385,3493,715,0791,27512
Chromosome 2LAssembled molecule49,364,3251248,525,74712,309,98894611
Chromosome 2RAssembled molecule61,545,1051660,132,45310,831,4511,64315
Chromosome 3LAssembled molecule41,963,4351340,758,47312,698,2471,26012
Chromosome 3RAssembled molecule53,200,684652,226,56818,711,4321,1235
Chromosome YAllAssembled moleculeUnlocalized scaffolds183,0450183,04555055135,1550135,15511,068011,068909000
unplacedAssembled molecule34,361,9798,02927,274,98812,7122,4240