X_maculatus-5.0-male

Organism name:
Xiphophorus maculatus (southern platyfish)
Infraspecific name:
Strain: JP 163 A
Sex:
male
BioSample:
SAMN08025980
BioProject:
PRJNA72525
Submitter:
The Genome Institute, Washington University at St. Louis
Date:
2017/11/16
Assembly level:
Scaffold
Genome representation:
full
GenBank assembly accession:
GCA_002775205.1 (replaced)
RefSeq assembly accession:
n/a
RefSeq assembly and GenBank assembly identical:
n/a
WGS Project:
PGSD01
Assembly method:
HGAP4_SMRT_Link v. 5.0.19585
Genome coverage:
83x
Sequencing technology:
PacBio_Sequel

IDs: 1441241 [UID] 5655918 [GenBank]

There are 2 assemblies for this organism

Comment

Xiphophorus maculatus 5.0 assembly.
 The platyfish DNA for single molecule real time (SMRT) sequencing is derived from a single male (Xiphophorus maculatus, strain JP 163 A) from the Xiphophorus Genetic Stock Center (Dr. Ron Walter, Director), Texas State ... University, San Marcos, Texas, USA. X. maculatus JP 163 A are line bred (i.e., brother-sister matings), and this fish is from the 114th generation of line breeding. Sequences were generated on the Pacific Biosciences Sequel instrument (V2 chemistry) to approx. 83x genome coverage based on a genome size estimate of 700 Mb. All SMRT sequences were assembled with the HGAP4 algorithm (SMRT Link v5.0.1.9585), then error corrected using the Arrow error-correction module. Additional polishing of the assembly for residual indels was done by aligning 50x coverage of Illumina data and applying the Pilon algorithm. Scaffolds were generated by alignment to a Bionano map created with the same DNA source using the Irys software. Finally, all scaffolds were ordered and oriented by alignment to the genetic linkage map using Chromonomer (http://catchenlab.life.illinois.edu/chromonomer/.cite).
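 The coverage figures above relate sequenced bases to genome size: coverage is roughly the total bases sequenced divided by the estimated genome size. As a rough sketch, the relation can be inverted to estimate the data volume implied by each figure. The function name is hypothetical, and applying the 700 Mb estimate to the Illumina data is an assumption; the comment states it only for the PacBio coverage.

```python
def bases_for_coverage(coverage, genome_size_bp):
    """Approximate total sequenced bases implied by a fold-coverage figure."""
    return coverage * genome_size_bp


GENOME_SIZE_BP = 700_000_000  # genome size estimate quoted in the comment

# ~83x PacBio Sequel coverage, as stated in the comment
pacbio_bases = bases_for_coverage(83, GENOME_SIZE_BP)

# ~50x Illumina coverage; using the same genome size here is an assumption
illumina_bases = bases_for_coverage(50, GENOME_SIZE_BP)

print(f"PacBio:   ~{pacbio_bases / 1e9:.1f} Gb")
print(f"Illumina: ~{illumina_bases / 1e9:.1f} Gb")
```

 Because the coverage is reported as approximate, these totals are order-of-magnitude estimates rather than exact read yields.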
 Of the 704 Mb assembled genome (X_maculatus-5.0), the contig and scaffold N50 lengths are 9.2 Mb (n=259 contigs) and 31.5 Mb (n=103 scaffolds), respectively. 
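 For readers unfamiliar with the N50 statistic quoted here, the following is a minimal sketch of the usual definition: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total length, and L50 is the number of sequences in that set. The function name and the toy lengths are illustrative only; they are not the actual contig or scaffold lengths of this assembly.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # The first sequence that pushes the running total to half
        # of the assembly length defines N50; its rank is L50.
        if running >= half:
            return length, count
    raise ValueError("lengths must be non-empty")


# Toy example (hypothetical lengths): total is 290, so half is 145.
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50_l50(toy_lengths))  # (70, 2)
```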
 For questions regarding this X_maculatus-5.0 assembly please contact Dr. Wes Warren, McDonnell Genome Institute at Washington University School of Medicine, St. Louis, MO or Dr. Ron Walter, Texas State University, San Marcos, Texas, USA. 
 Data use:
 The X_maculatus-5.0 assembly sequence is made freely available to the community by McDonnell Genome Institute at Washington University School of Medicine, with the following understanding: 1. The data may be freely downloaded, used in analyses, and repackaged in databases. 2. Users are free to use the data in scientific papers analyzing these data if the providers of this data are properly cited. 3. Any redistribution of the data should carry this notice. 
 Xiphophorus maculatus Sequence and Assembly Credits:
 DNA source - Dr. Ron Walter, Texas State University, San Marcos, TX.
 Genome Sequence - The McDonnell Genome Institute, Washington University School of Medicine.
 Sequence Assembly and Chromosomal Sequence Construction - The McDonnell Genome Institute, Washington University School of Medicine
 Platyfish RNAseq data - Dr. Ron Walter, Texas State University, San Marcos, TX. 
 Funding for the sequence characterization of the platyfish genome is being provided by grants to Dr. Wesley Warren, McDonnell Genome Institute, Washington University School of Medicine, and Dr. Ron Walter through the National Institutes of Health (NIH) and Dr. Manfred Schartl at the Universitat Wurzburg, Germany.

 ASSEMBLY STATS:
 SCAFFOLDS
 COUNT 103 
 LENGTH 704,304,639 bp 
 AVG 6,837,909 bp 
 N50 31,535,491 bp
 LARGEST 35,293,739 bp 
 Scaffolds > 1M: 24 ( 699,980,690 bp ) 99.4%
 Scaffolds 250K--1M: 1 ( 269,636 bp ) 0.04%
 Scaffolds 100K--250K: 14 ( 1,854,102 bp ) 0.26%
 Scaffolds 10K--100K: 52 ( 2,150,311 bp ) 0.31%
 Scaffolds 5K--10K: 5 ( 37,383 bp ) 0.005%
 Scaffolds 2K--5K: 3 ( 9,783 bp ) 0.001%
 Scaffolds 0--2K: 4 ( 2,734 bp ) 0.0004%
 CONTIGS
 COUNT 259 
 LENGTH 700,976,734 bp 
 AVG 2,706,473 bp 
 N50 9,181,372 bp
 LARGEST 26,812,185 bp 
 Contigs > 1M: 115 ( 664,204,175 bp ) 94.8%
 Contigs 250K--1M: 54 ( 30,725,410 bp ) 4.4%
 Contigs 100K--250K: 26 ( 3,846,938 bp ) 0.55%
 Contigs 10K--100K: 52 ( 2,150,311 bp ) 0.31%
 Contigs 5K--10K: 5 ( 37,383 bp ) 0.005%
 Contigs 2K--5K: 3 ( 9,783 bp ) 0.001%
 Contigs 0--2K: 4 ( 2,734 bp ) 0.0004%
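 Several of the figures in the statistics block above can be re-derived from the reported totals. The sketch below recomputes the averages, the share of sequence in scaffolds larger than 1M, and the number of gap bases. Two assumptions are made and are not stated in the record: the averages appear to be truncated rather than rounded, and the scaffold length is taken to include gap bases while the contig length does not.

```python
# Totals copied from the ASSEMBLY STATS block
scaffold_total, scaffold_count = 704_304_639, 103
contig_total, contig_count = 700_976_734, 259
scaffolds_over_1m_bp = 699_980_690

# Floor division reproduces the reported AVG values (assumed truncation)
avg_scaffold = scaffold_total // scaffold_count
avg_contig = contig_total // contig_count

# Assumption: scaffold length counts gap bases, contig length does not
gap_bases = scaffold_total - contig_total

# Share of the assembly held in scaffolds larger than 1M
pct_over_1m = 100 * scaffolds_over_1m_bp / scaffold_total

print(f"AVG scaffold: {avg_scaffold:,} bp")
print(f"AVG contig:   {avg_contig:,} bp")
print(f"Gap bases:    {gap_bases:,} bp")
print(f"Scaffolds > 1M: {pct_over_1m:.1f}%")
```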

Global statistics

Total sequence length: 704,304,519
Total ungapped length: 700,976,614
Gaps between scaffolds: 0
Number of scaffolds: 101
Scaffold N50: 31,535,491
Scaffold L50: 11
Number of contigs: 257
Contig N50: 9,181,372
Contig L50: 25
Total number of chromosomes and plasmids: 0
Number of component sequences (WGS or clone): 101

Global assembly definition

Assembly Unit Name
Primary Assembly
The primary assembly unit does not have any assembled chromosomes or linkage groups.
Please download the full sequence report for information on the scaffolds.

Assembly statistics

Molecule   Total Length   Scaffold Count   Ungapped Length   Scaffold N50   Spanned Gaps   Unspanned Gaps
unplaced   704,304,519    101              700,976,614       31,535,491     156            0