NCBI Astyanax mexicanus Annotation Release 102

The RefSeq genome records for Astyanax mexicanus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Astyanax mexicanus Annotation Release 102

Annotation release ID: 102
Date of Entrez queries for transcripts and proteins: Sep 27 2017
Date of submission of annotation to the public databases: Oct 3 2017
Software version: 7.4

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
Astyanax_mexicanus-2.0	GCF_000372685.2	Washington University School of Medince	09-25-2017	Reference	25 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	Astyanax_mexicanus-2.0
Genes and pseudogenes	31,695
protein-coding	25,293
non-coding	5,314
pseudogenes	1,088
genes with variants	8,303
mRNAs	42,649
fully-supported	40,014
with > 5% ab initio	1,061
partial	767
with filled gap(s)	345
known RefSeq (NM_)	29
model RefSeq (XM_)	42,620
Other RNAs	6,665
fully-supported	4,342
with > 5% ab initio	0
partial	1
with filled gap(s)	1
known RefSeq (NR_)	0
model RefSeq (XR_)	4,342
CDSs	42,825
fully-supported	40,014
with > 5% ab initio	1,228
partial	737
with major correction(s)	1,721
known RefSeq (NP_)	29
model RefSeq (XP_)	42,620

Detailed reports

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	30,607	24,387	9,618	67	1,207,083
All transcripts	49,314	3,006	2,425	50	86,493
mRNA	42,649	3,357	2,719	168	86,493
misc_RNA	670	2,570	2,045	198	16,169
tRNA	2,323	75	73	67	93
lncRNA	3,672	864	615	50	6,945
Single-exon transcripts	890	1,964	1,666	279	8,351
coding transcripts (NM_/XM_ )	890	1,964	1,666	279	8,351
CDSs	42,649	2,076	1,512	96	85,164
Exons	288,664	286	138	1	19,261
in coding transcripts (NM_/XM_ )	277,482	286	138	1	19,261
in non-coding transcripts (NR_/XR_ )	15,502	262	130	2	6,528
Introns	258,229	2,978	940	26	1,198,388
in coding transcripts (NM_/XM_ )	250,689	2,956	942	26	1,198,388
in non-coding transcripts (NR_/XR_ )	11,790	3,473	855	30	1,198,388

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	1.67	1	1	50
Number of exons per transcript	12.17	9	1	173

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 25117 coding genes, 22887 genes had a protein with an alignment covering 50% or more of the query and 11238 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly. RepeatMasker results are only used for organisms for which a comprehensive repeat library is available.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with RepeatMasker	% Masked with WindowMasker
Astyanax_mexicanus-2.0	GCF_000372685.2	6.66%	40.99%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Depending on the other evidence available, long 454 reads (with average length above 250 nt) may be aligned as traditional evidence and reported in the Transcript alignments section or aligned with RNA-Seq reads and reported in the RNA-Seq alignments section.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	29	29 (100.00%)	28 (96.55%)	98.85%	98.39%
Same-species Genbank	98	95 (96.94%)	88 (89.80%)	99.25%	98.53%
Same-species EST	189,864	176,335 (92.87%)	162,939 (85.82%)	99.26%	99.05%

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Hide alignments statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	4,192,463,364	76%	24%	306,463
SAMEA4503838	NA	whole organism (Astyanax mexicanus, SAMEA4503838)	397,584	58%	52%	37,354
SAMEA4503839	NA	whole organism (Astyanax mexicanus, SAMEA4503839)	414,428	55%	56%	37,711
SAMEA4503841	NA	whole organism (Astyanax mexicanus, SAMEA4503841)	255,618,746	84%	22%	255,842
SAMEA4503842	NA	whole organism (Astyanax mexicanus, SAMEA4503842)	235,176,886	84%	21%	256,781
SAMEA4503843	NA	brain (Astyanax mexicanus, SAMEA4503843)	105,921,364	82%	14%	214,960
SAMEA4503844	NA	olfactory epithelium (Astyanax mexicanus, SAMEA4503844)	125,673,620	83%	18%	228,543
SAMEA4503845	NA	brain (Astyanax mexicanus, SAMEA4503845)	105,726,390	81%	14%	216,488
SAMEA4503846	NA	olfactory epithelium (Astyanax mexicanus, SAMEA4503846)	102,044,592	81%	19%	222,515
SAMEA4503847	NA	brain (Astyanax mexicanus, SAMEA4503847)	108,408,414	79%	16%	216,823
SAMEA4503848	NA	olfactory epithelium (Astyanax mexicanus, SAMEA4503848)	101,522,038	84%	17%	223,052
SAMEA4503849	NA	brain (Astyanax mexicanus, SAMEA4503849)	106,176,338	80%	15%	215,354
SAMEA4503850	NA	olfactory epithelium (Astyanax mexicanus, SAMEA4503850)	117,190,546	83%	19%	216,094
SAMEA4503851	NA	whole organism (Astyanax mexicanus, SAMEA4503851)	570,828	69%	45%	55,169
SAMEA4503852	NA	whole organism (Astyanax mexicanus, SAMEA4503852)	541,816	69%	45%	54,092
SAMN01819972	NA	whole adult surface dwelling fish (Astyanax mexicanus, SAMN01819972)	695,199	20%	71%	85,664
SAMN01819973	NA	whole adult Pachon cavefish (Astyanax mexicanus, SAMN01819973)	804,842	22%	73%	92,383
SAMN01915377	NA	Kidney (Astyanax mexicanus, SAMN01915377)	104,850,378	62%	26%	199,795
SAMN01915378	NA	Liver (Astyanax mexicanus, SAMN01915378)	88,786,178	70%	36%	147,347
SAMN01915413	NA	Muscle (Astyanax mexicanus, SAMN01915413)	97,264,000	45%	26%	95,634
SAMN01915414	NA	Heart (Astyanax mexicanus, SAMN01915414)	86,041,602	56%	21%	177,919
SAMN01915445	NA	Nasal (Astyanax mexicanus, SAMN01915445)	98,745,702	66%	20%	212,460
SAMN01915446	NA	Skin (Astyanax mexicanus, SAMN01915446)	89,862,892	69%	24%	200,600
SAMN01915487	NA	Eyes-Surface (Astyanax mexicanus, SAMN01915487)	81,398,298	69%	20%	211,203
SAMN01915488	NA	Brain (Astyanax mexicanus, SAMN01915488)	88,020,994	64%	13%	201,686
SAMN02998081	NA	Whole embryo of 50 individuals, Surface-dwelling fish (Astyanax mexicanus, 10 hpf, SAMN02998081)	42,705,678	66%	11%	144,508
SAMN02998082	NA	Whole embryo of 50 individuals, Surface-dwelling fish (Astyanax mexicanus, 24 hpf, SAMN02998082)	54,662,960	70%	11%	173,659
SAMN03000816	NA	Whole embryo of 50 individuals, Surface-dwelling fish (Astyanax mexicanus, 36 hpf, SAMN03000816)	73,934,551	73%	12%	190,425
SAMN03000817	NA	Whole embryo of 50 individuals, Surface-dwelling fish (Astyanax mexicanus, 72 hpf, SAMN03000817)	52,824,622	72%	12%	185,302
SAMN03001823	NA	Whole embryo of 50 individuals, Cave-dwelling fish (Astyanax mexicanus, 10 hpf, SAMN03001823)	53,010,135	74%	11%	172,535
SAMN03001824	NA	Whole embryo of 50 individuals, Cave-dwelling fish (Astyanax mexicanus, 24 hpf, SAMN03001824)	53,215,543	70%	11%	175,052
SAMN03001825	NA	Whole embryo of 50 individuals, Pachon cave-dwelling fish (Astyanax mexicanus, 36 hpf, SAMN03001825)	54,688,429	70%	11%	178,276
SAMN03001826	NA	Whole embryo of 50 individuals, Pachon cave-dwelling fish (Astyanax mexicanus, 72 hpf, SAMN03001826)	52,753,605	69%	12%	178,022
SAMN03742265	27189481	Ovary (Astyanax mexicanus, unknown, female, SAMN03742265)	122,377,718	76%	30%	241,169
SAMN03742266	27189481	Brain (Astyanax mexicanus, unknown, female, SAMN03742266)	69,534,372	79%	22%	226,245
SAMN03742267	27189481	Gills (Astyanax mexicanus, unknown, female, SAMN03742267)	52,845,998	79%	30%	215,283
SAMN03742268	27189481	Heart (Astyanax mexicanus, unknown, female, SAMN03742268)	60,601,246	72%	32%	185,673
SAMN03742269	27189481	Muscle (Astyanax mexicanus, unknown, female, SAMN03742269)	56,192,344	84%	41%	175,068
SAMN03742270	27189481	testis (Astyanax mexicanus, unknown, female, SAMN03742270)	138,997,002	81%	38%	255,899
SAMN03742271	27189481	Head kidney (Astyanax mexicanus, unknown, female, SAMN03742271)	61,404,528	71%	29%	204,246
SAMN03742272	27189481	Bones (Astyanax mexicanus, unknown, female, SAMN03742272)	59,214,408	81%	35%	221,616
SAMN03742273	27189481	Intestine (Astyanax mexicanus, unknown, female, SAMN03742273)	90,449,228	79%	33%	200,463
SAMN03742274	27189481	Testis (Astyanax mexicanus, unknown, male, SAMN03742274)	59,352,402	81%	31%	246,155
SAMN03742275	27189481	Embryos (Astyanax mexicanus, 12-15 hpf, male and female, SAMN03742275)	86,749,058	80%	27%	224,200
SAMN03742294	27189481	Ovary (Astyanax mexicanus, unknown, female, SAMN03742294)	77,294,494	84%	36%	180,286
SAMN03742295	27189481	Brain (Astyanax mexicanus, unknown, female, SAMN03742295)	62,189,906	78%	24%	230,803
SAMN03742296	27189481	Gills (Astyanax mexicanus, unknown, female, SAMN03742296)	66,526,566	78%	30%	216,841
SAMN03742297	27189481	Heart (Astyanax mexicanus, unknown, female, SAMN03742297)	67,203,152	65%	33%	180,997
SAMN03742298	27189481	Muscle (Astyanax mexicanus, unknown, female, SAMN03742298)	70,717,868	83%	42%	168,299
SAMN03742299	27189481	Liver (Astyanax mexicanus, unknown, female, SAMN03742299)	65,787,382	79%	35%	158,147
SAMN03742300	27189481	Head kidney (Astyanax mexicanus, unknown, female, SAMN03742300)	63,095,800	71%	33%	187,874
SAMN03742301	27189481	Bones (Astyanax mexicanus, unknown, female, SAMN03742301)	49,045,198	80%	33%	216,927
SAMN03742302	27189481	Intestine (Astyanax mexicanus, unknown, female, SAMN03742302)	147,777,610	78%	33%	207,291
SAMN03742303	27189481	Testis (Astyanax mexicanus, unknown, male, SAMN03742303)	61,654,344	79%	28%	232,788
SAMN03742304	27189481	Embryos (Astyanax mexicanus, 13-14 hpf, male and female, SAMN03742304)	63,803,542	77%	27%	211,935

Show alignments statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
ERR1679839	ERX1749509	ERP017705	SAMEA4503838	397,584	58%	52%
ERR1679840	ERX1749510	ERP017705	SAMEA4503839	414,428	55%	56%
ERR1679844	ERX1749512	ERP017705	SAMEA4503841	255,618,746	84%	22%
ERR1679845	ERX1749513	ERP017705	SAMEA4503842	235,176,886	84%	21%
ERR1679846	ERX1749514	ERP017705	SAMEA4503843	105,921,364	82%	14%
ERR1679847	ERX1749515	ERP017705	SAMEA4503844	125,673,620	83%	18%
ERR1679848	ERX1749516	ERP017705	SAMEA4503845	105,726,390	81%	14%
ERR1679849	ERX1749517	ERP017705	SAMEA4503846	102,044,592	81%	19%
ERR1679850	ERX1749518	ERP017705	SAMEA4503847	108,408,414	79%	16%
ERR1679851	ERX1749519	ERP017705	SAMEA4503848	101,522,038	84%	17%
ERR1679852	ERX1749520	ERP017705	SAMEA4503849	106,176,338	80%	15%
ERR1679853	ERX1749521	ERP017705	SAMEA4503850	117,190,546	83%	19%
ERR1679841	ERX1749522	ERP017705	SAMEA4503851	570,828	69%	45%
ERR1679842	ERX1749523	ERP017705	SAMEA4503852	541,816	69%	45%
SRR639083	SRX212200	SRP017488	SAMN01819972	695,199	20%	71%
SRR639085	SRX212201	SRP017488	SAMN01819973	804,842	22%	73%
SRR693223	SRX230378	SRP018725	SAMN01915377	104,850,378	62%	26%
SRR688577	SRX229524	SRP018725	SAMN01915378	88,786,178	70%	36%
SRR689641	SRX229782	SRP018725	SAMN01915413	97,264,000	45%	26%
SRR695563	SRX230380	SRP018725	SAMN01915414	86,041,602	56%	21%
SRR689670	SRX229783	SRP018725	SAMN01915445	98,745,702	66%	20%
SRR690472	SRX229909	SRP018725	SAMN01915446	89,862,892	69%	24%
SRR689583	SRX229523	SRP018725	SAMN01915487	81,398,298	69%	20%
SRR692924	SRX230379	SRP018725	SAMN01915488	88,020,994	64%	13%
SRR1555606	SRX684707	SRP045680	SAMN02998081	18,653,029	77%	11%
SRR1556144	SRX685276	SRP045680	SAMN02998081	13,073,088	44%	11%
SRR1556145	SRX685277	SRP045680	SAMN02998081	10,979,561	72%	10%
SRR1556193	SRX685278	SRP045680	SAMN02998082	20,816,193	62%	10%
SRR1556198	SRX685284	SRP045680	SAMN02998082	20,900,695	77%	12%
SRR1556201	SRX685287	SRP045680	SAMN02998082	12,946,072	69%	11%
SRR1556204	SRX685288	SRP045680	SAMN03000816	22,573,994	74%	11%
SRR1556205	SRX685289	SRP045680	SAMN03000816	30,997,846	69%	12%
SRR1556258	SRX685342	SRP045680	SAMN03000816	20,362,711	78%	12%
SRR1556273	SRX685357	SRP045680	SAMN03000817	19,786,688	69%	11%
SRR1556275	SRX685358	SRP045680	SAMN03000817	19,161,420	76%	12%
SRR1556277	SRX685359	SRP045680	SAMN03000817	13,876,514	70%	12%
SRR1556298	SRX685375	SRP045680	SAMN03001823	21,263,956	77%	11%
SRR1556299	SRX685380	SRP045680	SAMN03001823	18,067,412	75%	11%
SRR1556300	SRX685381	SRP045680	SAMN03001823	13,678,767	68%	10%
SRR1556301	SRX685382	SRP045680	SAMN03001824	24,177,332	65%	10%
SRR1556303	SRX685383	SRP045680	SAMN03001824	15,936,475	78%	12%
SRR1556304	SRX685384	SRP045680	SAMN03001824	13,101,736	69%	11%
SRR1556305	SRX685385	SRP045680	SAMN03001825	23,370,191	65%	10%
SRR1556306	SRX685386	SRP045680	SAMN03001825	19,758,880	77%	12%
SRR1556307	SRX685387	SRP045680	SAMN03001825	11,559,358	70%	11%
SRR1556308	SRX685393	SRP045680	SAMN03001826	20,840,247	64%	11%
SRR1556309	SRX685397	SRP045680	SAMN03001826	16,154,645	75%	12%
SRR1556316	SRX685401	SRP045680	SAMN03001826	15,758,713	70%	12%
SRR2045404	SRX1043993	SRP058863	SAMN03742265	122,377,718	76%	30%
SRR2045405	SRX1043994	SRP058863	SAMN03742266	69,534,372	79%	22%
SRR2045406	SRX1043995	SRP058863	SAMN03742267	52,845,998	79%	30%
SRR2045407	SRX1043996	SRP058863	SAMN03742268	60,601,246	72%	32%
SRR2045408	SRX1043997	SRP058863	SAMN03742269	56,192,344	84%	41%
SRR2045409	SRX1043998	SRP058863	SAMN03742270	138,997,002	81%	38%
SRR2045410	SRX1043999	SRP058863	SAMN03742271	61,404,528	71%	29%
SRR2045411	SRX1044000	SRP058863	SAMN03742272	59,214,408	81%	35%
SRR2045412	SRX1044001	SRP058863	SAMN03742273	90,449,228	79%	33%
SRR2045413	SRX1044002	SRP058863	SAMN03742274	59,352,402	81%	31%
SRR2045414	SRX1044003	SRP058863	SAMN03742275	86,749,058	80%	27%
SRR2045426	SRX1044017	SRP058866	SAMN03742294	77,294,494	84%	36%
SRR2045427	SRX1044018	SRP058866	SAMN03742295	62,189,906	78%	24%
SRR2045428	SRX1044019	SRP058866	SAMN03742296	66,526,566	78%	30%
SRR2045429	SRX1044020	SRP058866	SAMN03742297	67,203,152	65%	33%
SRR2045430	SRX1044021	SRP058866	SAMN03742298	70,717,868	83%	42%
SRR2045431	SRX1044022	SRP058866	SAMN03742299	65,787,382	79%	35%
SRR2045432	SRX1044023	SRP058866	SAMN03742300	63,095,800	71%	33%
SRR2045433	SRX1044024	SRP058866	SAMN03742301	49,045,198	80%	33%
SRR2045434	SRX1044025	SRP058866	SAMN03742302	147,777,610	78%	33%
SRR2045435	SRX1044026	SRP058866	SAMN03742303	61,654,344	79%	28%
SRR2045436	SRX1044027	SRP058866	SAMN03742304	63,803,542	77%	27%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Pygocentrus nattereri high-quality model RefSeq (XP_)	17,844	17,665 (99.00%)	17,665 (99.00%)	71.53%	81.38%
Actinopterygii GenBank	78,488	74,862 (95.38%)	74,862 (95.38%)	69.13%	80.02%
Actinopterygii known RefSeq (NP_)	24,733	23,955 (96.85%)	23,955 (96.85%)	68.63%	79.07%
Danio rerio high-quality model RefSeq (XP_)	8,055	7,901 (98.09%)	7,901 (98.09%)	66.08%	75.67%
Ictalurus punctatus high-quality model RefSeq (XP_)	16,311	16,115 (98.80%)	16,115 (98.80%)	70.07%	79.60%
Homo sapiens known RefSeq (NP_)	50,022	42,897 (85.76%)	42,897 (85.76%)	64.50%	67.75%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

First Pass	Total
Astyanax_mexicanus-2.0 (Current) Coverage: 68.36%	Astyanax_mexicanus-2.0 (Current) Coverage: 70.15%
Astyanax_mexicanus-1.0.2 (Previous) Coverage: 93.96%	Astyanax_mexicanus-1.0.2 (Previous) Coverage: 94.09%
Percent Identity: 97.79%	Percent Identity: 97.66%

Comparison of the current and previous annotations

The annotation produced for this release (102) was compared to the annotation in the previous release (101) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	Astyanax_mexicanus-2.0 (Current) to Astyanax_mexicanus-1.0.2 (Previous)
Identical	1%
Minor changes	54%
Major changes	16%
New	27%
Deprecated	9%
Other	2%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 2:134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20

RefSeq

Integrated reference sequences