NCBI Xenopus tropicalis Annotation Release 103

The RefSeq genome records for Xenopus tropicalis were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Xenopus tropicalis Annotation Release 103

Annotation release ID: 103
Date of Entrez queries for transcripts and proteins: Aug 31 2016
Date of submission of annotation to the public databases: Sep 13 2016
Software version: 7.1

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
Xenopus_tropicalis_v9.1	GCF_000004195.3	DOE Joint Genome Institute	07-13-2016	Reference	11 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	Xenopus_tropicalis_v9.1
Genes and pseudogenes	26,881
protein-coding	21,258
non-coding	5,243
pseudogenes	380
genes with variants	8,032
mRNAs	39,662
fully-supported	36,773
with > 5% ab initio	1,609
partial	1,872
with filled gap(s)	168
known RefSeq (NM_)	8,399
model RefSeq (XM_)	31,263
Other RNAs	6,651
fully-supported	3,925
with > 5% ab initio	0
partial	16
with filled gap(s)	14
known RefSeq (NR_)	190
model RefSeq (XR_)	3,738
CDSs	39,708
fully-supported	36,773
with > 5% ab initio	1,781
partial	1,477
with major correction(s)	894
known RefSeq (NP_)	8,386
model RefSeq (XP_)	31,263

Detailed reports

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	26,501	25,520	10,147	60	1,289,076
All transcripts	46,313	2,885	2,277	17	27,487
mRNA	39,662	3,217	2,550	138	27,487
misc_RNA	784	3,105	2,455	114	19,287
miRNA	199	22	22	17	26
tRNA	2,701	74	73	70	88
lncRNA	2,963	1,140	801	82	11,181
snRNA	1	107	107	107	107
rRNA	3	102	121	65	121
Single-exon transcripts	1,569	1,681	1,195	242	19,131
coding transcripts (NM_/XM_ )	1,567	1,680	1,194	242	19,131
non-coding transcripts (NR_/XR_ )	2	2,204	2,930	1,477	2,930
CDSs	39,662	1,917	1,389	96	26,400
Exons	237,550	300	138	1	19,131
in coding transcripts (NM_/XM_ )	228,165	297	137	1	19,131
in non-coding transcripts (NR_/XR_ )	14,305	318	139	2	14,946
Introns	206,947	3,491	1,118	25	1,068,981
in coding transcripts (NM_/XM_ )	200,506	3,463	1,119	25	1,068,981
in non-coding transcripts (NR_/XR_ )	11,220	3,788	1,037	28	219,918

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	1.84	1	1	50
Number of exons per transcript	11.47	8	1	147

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 21186 coding genes, 19781 genes had a protein with an alignment covering 50% or more of the query and 12160 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly. RepeatMasker results are only used for organisms for which a comprehensive repeat library is available.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with RepeatMasker	% Masked with WindowMasker
Xenopus_tropicalis_v9.1	GCF_000004195.3	32.69%	36.76%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Depending on the other evidence available, long 454 reads (with average length above 250 nt) may be aligned as traditional evidence and reported in the Transcript alignments section or aligned with RNA-Seq reads and reported in the RNA-Seq alignments section.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	8,806	8,672 (98.48%)	8,319 (94.47%)	99.23%	97.74%
Same-species Genbank	17,826	17,481 (98.06%)	1,971 (11.06%)	99.11%	96.97%
Same-species EST	1,271,465	1,160,511 (91.27%)	1,077,310 (84.73%)	98.08%	97.79%
Xenopus laevis EST	693,929	274,272 (39.52%)	188,058 (27.10%)	91.08%	96.97%

RefSeq transcript alignment quality report

The known RefSeq transcripts (NM_ and NR_ accessions) are a set of hiqh-quality transcripts maintained by the RefSeq group at NCBI. Alignment statistics for this group of transcripts, such as percent and number of sequences not aligning at all, percent best alignments split between multiple scaffolds, and percent alignments not covering the full CDS are indicative of the genome quality and are provided below.

	Xenopus_tropicalis_v9.1 Primary Assembly
Number of sequences retrieved from Entrez	8,806
Number (%) of sequences not aligning	134 (1.52%)
Number (%) of sequences with multiple best alignments (split genes)	139 (1.60%)
Number (%) of sequences with CDS coverage < 95%	682 (8.04%)

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Hide alignments statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Track name	Number of reads	Percent aligned reads	Percent spliced reads	Number of introns
All	Aggregate of all aligned samples	6,501,975,442	79%	17%	243,281
SAMEA1025683	early neurula, anterior half, embryo, abnormal eye development (Xenopus (Silurana) tropicalis, SAMEA1025683)	10,917,912	83%	17%	112,055
SAMEA1025684	early neurula, anterior half, embryo, abnormal eye development (Xenopus (Silurana) tropicalis, SAMEA1025684)	10,594,686	83%	17%	112,438
SAMEA1025685	early neurula, anterior half, embryo, wildtype (Xenopus (Silurana) tropicalis, SAMEA1025685)	3,486	84%	17%	579
SAMEA1025686	early neurula, anterior half, embryo, wildtype (Xenopus (Silurana) tropicalis, SAMEA1025686)	11,174,204	83%	17%	109,881
SAMEA1025687	early neurula, anterior half, embryo, wildtype (Xenopus (Silurana) tropicalis, SAMEA1025687)	11,329,040	72%	15%	110,096
SAMEA1025688	early neurula, anterior half, embryo, abnormal eye development (Xenopus (Silurana) tropicalis, SAMEA1025688)	13,327,780	84%	17%	116,972
SAMEA1696413	anterior neural plate (Xenopus tropicalis, SAMEA1696413)	30,752,654	78%	10%	123,925
SAMEA1696414	anterior neural plate (Xenopus tropicalis, SAMEA1696414)	35,076,528	78%	9%	119,323
SAMEA1696415	anterior neural plate (Xenopus tropicalis, SAMEA1696415)	32,742,590	79%	10%	129,861
SAMEA1696416	anterior neural plate (Xenopus tropicalis, SAMEA1696416)	31,520,510	80%	10%	129,104
SAMEA1696417	anterior neural plate (Xenopus tropicalis, SAMEA1696417)	29,609,058	80%	10%	125,122
SAMEA1696418	anterior neural plate (Xenopus tropicalis, SAMEA1696418)	29,140,836	80%	10%	125,548
SAMEA2431823	liver (Xenopus tropicalis, 5, female, SAMEA2431823)	37,226,659	84%	23%	125,570
SAMEA2431824	liver (Xenopus tropicalis, 5, female, SAMEA2431824)	36,011,911	55%	15%	109,353
SAMEA2431825	liver (Xenopus tropicalis, 5, female, SAMEA2431825)	34,646,332	77%	22%	118,100
SAMEA2431826	liver (Xenopus tropicalis, 5, female, SAMEA2431826)	35,243,576	68%	19%	123,926
SAMEA2431827	liver (Xenopus tropicalis, 5, female, SAMEA2431827)	35,637,492	60%	16%	114,242
SAMEA2431828	liver (Xenopus tropicalis, 5, female, SAMEA2431828)	36,699,624	83%	23%	130,104
SAMEA2431829	liver (Xenopus tropicalis, 5, female, SAMEA2431829)	36,096,725	81%	22%	119,054
SAMEA2431830	liver (Xenopus tropicalis, 5, female, SAMEA2431830)	34,221,092	78%	21%	120,932
SAMN00861983	embryo (Xenopus tropicalis, SAMN00861983)	23,269,358	79%	26%	125,937
SAMN00861984	embryo (Xenopus tropicalis, SAMN00861984)	21,684,818	72%	25%	122,784
SAMN00861985	embryo (Xenopus tropicalis, SAMN00861985)	22,837,890	76%	26%	124,988
SAMN00861986	embryo (Xenopus tropicalis, SAMN00861986)	25,382,696	77%	27%	127,118
SAMN00861987	embryo (Xenopus tropicalis, SAMN00861987)	25,088,904	81%	30%	139,941
SAMN00861988	embryo (Xenopus tropicalis, SAMN00861988)	23,358,120	75%	27%	135,318
SAMN00861989	embryo (Xenopus tropicalis, SAMN00861989)	23,463,846	80%	29%	138,610
SAMN00861990	embryo (Xenopus tropicalis, SAMN00861990)	26,254,908	81%	30%	145,051
SAMN00861991	embryo (Xenopus tropicalis, SAMN00861991)	21,911,412	79%	29%	145,021
SAMN00861992	embryo (Xenopus tropicalis, SAMN00861992)	25,186,562	79%	29%	145,727
SAMN00861993	embryo (Xenopus tropicalis, SAMN00861993)	27,013,936	83%	32%	153,330
SAMN00861994	embryo (Xenopus tropicalis, SAMN00861994)	24,947,110	83%	32%	151,957
SAMN00861995	embryo (Xenopus tropicalis, SAMN00861995)	24,317,950	82%	32%	150,970
SAMN00861996	embryo (Xenopus tropicalis, SAMN00861996)	27,569,412	83%	33%	160,089
SAMN00861997	embryo (Xenopus tropicalis, SAMN00861997)	24,359,684	82%	32%	160,404
SAMN00861998	embryo (Xenopus tropicalis, SAMN00861998)	28,395,156	82%	32%	163,845
SAMN00861999	embryo (Xenopus tropicalis, SAMN00861999)	21,145,180	85%	29%	116,668
SAMN00862000	embryo (Xenopus tropicalis, SAMN00862000)	23,897,950	86%	29%	119,599
SAMN00862001	embryo (Xenopus tropicalis, SAMN00862001)	20,814,412	85%	29%	117,426
SAMN00862002	embryo (Xenopus tropicalis, SAMN00862002)	23,446,722	86%	29%	120,086
SAMN00862003	embryo (Xenopus tropicalis, SAMN00862003)	22,124,098	85%	29%	120,644
SAMN00862004	embryo (Xenopus tropicalis, SAMN00862004)	23,019,596	86%	29%	123,751
SAMN00862005	embryo (Xenopus tropicalis, SAMN00862005)	22,496,290	85%	28%	128,000
SAMN00862006	embryo (Xenopus tropicalis, SAMN00862006)	20,881,868	83%	27%	125,476
SAMN00862007	embryo (Xenopus tropicalis, SAMN00862007)	21,303,858	83%	28%	124,202
SAMN00862008	embryo (Xenopus tropicalis, SAMN00862008)	19,646,960	83%	28%	121,335
SAMN00862009	embryo (Xenopus tropicalis, SAMN00862009)	19,152,074	84%	29%	123,082
SAMN00862010	embryo (Xenopus tropicalis, SAMN00862010)	25,905,300	84%	30%	139,521
SAMN00862011	embryo (Xenopus tropicalis, SAMN00862011)	17,811,048	86%	30%	135,413
SAMN00862012	embryo (Xenopus tropicalis, SAMN00862012)	22,842,962	86%	31%	147,856
SAMN00862013	embryo (Xenopus tropicalis, SAMN00862013)	20,270,336	86%	31%	141,681
SAMN00862014	embryo (Xenopus tropicalis, SAMN00862014)	20,341,278	86%	30%	143,255
SAMN00862015	embryo (Xenopus tropicalis, SAMN00862015)	22,276,024	87%	31%	147,411
SAMN00862016	embryo (Xenopus tropicalis, SAMN00862016)	22,260,150	86%	32%	147,519
SAMN00862017	embryo (Xenopus tropicalis, SAMN00862017)	23,017,882	87%	32%	151,974
SAMN00862018	embryo (Xenopus tropicalis, SAMN00862018)	24,547,768	87%	32%	154,012
SAMN00862019	embryo (Xenopus tropicalis, SAMN00862019)	22,029,774	87%	32%	153,939
SAMN00862020	embryo (Xenopus tropicalis, SAMN00862020)	23,104,980	87%	33%	150,129
SAMN00862021	embryo (Xenopus tropicalis, SAMN00862021)	21,466,156	87%	33%	151,212
SAMN00862022	embryo (Xenopus tropicalis, SAMN00862022)	20,848,388	87%	33%	155,635
SAMN01758126	brain (Xenopus tropicalis, adult, mixed, SAMN01758126)	51,896,478	62%	7%	145,861
SAMN01758127	liver (Xenopus tropicalis, adult, mixed, SAMN01758127)	157,651,690	66%	16%	111,103
SAMN01758128	kidney (Xenopus tropicalis, adult, mixed, SAMN01758128)	191,851,606	62%	7%	131,768
SAMN01758129	heart (Xenopus tropicalis, adult, mixed, SAMN01758129)	68,509,352	64%	9%	141,351
SAMN01758130	skeletal muscle (Xenopus tropicalis, adult, mixed, SAMN01758130)	68,034,522	70%	12%	119,880
SAMN02230354	XT32_BB3KD_polyA-RNA_1st (Xenopus tropicalis, SAMN02230354)	233,458,046	81%	10%	189,723
SAMN02230355	XT32_BB3KD_polyA-RNA_3rd (Xenopus tropicalis, SAMN02230355)	246,130,048	81%	9%	193,454
SAMN02230356	XT32_BB3KD_polyA-RNA_2nd (Xenopus tropicalis, SAMN02230356)	170,922,670	80%	9%	186,936
SAMN02230357	XT32_ctrl_polyA-RNA_2nd (Xenopus tropicalis, SAMN02230357)	242,708,866	81%	10%	195,912
SAMN02230358	XT32_ctrl_polyA-RNA_1st (Xenopus tropicalis, SAMN02230358)	260,474,516	82%	10%	192,422
SAMN02230359	XT20_polyA-RNA (Xenopus tropicalis, SAMN02230359)	251,815,420	83%	10%	188,361
SAMN02230360	XT32_ctrl_polyA-RNA_3rd (Xenopus tropicalis, SAMN02230360)	173,835,658	81%	10%	184,625
SAMN02486393	st10.5 solvent-only control (Xenopus tropicalis, SAMN02486393)	98,233,294	81%	9%	143,708
SAMN02486396	st10.5 SB431542 treated (Xenopus tropicalis, SAMN02486396)	101,682,262	77%	8%	143,447
SAMN02486398	st10.5 uninjected control (Xenopus tropicalis, SAMN02486398)	21,855,459	81%	4%	100,744
SAMN02486399	st10.5 FoxH1 MO (Xenopus tropicalis, SAMN02486399)	20,594,222	81%	4%	100,088
SAMN03863231	whole embryo (Xenopus tropicalis, SAMN03863231)	62,815,056	70%	17%	129,466
SAMN03863232	whole embryo (Xenopus tropicalis, SAMN03863232)	62,037,866	84%	19%	166,891
SAMN03863233	whole embryo (Xenopus tropicalis, SAMN03863233)	62,271,108	84%	19%	167,426
SAMN03863234	whole embryo (Xenopus tropicalis, SAMN03863234)	66,904,198	78%	20%	147,506
SAMN03863235	whole embryo (Xenopus tropicalis, SAMN03863235)	60,859,870	84%	18%	166,120
SAMN03863236	whole embryo (Xenopus tropicalis, SAMN03863236)	74,373,192	83%	21%	170,556
SAMN03863237	whole embryo (Xenopus tropicalis, SAMN03863237)	51,558,792	71%	19%	130,571
SAMN03863238	whole embryo (Xenopus tropicalis, SAMN03863238)	59,833,356	82%	16%	159,754
SAMN03863239	whole embryo (Xenopus tropicalis, SAMN03863239)	53,470,328	82%	16%	154,660
SAMN03863240	whole embryo (Xenopus tropicalis, SAMN03863240)	47,068,886	68%	17%	117,515
SAMN03863241	whole embryo (Xenopus tropicalis, SAMN03863241)	62,055,324	81%	20%	167,141
SAMN03863242	whole embryo (Xenopus tropicalis, SAMN03863242)	64,463,206	82%	19%	168,189
SAMN03863243	whole embryo (Xenopus tropicalis, SAMN03863243)	59,375,386	66%	16%	125,320
SAMN03863244	whole embryo (Xenopus tropicalis, SAMN03863244)	64,890,466	83%	20%	170,468
SAMN03863245	whole embryo (Xenopus tropicalis, SAMN03863245)	62,866,418	83%	21%	168,119
SAMN03863246	whole embryo (Xenopus tropicalis, SAMN03863246)	81,926,778	80%	21%	148,965
SAMN03863247	whole embryo (Xenopus tropicalis, SAMN03863247)	70,224,064	85%	19%	169,810
SAMN03863248	whole embryo (Xenopus tropicalis, SAMN03863248)	67,628,566	85%	15%	162,282
SAMN04027313	Uninjected_1 (Xenopus tropicalis, SAMN04027313)	111,799,382	85%	20%	150,969
SAMN04027314	CoMO_1 (Xenopus tropicalis, SAMN04027314)	117,145,462	84%	20%	151,392
SAMN04027315	wnt8a_MO_1 (Xenopus tropicalis, SAMN04027315)	111,424,610	84%	20%	149,068
SAMN04027316	wnt8a_MO-CSKAwnt8a_1 (Xenopus tropicalis, SAMN04027316)	106,111,782	84%	20%	148,225
SAMN04027317	Uninjected_2 (Xenopus tropicalis, SAMN04027317)	108,693,096	84%	21%	151,557
SAMN04027318	CoMO_2 (Xenopus tropicalis, SAMN04027318)	108,653,902	85%	21%	152,337
SAMN04027319	wnt8a_MO_2 (Xenopus tropicalis, SAMN04027319)	106,305,728	85%	21%	152,541
SAMN04027320	wnt8a_MO-CSKAwnt8a_2 (Xenopus tropicalis, SAMN04027320)	97,101,146	85%	21%	151,958
SAMN04027321	Uninjected_3 (Xenopus tropicalis, SAMN04027321)	116,986,110	86%	22%	157,590
SAMN04027322	CoMO_3 (Xenopus tropicalis, SAMN04027322)	109,372,348	86%	24%	159,271
SAMN04027323	wnt8a_MO_3 (Xenopus tropicalis, SAMN04027323)	105,387,316	85%	23%	157,574
SAMN04027324	wnt8a_MO-CSKAwnt8a_3 (Xenopus tropicalis, SAMN04027324)	92,085,064	85%	21%	151,169
SAMN04272100	oocyte (Xenopus tropicalis, adult, female, SAMN04272100)	157,615,213	65%	19%	145,136
SAMN04578582	whole embryo (Xenopus tropicalis, 2h, pooled male and female, SAMN04578582)	14,909,217	55%	11%	106,111
SAMN04578583	whole embryo (Xenopus tropicalis, 5h, pooled male and female, SAMN04578583)	16,558,889	54%	10%	108,974
SAMN04578584	whole embryo (Xenopus tropicalis, 7h, pooled male and female, SAMN04578584)	30,276,571	47%	5%	109,668
SAMN04578585	whole embryo (Xenopus tropicalis, 9h, pooled male and female, SAMN04578585)	9,419,310	28%	3%	79,802
SAMN04578586	whole embryo (Xenopus tropicalis, 2h, pooled male and female, SAMN04578586)	15,990,337	59%	12%	111,215
SAMN04578587	whole embryo (Xenopus tropicalis, 9h, pooled male and female, SAMN04578587)	20,197,479	43%	3%	88,349

Show alignments statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent spliced reads
ERR045176	ERX022238	ERP002095	SAMEA1025683	10,917,912	83%	17%
ERR045178	ERX022240	ERP002095	SAMEA1025684	10,594,686	83%	17%
ERR045173	ERX022235	ERP002095	SAMEA1025685	3,486	84%	17%
ERR045175	ERX022237	ERP002095	SAMEA1025686	11,174,204	83%	17%
ERR045177	ERX022239	ERP002095	SAMEA1025687	11,329,040	72%	15%
ERR045174	ERX022236	ERP002095	SAMEA1025688	13,327,780	84%	17%
ERR218149	ERX192814	ERP002095	SAMEA1696413	30,752,654	78%	10%
ERR218150	ERX192815	ERP002095	SAMEA1696414	35,076,528	78%	9%
ERR218148	ERX192813	ERP002095	SAMEA1696415	32,742,590	79%	10%
ERR218153	ERX192818	ERP002095	SAMEA1696416	31,520,510	80%	10%
ERR218151	ERX192816	ERP002095	SAMEA1696417	29,609,058	80%	10%
ERR218152	ERX192817	ERP002095	SAMEA1696418	29,140,836	80%	10%
ERR469305	ERX435468	ERP005450	SAMEA2431823	37,226,659	84%	23%
ERR469306	ERX435469	ERP005450	SAMEA2431824	36,011,911	55%	15%
ERR469302	ERX435470	ERP005450	SAMEA2431825	34,646,332	77%	22%
ERR469309	ERX435471	ERP005450	SAMEA2431826	35,243,576	68%	19%
ERR469303	ERX435472	ERP005450	SAMEA2431827	35,637,492	60%	16%
ERR469308	ERX435473	ERP005450	SAMEA2431828	36,699,624	83%	23%
ERR469307	ERX435474	ERP005450	SAMEA2431829	36,096,725	81%	22%
ERR469304	ERX435475	ERP005450	SAMEA2431830	34,221,092	78%	21%
SRR489439	SRX143516	SRP012375	SAMN00861983	23,269,358	79%	26%
SRR489440	SRX143517	SRP012375	SAMN00861984	21,684,818	72%	25%
SRR489441	SRX143518	SRP012375	SAMN00861985	22,837,890	76%	26%
SRR489442	SRX143519	SRP012375	SAMN00861986	25,382,696	77%	27%
SRR489443	SRX143520	SRP012375	SAMN00861987	25,088,904	81%	30%
SRR489444	SRX143521	SRP012375	SAMN00861988	23,358,120	75%	27%
SRR489445	SRX143522	SRP012375	SAMN00861989	23,463,846	80%	29%
SRR489446	SRX143523	SRP012375	SAMN00861990	26,254,908	81%	30%
SRR489447	SRX143524	SRP012375	SAMN00861991	21,911,412	79%	29%
SRR489448	SRX143525	SRP012375	SAMN00861992	25,186,562	79%	29%
SRR489449	SRX143526	SRP012375	SAMN00861993	27,013,936	83%	32%
SRR489450	SRX143527	SRP012375	SAMN00861994	24,947,110	83%	32%
SRR489451	SRX143528	SRP012375	SAMN00861995	24,317,950	82%	32%
SRR489452	SRX143529	SRP012375	SAMN00861996	27,569,412	83%	33%
SRR489453	SRX143530	SRP012375	SAMN00861997	24,359,684	82%	32%
SRR489454	SRX143531	SRP012375	SAMN00861998	28,395,156	82%	32%
SRR489455	SRX143532	SRP012375	SAMN00861999	21,145,180	85%	29%
SRR489456	SRX143533	SRP012375	SAMN00862000	23,897,950	86%	29%
SRR489457	SRX143534	SRP012375	SAMN00862001	20,814,412	85%	29%
SRR489458	SRX143535	SRP012375	SAMN00862002	23,446,722	86%	29%
SRR489459	SRX143536	SRP012375	SAMN00862003	22,124,098	85%	29%
SRR489460	SRX143537	SRP012375	SAMN00862004	23,019,596	86%	29%
SRR489461	SRX143538	SRP012375	SAMN00862005	22,496,290	85%	28%
SRR489462	SRX143539	SRP012375	SAMN00862006	20,881,868	83%	27%
SRR489463	SRX143540	SRP012375	SAMN00862007	21,303,858	83%	28%
SRR489464	SRX143541	SRP012375	SAMN00862008	19,646,960	83%	28%
SRR489465	SRX143542	SRP012375	SAMN00862009	19,152,074	84%	29%
SRR489466	SRX143543	SRP012375	SAMN00862010	25,905,300	84%	30%
SRR489467	SRX143544	SRP012375	SAMN00862011	17,811,048	86%	30%
SRR489468	SRX143545	SRP012375	SAMN00862012	22,842,962	86%	31%
SRR489469	SRX143546	SRP012375	SAMN00862013	20,270,336	86%	31%
SRR489470	SRX143547	SRP012375	SAMN00862014	20,341,278	86%	30%
SRR489471	SRX143548	SRP012375	SAMN00862015	22,276,024	87%	31%
SRR489472	SRX143549	SRP012375	SAMN00862016	22,260,150	86%	32%
SRR489473	SRX143550	SRP012375	SAMN00862017	23,017,882	87%	32%
SRR489474	SRX143551	SRP012375	SAMN00862018	24,547,768	87%	32%
SRR489475	SRX143552	SRP012375	SAMN00862019	22,029,774	87%	32%
SRR489476	SRX143553	SRP012375	SAMN00862020	23,104,980	87%	33%
SRR489477	SRX143554	SRP012375	SAMN00862021	21,466,156	87%	33%
SRR489478	SRX143555	SRP012375	SAMN00862022	20,848,388	87%	33%
SRR579560	SRX191164	SRP015997	SAMN01758126	51,896,478	62%	7%
SRR579561	SRX191165	SRP015997	SAMN01758127	157,651,690	66%	16%
SRR579562	SRX191166	SRP015997	SAMN01758128	191,851,606	62%	7%
SRR579563	SRX191167	SRP015997	SAMN01758129	68,509,352	64%	9%
SRR579564	SRX191168	SRP015997	SAMN01758130	68,034,522	70%	12%
SRR929119	SRX319533	SRP026685	SAMN02230354	233,458,046	81%	10%
SRR929121	SRX319535	SRP026685	SAMN02230355	246,130,048	81%	9%
SRR929120	SRX319534	SRP026685	SAMN02230356	170,922,670	80%	9%
SRR929123	SRX319537	SRP026685	SAMN02230357	242,708,866	81%	10%
SRR929122	SRX319536	SRP026685	SAMN02230358	260,474,516	82%	10%
SRR929125	SRX319539	SRP026685	SAMN02230359	251,815,420	83%	10%
SRR929124	SRX319538	SRP026685	SAMN02230360	173,835,658	81%	10%
SRR1060749	SRX399452	SRP034731	SAMN02486393	98,233,294	81%	9%
SRR1060750	SRX399453	SRP034731	SAMN02486396	101,682,262	77%	8%
SRR1060751	SRX399454	SRP034731	SAMN02486398	21,855,459	81%	4%
SRR1060752	SRX399455	SRP034731	SAMN02486399	20,594,222	81%	4%
SRR2105075	SRX1099252	SRP061238	SAMN03863231	62,815,056	70%	17%
SRR2105076	SRX1099254	SRP061238	SAMN03863232	62,037,866	84%	19%
SRR2105077	SRX1099255	SRP061238	SAMN03863233	62,271,108	84%	19%
SRR2105078	SRX1099256	SRP061238	SAMN03863234	66,904,198	78%	20%
SRR2105079	SRX1099257	SRP061238	SAMN03863235	60,859,870	84%	18%
SRR2105080	SRX1099258	SRP061238	SAMN03863236	74,373,192	83%	21%
SRR2105081	SRX1099259	SRP061238	SAMN03863237	51,558,792	71%	19%
SRR2105082	SRX1099260	SRP061238	SAMN03863238	59,833,356	82%	16%
SRR2105083	SRX1099261	SRP061238	SAMN03863239	53,470,328	82%	16%
SRR2105084	SRX1099262	SRP061238	SAMN03863240	47,068,886	68%	17%
SRR2105085	SRX1099263	SRP061238	SAMN03863241	62,055,324	81%	20%
SRR2105086	SRX1099264	SRP061238	SAMN03863242	64,463,206	82%	19%
SRR2105087	SRX1099265	SRP061238	SAMN03863243	59,375,386	66%	16%
SRR2105088	SRX1099266	SRP061238	SAMN03863244	64,890,466	83%	20%
SRR2105089	SRX1099267	SRP061238	SAMN03863245	62,866,418	83%	21%
SRR2105090	SRX1099268	SRP061238	SAMN03863246	81,926,778	80%	21%
SRR2105091	SRX1099269	SRP061238	SAMN03863247	70,224,064	85%	19%
SRR2105092	SRX1099270	SRP061238	SAMN03863248	67,628,566	85%	15%
SRR2230069	SRX1178592	SRP063109	SAMN04027313	111,799,382	85%	20%
SRR2230070	SRX1178593	SRP063109	SAMN04027314	117,145,462	84%	20%
SRR2230071	SRX1178594	SRP063109	SAMN04027315	111,424,610	84%	20%
SRR2230072	SRX1178595	SRP063109	SAMN04027316	106,111,782	84%	20%
SRR2230073	SRX1178596	SRP063109	SAMN04027317	108,693,096	84%	21%
SRR2230074	SRX1178597	SRP063109	SAMN04027318	108,653,902	85%	21%
SRR2230075	SRX1178598	SRP063109	SAMN04027319	106,305,728	85%	21%
SRR2230076	SRX1178599	SRP063109	SAMN04027320	97,101,146	85%	21%
SRR2230077	SRX1178600	SRP063109	SAMN04027321	116,986,110	86%	22%
SRR2230078	SRX1178601	SRP063109	SAMN04027322	109,372,348	86%	24%
SRR2230079	SRX1178602	SRP063109	SAMN04027323	105,387,316	85%	23%
SRR2230080	SRX1178603	SRP063109	SAMN04027324	92,085,064	85%	21%
SRR2919165	SRX1434837	SRP066274	SAMN04272100	157,615,213	65%	19%
SRR3420413	SRX1660358	SRP072296	SAMN04578582	14,909,217	55%	11%
SRR3420425	SRX1660359	SRP072296	SAMN04578583	16,558,889	54%	10%
SRR3420424	SRX1660360	SRP072296	SAMN04578584	30,276,571	47%	5%
SRR3420423	SRX1660361	SRP072296	SAMN04578585	9,419,310	28%	3%
SRR3420422	SRX1660362	SRP072296	SAMN04578586	15,990,337	59%	12%
SRR3420427	SRX1660363	SRP072296	SAMN04578587	20,197,479	43%	3%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Danio rerio known RefSeq (NP_)	15,598	13,879 (88.98%)	13,879 (88.98%)	65.02%	69.51%
Xenopus GenBank	252	229 (90.87%)	229 (90.87%)	76.38%	81.42%
Xenopus laevis GenBank	18,037	17,634 (97.77%)	17,634 (97.77%)	75.71%	83.51%
Xenopus laevis known RefSeq (NP_)	10,995	10,803 (98.25%)	10,803 (98.25%)	76.37%	84.17%
Same-species GenBank	13,569	13,285 (97.91%)	13,285 (97.91%)	80.34%	85.39%
Same-species known RefSeq (NP_)	8,611	8,460 (98.25%)	8,460 (98.25%)	79.89%	85.28%
Gallus gallus known RefSeq (NP_)	6,390	5,938 (92.93%)	5,938 (92.93%)	69.46%	76.44%
Homo sapiens known RefSeq (NP_)	44,116	38,169 (86.52%)	38,169 (86.52%)	65.48%	68.98%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

First Pass	Total
Xenopus_tropicalis_v9.1 (Current) Coverage: 99.32%	Xenopus_tropicalis_v9.1 (Current) Coverage: 99.53%
Xtropicalis_v7 (Previous) Coverage: 99.61%	Xtropicalis_v7 (Previous) Coverage: 99.65%
Percent Identity: 99.99%	Percent Identity: 99.99%

Comparison of the current and previous annotations

The annotation produced for this release (103) was compared to the annotation in the previous release (102) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	Xenopus_tropicalis_v9.1 (Current) to Xtropicalis_v7 (Previous)
Identical	24%
Minor changes	49%
Major changes	11%
New	6%
Deprecated	10%
Other	10%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 2:134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20

RefSeq

Integrated reference sequences