NCBI Rousettus aegyptiacus Annotation Release 101

The RefSeq genome records for Rousettus aegyptiacus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Rousettus aegyptiacus Annotation Release 101

Annotation release ID: 101
Date of Entrez queries for transcripts and proteins: Sep 25 2020
Date of submission of annotation to the public databases: Sep 28 2020
Software version: 8.5

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
mRouAeg1.p	GCF_014176215.1	Bat1K	08-07-2020	Reference	unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	mRouAeg1.p
Genes and pseudogenes	36,769
protein-coding	19,070
non-coding	5,160
transcribed pseudogenes	13
non-transcribed pseudogenes	12,454
genes with variants	11,528
immunoglobulin/T-cell receptor gene segments	72
other	0
mRNAs	56,391
fully-supported	55,150
with > 5% ab initio	631
partial	67
with filled gap(s)	5
known RefSeq (NM_)	2
model RefSeq (XM_)	56,389
non-coding RNAs	9,085
fully-supported	7,278
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	8,759
pseudo transcripts	13
fully-supported	10
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	13
CDSs	56,463
fully-supported	55,150
with > 5% ab initio	726
partial	69
with major correction(s)	831
known RefSeq (NP_)	2
model RefSeq (XP_)	56,389

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	24,230	40,948	13,000	50	2,693,686
All transcripts	65,476	4,121	3,265	50	104,134
mRNA	56,391	4,265	3,443	114	104,134
misc_RNA	2,062	4,292	3,294	119	35,540
tRNA	326	74	73	71	87
lncRNA	5,217	3,886	2,217	94	78,055
snoRNA	456	110	109	50	329
snRNA	455	114	107	62	197
guide_RNA	23	176	140	83	421
rRNA	546	125	119	117	1,869
Single-exon transcripts	1,387	1,619	981	114	19,406
coding transcripts (NM_/XM_ )	1,386	1,621	981	114	19,406
non-coding transcripts (NR_/XR_ )	1	119	119	119	119
CDSs	56,391	2,216	1,593	96	102,867
Exons	255,784	416	142	1	69,411
in coding transcripts (NM_/XM_ )	239,319	380	140	1	31,962
in non-coding transcripts (NR_/XR_ )	28,569	627	157	2	69,411
Introns	225,803	5,756	1,337	30	988,188
in coding transcripts (NM_/XM_ )	214,374	5,549	1,319	30	988,188
in non-coding transcripts (NR_/XR_ )	23,232	7,085	1,465	30	565,301

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	2.73	1	1	50
Number of exons per transcript	12.83	10	1	314

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 19070 coding genes, 18603 genes had a protein with an alignment covering 50% or more of the query and 16262 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly. RepeatMasker results are only used for organisms for which a comprehensive repeat library is available.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with RepeatMasker	% Masked with WindowMasker
mRouAeg1.p	GCF_014176215.1	32.58%	25.93%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Depending on the other evidence available, long 454 reads (with average length above 250 nt) may be aligned as traditional evidence and reported in the Transcript alignments section or aligned with RNA-Seq reads and reported in the RNA-Seq alignments section.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	2	2 (100.00%)	2 (100.00%)	99.84%	100.00%
Same-species Genbank	16	16 (100.00%)	15 (93.75%)	99.30%	99.89%
Chiroptera known RefSeq (NM_/NR_)	554	268 (48.38%)	235 (42.42%)	96.60%	98.90%
Chiroptera Genbank	2,267	2,082 (91.84%)	1,314 (57.96%)	91.99%	99.34%
Chiroptera TSA	285,130	209,986 (73.65%)	33,812 (11.86%)	96.62%	98.73%
Chiroptera EST	144	32 (22.22%)	13 (9.03%)	91.43%	97.23%
Homo sapiens known RefSeq (NM_/NR_)	77,811	64,271 (82.60%)	15,788 (20.29%)	89.26%	82.71%
Homo sapiens Genbank	325,155	149,388 (45.94%)	52,807 (16.24%)	89.90%	90.30%

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Hide alignments statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	3,248,437,514	78%	20%	329,962
SAMN04262201	NA	Bone marrow (Rousettus aegyptiacus, female, SAMN04262201)	67,906,758	78%	14%	163,521
SAMN04262202	NA	Brain (Rousettus aegyptiacus, female, SAMN04262202)	55,025,549	74%	10%	176,985
SAMN04262203	NA	Heart (Rousettus aegyptiacus, female, SAMN04262203)	77,322,113	76%	16%	174,625
SAMN04262204	NA	Kidney (Rousettus aegyptiacus, female, SAMN04262204)	59,788,692	73%	14%	171,318
SAMN04262205	NA	Lung (Rousettus aegyptiacus, female, SAMN04262205)	155,024,704	72%	12%	199,163
SAMN04262206	NA	Axillary lymph nodes (Rousettus aegyptiacus, female, SAMN04262206)	63,191,347	74%	10%	163,079
SAMN04262207	NA	Liver (Rousettus aegyptiacus, female, SAMN04262207)	179,944,702	74%	23%	174,154
SAMN04262208	NA	Ovary (Rousettus aegyptiacus, female, SAMN04262208)	75,060,971	73%	13%	195,979
SAMN04262209	NA	Peripheral blood (PBMC) (Rousettus aegyptiacus, female, SAMN04262209)	56,561,148	77%	21%	151,255
SAMN04262210	NA	Spleen (Rousettus aegyptiacus, female, SAMN04262210)	56,157,476	74%	10%	165,097
SAMN04262211	NA	Bone marrow (Rousettus aegyptiacus, male, SAMN04262211)	95,979,066	73%	13%	176,180
SAMN04262212	NA	Brain (Rousettus aegyptiacus, male, SAMN04262212)	150,761,634	68%	11%	199,238
SAMN04262213	NA	Heart (Rousettus aegyptiacus, male, SAMN04262213)	40,086,000	74%	17%	144,684
SAMN04262214	NA	Kidney (Rousettus aegyptiacus, male, SAMN04262214)	142,959,930	68%	14%	191,939
SAMN04262215	NA	Lung (Rousettus aegyptiacus, male, SAMN04262215)	31,073,286	68%	11%	151,354
SAMN04262216	NA	Axillary lymph nodes (Rousettus aegyptiacus, male, SAMN04262216)	176,946,568	68%	11%	189,116
SAMN04262217	NA	Liver (Rousettus aegyptiacus, male, SAMN04262217)	54,785,630	71%	25%	139,991
SAMN04262218	NA	Peripheral blood (PBMC) (Rousettus aegyptiacus, male, SAMN04262218)	185,490,836	73%	25%	168,951
SAMN04262219	NA	Spleen (Rousettus aegyptiacus, male, SAMN04262219)	196,945,508	68%	11%	192,306
SAMN04262220	NA	Testes (Rousettus aegyptiacus, male, SAMN04262220)	192,961,826	69%	9%	214,898
SAMN08330235	NA	Model organism or animal sample from Rousettus aegyptiacus (Rousettus aegyptiacus, SAMN08330235)	50,335,776	89%	28%	155,182
SAMN08330236	NA	Model organism or animal sample from Rousettus aegyptiacus (Rousettus aegyptiacus, SAMN08330236)	43,896,384	89%	29%	154,880
SAMN08330237	NA	Model organism or animal sample from Rousettus aegyptiacus (Rousettus aegyptiacus, SAMN08330237)	48,430,676	89%	19%	152,090
SAMN08330238	NA	Model organism or animal sample from Rousettus aegyptiacus (Rousettus aegyptiacus, SAMN08330238)	41,375,938	90%	31%	153,649
SAMN08330239	NA	Model organism or animal sample from Rousettus aegyptiacus (Rousettus aegyptiacus, SAMN08330239)	38,228,280	89%	27%	151,342
SAMN08330240	NA	Model organism or animal sample from Rousettus aegyptiacus (Rousettus aegyptiacus, SAMN08330240)	39,266,508	72%	29%	148,814
SAMN08330241	NA	Model organism or animal sample from Rousettus aegyptiacus (Rousettus aegyptiacus, SAMN08330241)	37,462,800	91%	29%	150,126
SAMN08330242	NA	Model organism or animal sample from Rousettus aegyptiacus (Rousettus aegyptiacus, SAMN08330242)	48,483,858	90%	28%	157,470
SAMN08330243	NA	Model organism or animal sample from Rousettus aegyptiacus (Rousettus aegyptiacus, SAMN08330243)	45,541,344	88%	28%	157,920
SAMN09878720	NA	Kidney (Rousettus aegyptiacus, Adult, SAMN09878720)	218,452,606	89%	33%	198,807
SAMN14167039	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167039)	7,634,614	94%	26%	115,130
SAMN14167040	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167040)	7,116,895	95%	26%	111,871
SAMN14167041	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167041)	10,740,278	94%	27%	124,042
SAMN14167042	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167042)	7,803,021	94%	27%	113,869
SAMN14167043	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167043)	11,199,644	94%	26%	120,305
SAMN14167044	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167044)	7,135,544	94%	26%	116,207
SAMN14167045	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167045)	7,097,450	95%	28%	115,726
SAMN14167046	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167046)	5,849,864	86%	26%	105,298
SAMN14167047	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167047)	4,403,615	94%	26%	102,612
SAMN14167048	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167048)	7,523,634	94%	26%	116,948
SAMN14167049	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167049)	5,459,888	93%	26%	106,187
SAMN14167050	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167050)	6,630,556	91%	27%	113,474
SAMN14167051	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167051)	6,516,022	94%	34%	84,178
SAMN14167052	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167052)	7,380,586	94%	26%	113,066
SAMN14167053	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167053)	7,990,522	94%	25%	118,297
SAMN14167054	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167054)	11,355,745	95%	26%	124,462
SAMN14167055	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167055)	4,041,837	91%	25%	95,436
SAMN14167056	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167056)	7,530,714	94%	26%	117,161
SAMN14167057	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167057)	7,140,204	94%	27%	114,907
SAMN14167058	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167058)	3,398,683	94%	25%	92,460
SAMN14167059	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167059)	13,064,601	94%	26%	129,348
SAMN14167060	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167060)	17,296,763	95%	26%	132,564
SAMN14167061	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167061)	4,329,348	95%	26%	99,890
SAMN14167062	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167062)	4,704,038	94%	33%	79,084
SAMN14167063	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167063)	9,444,078	94%	26%	121,144
SAMN14167064	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167064)	4,621,074	93%	26%	104,686
SAMN14167065	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167065)	7,061,997	95%	26%	112,516
SAMN14167066	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167066)	9,379,717	90%	26%	119,116
SAMN14167067	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167067)	5,063,297	95%	26%	105,445
SAMN14167068	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167068)	6,241,345	94%	26%	112,484
SAMN14167069	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167069)	11,530,911	95%	26%	123,990
SAMN14167070	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167070)	5,924,925	95%	33%	85,406
SAMN14167071	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167071)	8,719,378	93%	25%	120,079
SAMN14167072	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167072)	11,777,527	94%	26%	124,023
SAMN14167073	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167073)	6,288,821	93%	26%	111,685
SAMN14167074	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167074)	10,343,899	94%	26%	123,453
SAMN14167075	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167075)	9,833,847	92%	34%	95,814
SAMN14167076	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167076)	8,127,664	93%	25%	113,144
SAMN14167077	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167077)	10,660,846	94%	26%	125,306
SAMN14167078	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167078)	11,280,747	94%	26%	125,341
SAMN14167079	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167079)	5,238,476	93%	26%	104,300
SAMN14167080	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167080)	7,296,409	94%	25%	116,632
SAMN14167081	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167081)	14,498,367	94%	25%	128,958
SAMN14167082	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167082)	3,576,034	93%	25%	96,218
SAMN14167083	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167083)	12,416,487	94%	26%	128,921
SAMN14167084	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167084)	7,350,760	94%	27%	115,130
SAMN14167085	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167085)	6,341,881	94%	25%	109,190
SAMN14167086	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167086)	10,579,709	94%	26%	123,092
SAMN14167087	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167087)	6,408,347	93%	34%	93,156
SAMN14167088	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167088)	6,810,425	95%	26%	110,218
SAMN14167089	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167089)	7,243,442	94%	26%	112,404
SAMN14167090	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167090)	11,580,083	95%	26%	125,400
SAMN14167091	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167091)	5,053,393	94%	25%	103,062
SAMN14167092	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167092)	6,358,649	93%	27%	112,279
SAMN14167093	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167093)	14,764,931	94%	26%	130,679
SAMN14167094	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167094)	6,549,405	91%	34%	83,203
SAMN14167095	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167095)	8,872,549	94%	27%	119,430
SAMN14167096	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167096)	8,224,049	94%	26%	117,255
SAMN14167097	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167097)	8,487,955	94%	34%	91,614
SAMN14167098	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167098)	7,650,759	94%	26%	115,299
SAMN14167099	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167099)	8,387,669	93%	27%	115,544
SAMN14167100	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167100)	5,168,960	95%	27%	103,936
SAMN14167101	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167101)	5,957,208	94%	26%	111,118
SAMN14167102	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167102)	7,456,814	95%	34%	101,235
SAMN14167103	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167103)	4,617,957	93%	26%	103,307
SAMN14167104	32231668	kidney, kidney fibroblasts, (Rousettus aegyptiacus, SAMN14167104)	6,454,743	94%	26%	113,473

Show alignments statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
SRR2913352	SRX1428010	SRP066106	SAMN04262201	67,906,758	78%	14%
SRR2913353	SRX1428013	SRP066106	SAMN04262202	55,025,549	74%	10%
SRR2913354	SRX1428014	SRP066106	SAMN04262203	77,322,113	76%	16%
SRR2913355	SRX1428015	SRP066106	SAMN04262204	59,788,692	73%	14%
SRR2913598	SRX1428016	SRP066106	SAMN04262205	155,024,704	72%	12%
SRR2914051	SRX1428728	SRP066106	SAMN04262206	63,191,347	74%	10%
SRR2914059	SRX1428735	SRP066106	SAMN04262207	179,944,702	74%	23%
SRR2914063	SRX1428739	SRP066106	SAMN04262208	75,060,971	73%	13%
SRR2914068	SRX1428745	SRP066106	SAMN04262209	56,561,148	77%	21%
SRR2914113	SRX1428791	SRP066106	SAMN04262210	56,157,476	74%	10%
SRR2914282	SRX1428957	SRP066106	SAMN04262211	95,979,066	73%	13%
SRR2914295	SRX1428963	SRP066106	SAMN04262212	150,761,634	68%	11%
SRR2914359	SRX1428974	SRP066106	SAMN04262213	40,086,000	74%	17%
SRR2914360	SRX1429038	SRP066106	SAMN04262214	142,959,930	68%	14%
SRR2914366	SRX1429041	SRP066106	SAMN04262215	31,073,286	68%	11%
SRR2914368	SRX1429045	SRP066106	SAMN04262216	176,946,568	68%	11%
SRR2914369	SRX1429046	SRP066106	SAMN04262217	54,785,630	71%	25%
SRR2914370	SRX1429047	SRP066106	SAMN04262218	185,490,836	73%	25%
SRR2914371	SRX1429053	SRP066106	SAMN04262219	196,945,508	68%	11%
SRR2914372	SRX1429054	SRP066106	SAMN04262220	192,961,826	69%	9%
SRR6453208	SRX3544101	SRP128545	SAMN08330235	50,335,776	89%	28%
SRR6453211	SRX3544098	SRP128545	SAMN08330236	43,896,384	89%	29%
SRR6453212	SRX3544097	SRP128545	SAMN08330237	48,430,676	89%	19%
SRR6453209	SRX3544100	SRP128545	SAMN08330238	41,375,938	90%	31%
SRR6453210	SRX3544099	SRP128545	SAMN08330239	38,228,280	89%	27%
SRR6453215	SRX3544094	SRP128545	SAMN08330240	39,266,508	72%	29%
SRR6453216	SRX3544093	SRP128545	SAMN08330241	37,462,800	91%	29%
SRR6453213	SRX3544096	SRP128545	SAMN08330242	48,483,858	90%	28%
SRR6453214	SRX3544095	SRP128545	SAMN08330243	45,541,344	88%	28%
SRR7735102	SRX4591459	SRP158571	SAMN09878720	66,866,326	79%	42%
SRR7735101	SRX4591460	SRP158571	SAMN09878720	151,586,280	94%	29%
SRR11148704	SRX7784834	SRP250411	SAMN14167039	7,634,614	94%	26%
SRR11148703	SRX7784833	SRP250411	SAMN14167040	7,116,895	95%	26%
SRR11148702	SRX7784832	SRP250411	SAMN14167041	10,740,278	94%	27%
SRR11148701	SRX7784831	SRP250411	SAMN14167042	7,803,021	94%	27%
SRR11148700	SRX7784830	SRP250411	SAMN14167043	11,199,644	94%	26%
SRR11148699	SRX7784829	SRP250411	SAMN14167044	7,135,544	94%	26%
SRR11148698	SRX7784828	SRP250411	SAMN14167045	7,097,450	95%	28%
SRR11148697	SRX7784827	SRP250411	SAMN14167046	5,849,864	86%	26%
SRR11148696	SRX7784826	SRP250411	SAMN14167047	4,403,615	94%	26%
SRR11148695	SRX7784825	SRP250411	SAMN14167048	7,523,634	94%	26%
SRR11148694	SRX7784824	SRP250411	SAMN14167049	5,459,888	93%	26%
SRR11148693	SRX7784823	SRP250411	SAMN14167050	6,630,556	91%	27%
SRR11148692	SRX7784822	SRP250411	SAMN14167051	6,516,022	94%	34%
SRR11148691	SRX7784821	SRP250411	SAMN14167052	7,380,586	94%	26%
SRR11148690	SRX7784820	SRP250411	SAMN14167053	7,990,522	94%	25%
SRR11148689	SRX7784819	SRP250411	SAMN14167054	11,355,745	95%	26%
SRR11148688	SRX7784818	SRP250411	SAMN14167055	4,041,837	91%	25%
SRR11148723	SRX7784790	SRP250411	SAMN14167056	7,530,714	94%	26%
SRR11148722	SRX7784789	SRP250411	SAMN14167057	7,140,204	94%	27%
SRR11148721	SRX7784788	SRP250411	SAMN14167058	3,398,683	94%	25%
SRR11148720	SRX7784787	SRP250411	SAMN14167059	13,064,601	94%	26%
SRR11148719	SRX7784786	SRP250411	SAMN14167060	17,296,763	95%	26%
SRR11148718	SRX7784785	SRP250411	SAMN14167061	4,329,348	95%	26%
SRR11148687	SRX7784817	SRP250411	SAMN14167062	4,704,038	94%	33%
SRR11148686	SRX7784816	SRP250411	SAMN14167063	9,444,078	94%	26%
SRR11148685	SRX7784815	SRP250411	SAMN14167064	4,621,074	93%	26%
SRR11148684	SRX7784814	SRP250411	SAMN14167065	7,061,997	95%	26%
SRR11148683	SRX7784813	SRP250411	SAMN14167066	9,379,717	90%	26%
SRR11148682	SRX7784812	SRP250411	SAMN14167067	5,063,297	95%	26%
SRR11148681	SRX7784811	SRP250411	SAMN14167068	6,241,345	94%	26%
SRR11148680	SRX7784810	SRP250411	SAMN14167069	11,530,911	95%	26%
SRR11148679	SRX7784809	SRP250411	SAMN14167070	5,924,925	95%	33%
SRR11148678	SRX7784808	SRP250411	SAMN14167071	8,719,378	93%	25%
SRR11148677	SRX7784807	SRP250411	SAMN14167072	11,777,527	94%	26%
SRR11148676	SRX7784806	SRP250411	SAMN14167073	6,288,821	93%	26%
SRR11148675	SRX7784805	SRP250411	SAMN14167074	10,343,899	94%	26%
SRR11148674	SRX7784804	SRP250411	SAMN14167075	9,833,847	92%	34%
SRR11148673	SRX7784803	SRP250411	SAMN14167076	8,127,664	93%	25%
SRR11148672	SRX7784802	SRP250411	SAMN14167077	10,660,846	94%	26%
SRR11148671	SRX7784801	SRP250411	SAMN14167078	11,280,747	94%	26%
SRR11148670	SRX7784800	SRP250411	SAMN14167079	5,238,476	93%	26%
SRR11148669	SRX7784799	SRP250411	SAMN14167080	7,296,409	94%	25%
SRR11148668	SRX7784798	SRP250411	SAMN14167081	14,498,367	94%	25%
SRR11148667	SRX7784797	SRP250411	SAMN14167082	3,576,034	93%	25%
SRR11148666	SRX7784796	SRP250411	SAMN14167083	12,416,487	94%	26%
SRR11148665	SRX7784795	SRP250411	SAMN14167084	7,350,760	94%	27%
SRR11148664	SRX7784794	SRP250411	SAMN14167085	6,341,881	94%	25%
SRR11148663	SRX7784793	SRP250411	SAMN14167086	10,579,709	94%	26%
SRR11148662	SRX7784792	SRP250411	SAMN14167087	6,408,347	93%	34%
SRR11148661	SRX7784791	SRP250411	SAMN14167088	6,810,425	95%	26%
SRR11148660	SRX7784775	SRP250411	SAMN14167089	7,243,442	94%	26%
SRR11148659	SRX7784774	SRP250411	SAMN14167090	11,580,083	95%	26%
SRR11148658	SRX7784773	SRP250411	SAMN14167091	5,053,393	94%	25%
SRR11148717	SRX7784784	SRP250411	SAMN14167092	6,358,649	93%	27%
SRR11148716	SRX7784783	SRP250411	SAMN14167093	14,764,931	94%	26%
SRR11148715	SRX7784782	SRP250411	SAMN14167094	6,549,405	91%	34%
SRR11148714	SRX7784781	SRP250411	SAMN14167095	8,872,549	94%	27%
SRR11148713	SRX7784780	SRP250411	SAMN14167096	8,224,049	94%	26%
SRR11148712	SRX7784779	SRP250411	SAMN14167097	8,487,955	94%	34%
SRR11148711	SRX7784778	SRP250411	SAMN14167098	7,650,759	94%	26%
SRR11148710	SRX7784777	SRP250411	SAMN14167099	8,387,669	93%	27%
SRR11148709	SRX7784776	SRP250411	SAMN14167100	5,168,960	95%	27%
SRR11148708	SRX7784838	SRP250411	SAMN14167101	5,957,208	94%	26%
SRR11148707	SRX7784837	SRP250411	SAMN14167102	7,456,814	95%	34%
SRR11148706	SRX7784836	SRP250411	SAMN14167103	4,617,957	93%	26%
SRR11148705	SRX7784835	SRP250411	SAMN14167104	6,454,743	94%	26%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Myotis brandtii high-quality model RefSeq (XP_)	12,327	11,683 (94.78%)	11,683 (94.78%)	77.70%	87.84%
Eptesicus fuscus high-quality model RefSeq (XP_)	14,723	13,774 (93.55%)	13,774 (93.55%)	77.83%	87.76%
Chiroptera GenBank	2,026	1,757 (86.72%)	1,757 (86.72%)	78.35%	90.47%
Chiroptera known RefSeq (NP_)	60	53 (88.33%)	53 (88.33%)	81.75%	92.78%
Pteropus alecto high-quality model RefSeq (XP_)	11,044	10,397 (94.14%)	10,397 (94.14%)	82.28%	90.63%
Same-species GenBank	16	11 (68.75%)	11 (68.75%)	81.42%	92.33%
Same-species high-quality model RefSeq (XP_)	14,784	13,509 (91.38%)	13,509 (91.38%)	83.91%	90.23%
Same-species known RefSeq (NP_)	2	2 (100.00%)	2 (100.00%)	74.80%	85.80%
Homo sapiens known RefSeq (NP_)	59,721	44,387 (74.32%)	44,387 (74.32%)	78.31%	85.66%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

First Pass	Total
mRouAeg1.p (Current) Coverage: 98.76%	mRouAeg1.p (Current) Coverage: 98.95%
Raegyp2.0 (Previous) Coverage: 96.55%	Raegyp2.0 (Previous) Coverage: 98.38%
Percent Identity: 98.90%	Percent Identity: 98.82%

Comparison of the current and previous annotations

The annotation produced for this release (101) was compared to the annotation in the previous release (100) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	mRouAeg1.p (Current) to Raegyp2.0 (Previous)
Identical	6%
Minor changes	44%
Major changes	9%
New	40%
Deprecated	8%
Other	2%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 2:134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20

RefSeq

Integrated reference sequences