NCBI Jatropha curcas Annotation Release 102

The RefSeq genome records for Jatropha curcas were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Jatropha curcas Annotation Release 102

Annotation release ID: 102
Date of Entrez queries for transcripts and proteins: Nov 6 2020
Date of submission of annotation to the public databases: Nov 16 2020
Software version: 8.5

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
RJC1_Hi-C	GCF_014843425.1	Reliance Industries Ltd.	10-05-2020	Reference	1 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	RJC1_Hi-C
Genes and pseudogenes	22,718
protein-coding	19,420
non-coding	2,518
transcribed pseudogenes	0
non-transcribed pseudogenes	780
genes with variants	5,874
immunoglobulin/T-cell receptor gene segments	0
other	0
mRNAs	29,502
fully-supported	27,480
with > 5% ab initio	1,406
partial	427
with filled gap(s)	272
known RefSeq (NM_)	194
model RefSeq (XM_)	29,308
non-coding RNAs	4,498
fully-supported	3,560
with > 5% ab initio	0
partial	15
with filled gap(s)	12
known RefSeq (NR_)	0
model RefSeq (XR_)	4,049
pseudo transcripts	0
fully-supported	0
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	0
CDSs	29,586
fully-supported	27,480
with > 5% ab initio	1,451
partial	397
with major correction(s)	755
known RefSeq (NP_)	194
model RefSeq (XP_)	29,392

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	21,938	3,975	2,862	63	82,165
All transcripts	34,000	1,851	1,649	63	17,070
mRNA	29,502	1,945	1,716	161	17,070
misc_RNA	1,693	2,159	1,930	124	13,502
tRNA	441	75	73	69	93
lncRNA	1,884	954	721	88	6,841
snoRNA	379	105	107	63	220
snRNA	67	154	154	86	201
rRNA	34	455	119	103	3,390
Single-exon transcripts	2,836	1,326	1,146	161	4,896
coding transcripts (NM_/XM_ )	2,830	1,327	1,146	161	4,896
non-coding transcripts (NR_/XR_ )	6	949	1,338	183	1,966
CDSs	29,586	1,433	1,203	90	16,413
Exons	135,485	311	169	1	7,932
in coding transcripts (NM_/XM_ )	128,407	309	166	1	7,932
in non-coding transcripts (NR_/XR_ )	13,221	284	161	2	5,131
Introns	108,533	541	205	28	74,265
in coding transcripts (NM_/XM_ )	104,206	525	203	28	74,265
in non-coding transcripts (NR_/XR_ )	10,319	699	249	30	44,821

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	1.57	1	1	26
Number of exons per transcript	6.66	5	1	79

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the Arabidopsis thaliana known RefSeq proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 19336 coding genes, 18089 genes had a protein with an alignment covering 50% or more of the query and 8893 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: Arabidopsis thaliana known RefSeq proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly. RepeatMasker results are only used for organisms for which a comprehensive repeat library is available.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with RepeatMasker	% Masked with WindowMasker
RJC1_Hi-C	GCF_014843425.1	24.72%	32.38%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Depending on the other evidence available, long 454 reads (with average length above 250 nt) may be aligned as traditional evidence and reported in the Transcript alignments section or aligned with RNA-Seq reads and reported in the RNA-Seq alignments section.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	216	203 (93.98%)	186 (86.11%)	99.74%	99.59%
Same-species Genbank	604	559 (92.55%)	498 (82.45%)	99.55%	98.90%
Same-species EST	46,859	39,418 (84.12%)	32,402 (69.15%)	99.39%	99.04%

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Hide alignments statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	5,581,161,492	87%	28%	132,684
SAMEA2342024	23075845	TropiTree_RNA_PE_Jatropha curcas (Jatropha curcas, SAMEA2342024)	5,271,080	9%	24%	36,178
SAMN00003846	NA	developing seed (Jatropha curcas, SAMN00003846)	195,692	38%	43%	16,285
SAMN00188808	21492485	roots, mature leaves, flowers, developing seeds, embryos (Jatropha curcas, SAMN00188808)	383,937	59%	50%	63,761
SAMN02350404	NA	immature seeds (Jatropha curcas, SAMN02350404)	16,653,188	82%	24%	85,638
SAMN02350405	NA	intermediate seeds (Jatropha curcas, SAMN02350405)	43,328,830	77%	27%	102,395
SAMN02350406	NA	mature seeds (Jatropha curcas, SAMN02350406)	35,062,185	72%	26%	95,623
SAMN02356859	NA	Sample for Jatropha curcas 454 sequencing (Jatropha curcas, SAMN02356859)	1,714,433	77%	58%	91,691
SAMN02905749	NA	Leaf, resistant control (Jatropha curcas, SAMN02905749)	90,560,792	81%	25%	105,818
SAMN02905750	NA	Leaf, resistant induced (Jatropha curcas, SAMN02905750)	74,923,274	80%	24%	103,412
SAMN02905751	NA	Leaf, susceptible control (Jatropha curcas, SAMN02905751)	102,338,758	83%	25%	108,615
SAMN02905752	NA	Leaf, susceptible induced (Jatropha curcas, SAMN02905752)	76,993,984	79%	24%	104,979
SAMN03145040	NA	Leaf; Root; flower (Jatropha curcas, SAMN03145040)	167,085,070	84%	25%	121,654
SAMN03152301	25400171	Inflorescences (Jatropha curcas, SAMN03152301)	703,755	72%	53%	54,176
SAMN03486846	NA	leaf (Jatropha curcas, seedling, SAMN03486846)	54,987,425	167%	22%	107,139
SAMN05827448	NA	Shoot (Jatropha curcas, 2 years, SAMN05827448)	58,434,958	87%	29%	106,224
SAMN05827449	NA	Inflorescences (Jatropha curcas, 2 years, SAMN05827449)	46,322,163	86%	30%	108,467
SAMN05827450	NA	Shoot (Jatropha curcas, 2 years, SAMN05827450)	51,203,205	87%	29%	103,885
SAMN05827451	NA	Inflorescences (Jatropha curcas, 2 years, SAMN05827451)	56,062,944	86%	30%	110,326
SAMN05827452	NA	Shoot (Jatropha curcas, 2 years, SAMN05827452)	50,863,218	87%	28%	105,111
SAMN05827453	NA	Inflorescences (Jatropha curcas, 2 years, SAMN05827453)	66,298,912	86%	29%	110,925
SAMN05827454	NA	Shoot (Jatropha curcas, 2 years, SAMN05827454)	42,699,070	87%	29%	104,067
SAMN05827455	NA	Shoot (Jatropha curcas, 2 years, SAMN05827455)	47,423,637	87%	29%	103,592
SAMN05827456	NA	Shoot (Jatropha curcas, 2 years, SAMN05827456)	50,405,875	87%	28%	103,382
SAMN05827457	NA	Inflorescences (Jatropha curcas, 2 years, SAMN05827457)	58,202,433	87%	28%	104,519
SAMN05827458	NA	Shoot (Jatropha curcas, 2 years, SAMN05827458)	46,218,305	87%	27%	103,789
SAMN05827459	NA	Shoot (Jatropha curcas, 2 years, SAMN05827459)	57,567,401	85%	29%	103,045
SAMN05827460	NA	Shoot (Jatropha curcas, 2 years, SAMN05827460)	48,485,694	87%	28%	102,343
SAMN05827461	NA	Inflorescences (Jatropha curcas, 2 years, SAMN05827461)	51,981,297	87%	27%	106,522
SAMN05827462	NA	Shoot (Jatropha curcas, 2 years, SAMN05827462)	45,134,569	86%	29%	104,969
SAMN05827463	NA	Shoot (Jatropha curcas, 2 years, SAMN05827463)	42,352,532	87%	29%	103,043
SAMN05827464	NA	Shoot (Jatropha curcas, 2 years, SAMN05827464)	54,573,642	87%	30%	106,062
SAMN05827465	NA	Inflorescences (Jatropha curcas, 2 years, SAMN05827465)	44,640,267	86%	29%	108,838
SAMN05949192	NA	flower inflorescence buds, monoecious, 3-4d (Jatropha curcas, Two years, SAMN05949192)	49,065,896	85%	31%	110,337
SAMN05949193	NA	flower inflorescence buds, monoecious, 3-4d (Jatropha curcas, Two years, SAMN05949193)	49,392,394	85%	29%	108,551
SAMN05949194	NA	flower inflorescence buds, monoecious, 3-4d (Jatropha curcas, Two years, SAMN05949194)	49,668,810	85%	31%	110,060
SAMN05949195	NA	flower inflorescence buds, monoecious, 8-9d (Jatropha curcas, Two years, SAMN05949195)	39,256,618	84%	30%	108,237
SAMN05949196	NA	flower inflorescence buds, monoecious, 8-9d (Jatropha curcas, Two years, SAMN05949196)	54,242,280	85%	30%	111,621
SAMN05949197	NA	flower inflorescence buds, monoecious, 8-9d (Jatropha curcas, Two years, SAMN05949197)	42,788,554	84%	29%	107,598
SAMN05949198	NA	flower inflorescence buds, gynoecious, 3-4d (Jatropha curcas, Two years, SAMN05949198)	51,850,822	82%	31%	109,637
SAMN05949199	NA	flower inflorescence buds, gynoecious, 3-4d (Jatropha curcas, Two years, SAMN05949199)	48,715,656	84%	31%	108,957
SAMN05949200	NA	flower inflorescence buds, gynoecious, 3-4d (Jatropha curcas, Two years, SAMN05949200)	62,050,746	84%	31%	111,411
SAMN05949201	NA	flower inflorescence buds, gynoecious, 8-9d (Jatropha curcas, Two years, SAMN05949201)	53,003,790	79%	29%	105,957
SAMN05949202	NA	flower inflorescence buds, gynoecious, 8-9d (Jatropha curcas, Two years, SAMN05949202)	53,585,986	82%	30%	108,487
SAMN05949203	NA	flower inflorescence buds, gynoecious, 8-9d (Jatropha curcas, Two years, SAMN05949203)	51,252,702	84%	31%	112,618
SAMN07487333	NA	female flower buds (Jatropha curcas, 15, SAMN07487333)	40,754,540	168%	30%	116,295
SAMN07487340	NA	female flower buds (Jatropha curcas, 15, SAMN07487340)	40,352,870	168%	30%	115,576
SAMN07487341	NA	female flower buds (Jatropha curcas, 15, SAMN07487341)	55,060,729	167%	30%	117,660
SAMN07487343	NA	male flower buds (Jatropha curcas, 15, SAMN07487343)	61,982,614	148%	30%	120,188
SAMN07487350	NA	male flower buds (Jatropha curcas, 15, SAMN07487350)	56,410,894	166%	30%	119,870
SAMN07487351	NA	male flower buds (Jatropha curcas, 15, SAMN07487351)	58,294,754	147%	30%	119,536
SAMN07525082	29180629	STD1 (Jatropha curcas, three years, SAMN07525082)	49,212,964	72%	31%	107,289
SAMN07525083	29180629	IND (Jatropha curcas, three years, SAMN07525083)	53,085,642	78%	31%	109,623
SAMN07525084	29180629	PID2 (Jatropha curcas, three years, SAMN07525084)	51,253,614	75%	33%	111,993
SAMN07525085	29180629	PID1 (Jatropha curcas, three years, SAMN07525085)	47,233,024	73%	31%	106,809
SAMN07525086	29180629	STD2 (Jatropha curcas, three years, SAMN07525086)	57,115,000	75%	34%	112,088
SAMN07527279	30059608	leaf (Jatropha curcas, SAMN07527279)	54,483,884	63%	26%	105,612
SAMN07527280	30059608	male flower (Jatropha curcas, SAMN07527280)	138,851,246	86%	25%	112,142
SAMN07527281	30059608	female flower (Jatropha curcas, SAMN07527281)	129,919,100	86%	24%	113,712
SAMN07527282	30059608	seed endosperm (Jatropha curcas, SAMN07527282)	102,336,044	87%	25%	110,216
SAMN07527283	30059608	seed endosperm (Jatropha curcas, SAMN07527283)	140,623,586	86%	25%	113,562
SAMN07527284	30059608	seed endosperm (Jatropha curcas, SAMN07527284)	141,262,294	87%	26%	105,313
SAMN07527285	30059608	seed endosperm (Jatropha curcas, SAMN07527285)	160,089,596	85%	23%	104,294
SAMN07527286	30059608	root (Jatropha curcas, SAMN07527286)	47,286,950	82%	26%	110,276
SAMN07527287	30059608	stem (Jatropha curcas, SAMN07527287)	40,867,354	67%	20%	90,713
SAMN07527288	30059608	leaf (Jatropha curcas, SAMN07527288)	40,398,958	81%	23%	96,180
SAMN07828338	NA	Shoot tip (Jatropha curcas, Two years, SAMN07828338)	54,508,900	73%	28%	100,710
SAMN07828339	NA	Shoot tip (Jatropha curcas, Two years, SAMN07828339)	52,050,062	81%	29%	100,471
SAMN07828340	NA	Shoot tip (Jatropha curcas, Two years, SAMN07828340)	57,123,682	82%	28%	103,853
SAMN07828341	NA	Inflorescence bud (Jatropha curcas, Two years, SAMN07828341)	45,478,684	81%	30%	102,426
SAMN07828342	NA	Inflorescence bud (Jatropha curcas, Two years, SAMN07828342)	48,242,130	83%	30%	106,269
SAMN07828343	NA	Inflorescence bud (Jatropha curcas, Two years, SAMN07828343)	52,289,596	82%	29%	102,589
SAMN07828344	NA	Shoot tip (Jatropha curcas, Two years, SAMN07828344)	43,555,958	85%	31%	102,340
SAMN07828345	NA	Shoot tip (Jatropha curcas, Two years, SAMN07828345)	48,818,744	81%	30%	101,301
SAMN07828346	NA	Shoot tip (Jatropha curcas, Two years, SAMN07828346)	41,349,678	85%	30%	100,624
SAMN07828347	NA	Inflorescence bud (Jatropha curcas, Two years, SAMN07828347)	50,630,820	84%	32%	105,455
SAMN07828348	NA	Inflorescence bud (Jatropha curcas, Two years, SAMN07828348)	46,468,732	84%	31%	104,084
SAMN07828349	NA	Inflorescence bud (Jatropha curcas, Two years, SAMN07828349)	47,435,920	85%	30%	104,269
SAMN07829362	NA	Hypocotyl (Jatropha curcas, SAMN07829362)	43,663,102	66%	17%	95,024
SAMN07829363	NA	Hypocotyl (Jatropha curcas, SAMN07829363)	52,058,852	40%	18%	96,182
SAMN08364588	NA	Reproductive stage, flower buds (Jatropha curcas, 6 years, SAMN08364588)	56,802,230	83%	22%	110,633
SAMN08364589	NA	flower buds (Jatropha curcas, 6 years, pooled male and female, SAMN08364589)	61,425,906	80%	21%	110,907
SAMN08364805	NA	flower buds (Jatropha curcas, 6 years, pooled male and female, SAMN08364805)	65,917,526	68%	20%	108,061
SAMN09842629	32272887	whole seed (Jatropha curcas, SAMN09842629)	875,172	56%	63%	50,152
SAMN12696844	NA	axillary buds (Jatropha curcas, 12 days, SAMN12696844)	53,602,124	85%	32%	105,654
SAMN12696845	NA	axillary buds (Jatropha curcas, 12 days, SAMN12696845)	53,567,318	85%	33%	106,191
SAMN12696846	NA	axillary buds (Jatropha curcas, 12 days, SAMN12696846)	44,641,280	85%	31%	103,340
SAMN12696847	NA	axillary buds (Jatropha curcas, 12 days, SAMN12696847)	61,059,618	85%	33%	106,286
SAMN12696848	NA	axillary buds (Jatropha curcas, 12 days, SAMN12696848)	67,107,336	86%	33%	107,672
SAMN12696849	NA	axillary buds (Jatropha curcas, 12 days, SAMN12696849)	69,823,506	86%	33%	106,907
SAMN12696850	NA	axillary buds (Jatropha curcas, 14 days, SAMN12696850)	51,218,054	84%	30%	105,700
SAMN12696851	NA	axillary buds (Jatropha curcas, 14 days, SAMN12696851)	42,575,110	84%	28%	103,173
SAMN12696852	NA	axillary buds (Jatropha curcas, 14 days, SAMN12696852)	59,136,358	84%	29%	108,159
SAMN12696853	NA	leaves (Jatropha curcas, 14 days, SAMN12696853)	43,649,410	73%	38%	103,765
SAMN12696854	NA	leaves (Jatropha curcas, 14 days, SAMN12696854)	38,506,288	73%	36%	103,127
SAMN12696855	NA	leaves (Jatropha curcas, 14 days, SAMN12696855)	44,193,754	79%	27%	93,145
SAMN12696856	NA	axillary buds (Jatropha curcas, 14 days, SAMN12696856)	50,674,750	84%	31%	106,594
SAMN12696857	NA	axillary buds (Jatropha curcas, 14 days, SAMN12696857)	46,303,158	84%	30%	105,872
SAMN12696858	NA	axillary buds (Jatropha curcas, 14 days, SAMN12696858)	48,418,138	84%	31%	106,082
SAMN12696859	NA	leaves (Jatropha curcas, 14 days, SAMN12696859)	50,549,540	80%	37%	105,017
SAMN12696860	NA	leaves (Jatropha curcas, 14 days, SAMN12696860)	62,251,258	79%	34%	105,126
SAMN12696861	NA	leaves (Jatropha curcas, 14 days, SAMN12696861)	64,366,362	79%	31%	100,445

Show alignments statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
ERR420522	ERX386815	ERP004701	SAMEA2342024	5,271,080	9%	24%
SRR027577	SRX011411	SRP001241	SAMN00003846	195,692	38%	43%
SRR087417	SRX035761	SRP004898	SAMN00188808	383,937	59%	50%
SRR972445	SRX346819	SRP029638	SAMN02350404	16,653,188	82%	24%
SRR972446	SRX346820	SRP029638	SAMN02350405	43,328,830	77%	27%
SRR972447	SRX346821	SRP029638	SAMN02350406	35,062,185	72%	26%
SRR998547	SRX352166	SRP029977	SAMN02356859	1,621,838	77%	59%
SRR998927	SRX352166	SRP029977	SAMN02356859	92,595	81%	48%
SRR1539206	SRX672142	SRP044808	SAMN02905749	90,560,792	81%	25%
SRR1560724	SRX689084	SRP044808	SAMN02905750	74,923,274	80%	24%
SRR1560722	SRX689082	SRP044808	SAMN02905751	102,338,758	83%	25%
SRR1560723	SRX689083	SRP044808	SAMN02905752	76,993,984	79%	24%
SRR1635040	SRX744542	SRP049319	SAMN03145040	85,386,014	84%	24%
SRR1635045	SRX747345	SRP049319	SAMN03145040	81,699,056	85%	26%
SRR1663442	SRX763342	SRP050030	SAMN03152301	182,722	78%	57%
SRR1663443	SRX768574	SRP050030	SAMN03152301	172,682	83%	50%
SRR1663444	SRX768575	SRP050030	SAMN03152301	181,699	53%	48%
SRR1663445	SRX768576	SRP050030	SAMN03152301	166,652	77%	55%
SRR2102905	SRX1097498	SRP057220	SAMN03486846	28,493,022	166%	21%
SRR2101829	SRX997124	SRP057220	SAMN03486846	26,494,403	168%	23%
SRR4308595	SRX2200465	SRP090662	SAMN05827448	58,434,958	87%	29%
SRR4426289	SRX2248218	SRP090662	SAMN05827449	46,322,163	86%	30%
SRR4426290	SRX2248219	SRP090662	SAMN05827450	51,203,205	87%	29%
SRR4426295	SRX2248224	SRP090662	SAMN05827451	56,062,944	86%	30%
SRR4426296	SRX2248225	SRP090662	SAMN05827452	50,863,218	87%	28%
SRR4426312	SRX2248241	SRP090662	SAMN05827453	66,298,912	86%	29%
SRR4426313	SRX2248242	SRP090662	SAMN05827454	42,699,070	87%	29%
SRR4426314	SRX2248243	SRP090662	SAMN05827455	47,423,637	87%	29%
SRR4449122	SRX2248244	SRP090662	SAMN05827456	50,405,875	87%	28%
SRR4446116	SRX2248245	SRP090662	SAMN05827457	58,202,433	87%	28%
SRR4449125	SRX2267588	SRP090662	SAMN05827458	46,218,305	87%	27%
SRR4449140	SRX2267590	SRP090662	SAMN05827459	57,567,401	85%	29%
SRR4449144	SRX2267604	SRP090662	SAMN05827460	48,485,694	87%	28%
SRR4449147	SRX2267606	SRP090662	SAMN05827461	51,981,297	87%	27%
SRR4449148	SRX2267607	SRP090662	SAMN05827462	45,134,569	86%	29%
SRR4449151	SRX2267609	SRP090662	SAMN05827463	42,352,532	87%	29%
SRR4449154	SRX2267612	SRP090662	SAMN05827464	54,573,642	87%	30%
SRR4449157	SRX2267614	SRP090662	SAMN05827465	44,640,267	86%	29%
SRR4473571	SRX2279490	SRP092157	SAMN05949192	49,065,896	85%	31%
SRR4473572	SRX2279491	SRP092157	SAMN05949193	49,392,394	85%	29%
SRR4473565	SRX2279484	SRP092157	SAMN05949194	49,668,810	85%	31%
SRR4473566	SRX2279485	SRP092157	SAMN05949195	39,256,618	84%	30%
SRR4473567	SRX2279486	SRP092157	SAMN05949196	54,242,280	85%	30%
SRR4473568	SRX2279487	SRP092157	SAMN05949197	42,788,554	84%	29%
SRR4473569	SRX2279488	SRP092157	SAMN05949198	51,850,822	82%	31%
SRR4473570	SRX2279489	SRP092157	SAMN05949199	48,715,656	84%	31%
SRR4473575	SRX2279494	SRP092157	SAMN05949200	62,050,746	84%	31%
SRR4473576	SRX2279495	SRP092157	SAMN05949201	53,003,790	79%	29%
SRR4473573	SRX2279492	SRP092157	SAMN05949202	53,585,986	82%	30%
SRR4473574	SRX2279493	SRP092157	SAMN05949203	51,252,702	84%	31%
SRR5921380	SRX3082190	SRP115141	SAMN07487333	13,523,982	168%	30%
SRR5921382	SRX3082190	SRP115141	SAMN07487333	12,690,050	168%	31%
SRR5921385	SRX3082190	SRP115141	SAMN07487333	14,540,508	167%	31%
SRR5921377	SRX3082191	SRP115141	SAMN07487340	13,884,512	167%	31%
SRR5921379	SRX3082191	SRP115141	SAMN07487340	11,902,048	168%	30%
SRR5921384	SRX3082191	SRP115141	SAMN07487340	14,566,310	167%	30%
SRR5921376	SRX3082192	SRP115141	SAMN07487341	14,191,078	169%	30%
SRR5921378	SRX3082192	SRP115141	SAMN07487341	14,458,067	167%	30%
SRR5921381	SRX3082192	SRP115141	SAMN07487341	14,591,690	164%	29%
SRR5921383	SRX3082192	SRP115141	SAMN07487341	11,819,894	166%	30%
SRR5921366	SRX3082194	SRP115141	SAMN07487343	17,425,813	169%	30%
SRR5921371	SRX3082194	SRP115141	SAMN07487343	16,022,462	166%	30%
SRR5921372	SRX3082194	SRP115141	SAMN07487343	13,126,586	84%	29%
SRR5921374	SRX3082194	SRP115141	SAMN07487343	15,407,753	157%	30%
SRR5921365	SRX3082193	SRP115141	SAMN07487350	14,436,570	167%	30%
SRR5921369	SRX3082193	SRP115141	SAMN07487350	13,570,278	169%	30%
SRR5921370	SRX3082193	SRP115141	SAMN07487350	12,837,809	165%	30%
SRR5921375	SRX3082193	SRP115141	SAMN07487350	15,566,237	165%	31%
SRR5921364	SRX3082195	SRP115141	SAMN07487351	14,099,904	85%	32%
SRR5921367	SRX3082195	SRP115141	SAMN07487351	14,585,237	170%	30%
SRR5921368	SRX3082195	SRP115141	SAMN07487351	14,406,989	165%	30%
SRR5921373	SRX3082195	SRP115141	SAMN07487351	15,202,624	166%	29%
SRR5952357	SRX3110816	SRP115917	SAMN07525082	49,212,964	72%	31%
SRR5952356	SRX3110815	SRP115917	SAMN07525083	53,085,642	78%	31%
SRR5952360	SRX3110819	SRP115917	SAMN07525084	51,253,614	75%	33%
SRR5952359	SRX3110818	SRP115917	SAMN07525085	47,233,024	73%	31%
SRR5952358	SRX3110817	SRP115917	SAMN07525086	57,115,000	75%	34%
SRR5974846	SRX3131425	SRP116161	SAMN07527279	54,483,884	63%	26%
SRR5974843	SRX3131428	SRP116161	SAMN07527280	138,851,246	86%	25%
SRR5974844	SRX3131427	SRP116161	SAMN07527281	129,919,100	86%	24%
SRR5974841	SRX3131430	SRP116161	SAMN07527282	102,336,044	87%	25%
SRR5974842	SRX3131429	SRP116161	SAMN07527283	140,623,586	86%	25%
SRR5974857	SRX3131414	SRP116161	SAMN07527284	141,262,294	87%	26%
SRR5974858	SRX3131413	SRP116161	SAMN07527285	160,089,596	85%	23%
SRR5974855	SRX3131416	SRP116161	SAMN07527286	47,286,950	82%	26%
SRR5974856	SRX3131415	SRP116161	SAMN07527287	40,867,354	67%	20%
SRR5974853	SRX3131418	SRP116161	SAMN07527288	40,398,958	81%	23%
SRR6227309	SRX3335908	SRP122257	SAMN07828338	54,508,900	73%	28%
SRR6227310	SRX3335907	SRP122257	SAMN07828339	52,050,062	81%	29%
SRR6227307	SRX3335910	SRP122257	SAMN07828340	57,123,682	82%	28%
SRR6227308	SRX3335909	SRP122257	SAMN07828341	45,478,684	81%	30%
SRR6227305	SRX3335912	SRP122257	SAMN07828342	48,242,130	83%	30%
SRR6227306	SRX3335911	SRP122257	SAMN07828343	52,289,596	82%	29%
SRR6227303	SRX3335914	SRP122257	SAMN07828344	43,555,958	85%	31%
SRR6227304	SRX3335913	SRP122257	SAMN07828345	48,818,744	81%	30%
SRR6227311	SRX3335906	SRP122257	SAMN07828346	41,349,678	85%	30%
SRR6227312	SRX3335905	SRP122257	SAMN07828347	50,630,820	84%	32%
SRR6227301	SRX3335916	SRP122257	SAMN07828348	46,468,732	84%	31%
SRR6227302	SRX3335915	SRP122257	SAMN07828349	47,435,920	85%	30%
SRR6287185	SRX3388562	SRP124918	SAMN07829362	43,663,102	66%	17%
SRR6287184	SRX3388563	SRP124918	SAMN07829363	52,058,852	40%	18%
SRR6473049	SRX3562933	SRP129513	SAMN08364588	56,802,230	83%	22%
SRR6473050	SRX3562935	SRP129513	SAMN08364589	61,425,906	80%	21%
SRR6473067	SRX3562952	SRP129513	SAMN08364805	65,917,526	68%	20%
SRR7701127	SRX4559398	SRP158102	SAMN09842629	875,172	56%	63%
SRR10076327	SRX6809698	SRP220547	SAMN12696844	53,602,124	85%	32%
SRR10076326	SRX6809699	SRP220547	SAMN12696845	53,567,318	85%	33%
SRR10076317	SRX6809708	SRP220547	SAMN12696846	44,641,280	85%	31%
SRR10076316	SRX6809709	SRP220547	SAMN12696847	61,059,618	85%	33%
SRR10076315	SRX6809710	SRP220547	SAMN12696848	67,107,336	86%	33%
SRR10076314	SRX6809711	SRP220547	SAMN12696849	69,823,506	86%	33%
SRR10076313	SRX6809712	SRP220547	SAMN12696850	51,218,054	84%	30%
SRR10076312	SRX6809713	SRP220547	SAMN12696851	42,575,110	84%	28%
SRR10076311	SRX6809714	SRP220547	SAMN12696852	59,136,358	84%	29%
SRR10076310	SRX6809715	SRP220547	SAMN12696853	43,649,410	73%	38%
SRR10076325	SRX6809700	SRP220547	SAMN12696854	38,506,288	73%	36%
SRR10076324	SRX6809701	SRP220547	SAMN12696855	44,193,754	79%	27%
SRR10076323	SRX6809702	SRP220547	SAMN12696856	50,674,750	84%	31%
SRR10076322	SRX6809703	SRP220547	SAMN12696857	46,303,158	84%	30%
SRR10076321	SRX6809704	SRP220547	SAMN12696858	48,418,138	84%	31%
SRR10076320	SRX6809705	SRP220547	SAMN12696859	50,549,540	80%	37%
SRR10076319	SRX6809706	SRP220547	SAMN12696860	62,251,258	79%	34%
SRR10076318	SRX6809707	SRP220547	SAMN12696861	64,366,362	79%	31%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species GenBank	445	230 (51.69%)	230 (51.69%)	79.65%	87.44%
Same-species known RefSeq (NP_)	216	212 (98.15%)	212 (98.15%)	80.32%	86.54%
Cucumis melo high-quality model RefSeq (XP_)	11,899	11,335 (95.26%)	11,335 (95.26%)	68.61%	76.42%
Cucumis melo known RefSeq (NP_)	117	109 (93.16%)	109 (93.16%)	69.27%	83.32%
Arabidopsis thaliana known RefSeq (NP_)	48,147	34,420 (71.49%)	34,420 (71.49%)	65.87%	69.15%
Glycine max high-quality model RefSeq (XP_)	23,025	21,514 (93.44%)	21,514 (93.44%)	67.82%	75.24%
Glycine max known RefSeq (NP_)	7,942	7,272 (91.56%)	7,272 (91.56%)	70.02%	76.95%
Eucalyptus grandis high-quality model RefSeq (XP_)	16,668	15,389 (92.33%)	15,389 (92.33%)	67.17%	75.15%
Eucalyptus grandis known RefSeq (NP_)	37	36 (97.30%)	36 (97.30%)	75.21%	82.01%
Populus euphratica high-quality model RefSeq (XP_)	18,422	17,021 (92.39%)	17,021 (92.39%)	70.56%	80.00%
Populus euphratica known RefSeq (NP_)	39	39 (100.00%)	39 (100.00%)	66.62%	78.94%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

First Pass	Total
RJC1_Hi-C (Current) Coverage: 75.19%	RJC1_Hi-C (Current) Coverage: 77.19%
JatCur_1.0 (Previous) Coverage: 76.15%	JatCur_1.0 (Previous) Coverage: 79.76%
Percent Identity: 97.86%	Percent Identity: 97.19%

Comparison of the current and previous annotations

The annotation produced for this release (102) was compared to the annotation in the previous release (101) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	RJC1_Hi-C (Current) to JatCur_1.0 (Previous)
Identical	8%
Minor changes	70%
Major changes	10%
New	12%
Deprecated	21%
Other	1%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 2:134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20

RefSeq

Integrated reference sequences