NCBI Homo sapiens Updated Annotation Release 109.20190905

The RefSeq genome records for Homo sapiens were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies.

Updated Annotation Release 109.20190905 is an update of NCBI Homo sapiens Annotation Release 109. The known RefSeq transcripts (with NM_ and NR_ prefixes) that were current on Sep 5 2019 were placed on the genome and used to update the annotated features. In addition, model RefSeq records predicted in the last full annotation (Annotation Release 109) that were still current on Sep 5 2019 were included in the updated annotation. These models were not recalculated for this update. For more information on the evidence used to generate the model RefSeq records, please consult the report for NCBI Homo sapiens Annotation Release 109.

The annotation products are available in the sequence databases and on the FTP site.

This report provides information on the annotation release, the assemblies annotated, and gene and feature statistics.

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.


Annotation Release information

This annotation should be referred to as NCBI Homo sapiens Updated Annotation Release 109.20190905.

Annotation release ID: 109.20190905
Date of Entrez queries for transcripts and proteins: Sep 5 2019
Date of submission of annotation to the public databases: Sep 9 2019
Software version: 8.2
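
The release ID above appears to combine the base annotation release (109) with the date of the Entrez queries (Sep 5 2019). The following is a minimal sketch of splitting such an ID, assuming the layout is `<base release>.<YYYYMMDD>`; that layout is inferred from this report rather than taken from a published specification, and `parse_release_id` is a hypothetical helper name.

```python
from datetime import date, datetime
from typing import Tuple


def parse_release_id(release_id: str) -> Tuple[str, date]:
    """Split an updated-release ID into (base release, date).

    Assumes the ID looks like '<base>.<YYYYMMDD>', an inference from
    this report and not a documented NCBI format.
    """
    base, sep, stamp = release_id.partition(".")
    if not base or not sep or not stamp:
        raise ValueError(f"unexpected release ID: {release_id!r}")
    # strptime raises ValueError if the suffix is not a valid date.
    return base, datetime.strptime(stamp, "%Y%m%d").date()


base_release, query_date = parse_release_id("109.20190905")
print(base_release, query_date.isoformat())
```

For this release the parsed date agrees with the date of the Entrez queries listed above.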

Assemblies

The following assemblies were included in this annotation run:
Assembly name  Assembly accession  Submitter                    Assembly date  Reference/Alternate  Assembly content
GRCh38.p13     GCF_000001405.39    Genome Reference Consortium  02-28-2019     Reference            25 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature                                         GRCh38.p13        GRCh38.p13    GRCh38.p13  GRCh38.p13
                                                            Primary Assembly  All Alt Loci     PATCHES
Genes and pseudogenes                               54,639            54,312         2,398       1,529
  protein-coding                                    19,844            19,755           832         598
  non-coding                                        18,012            17,860           676         386
  Transcribed pseudogenes                            1,135             1,127            90          80
  Non-transcribed pseudogenes                       15,204            15,137           631         431
  genes with variants                               19,965            19,886           606         380
  Immunoglobulin/T-cell receptor gene segments         397               388           161          22
  other                                                 47                45             8          12
  placed on multiple assembly-units                  3,433                na           658          na
mRNAs                                              112,430           112,169         1,917       1,382
  fully-supported                                  112,283           112,038         1,906       1,377
  with > 5% ab initio                                   88                78             7           3
  partial                                              263               126            71          76
  with filled gap(s)                                     0                 0             0           0
  placed on multiple assembly-units                  2,901                na           638          na
  known RefSeq (NM_)                                54,069            53,981         1,781       1,345
  model RefSeq (XM_)                                58,361            58,188           136          37
non-coding RNAs                                     47,041            45,167         1,820         758
  fully-supported                                   45,168            43,775         1,519         730
  with > 5% ab initio                                    0                 0             0           0
  partial                                               74                 5            63           9
  with filled gap(s)                                     0                 0             0           0
  placed on multiple assembly-units                    695                na           183          na
  known RefSeq (NR_)                                15,319            15,310           506         359
  model RefSeq (XR_)                                29,863            28,479         1,013         371
pseudo transcripts                                   1,442             1,426           109          90
  fully-supported                                    1,431             1,417           107          90
  with > 5% ab initio                                    0                 0             0           0
  partial                                                -                 2            19           6
  with filled gap(s)                                     0                 0             0           0
  placed on multiple assembly-units                     na                na            na          na
  known RefSeq (NR_)                                 1,334             1,329           101          87
  model RefSeq (XR_)                                   108                97             8           3
CDSs                                               113,017           112,558         2,077       1,390
  fully-supported                                  112,283           112,038         1,906       1,377
  with > 5% ab initio                                  118               104            10           4
  partial                                              514               348           364         161
  with major correction(s)                              69                66            61          57
  known RefSeq (NP_)                                54,069            53,981         1,778       1,331
  model RefSeq (XP_)                                58,374            58,188           136          37
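
As a quick consistency check on the feature counts, the mRNA totals can be compared with the sum of the known RefSeq (NM_) and model RefSeq (XM_) rows in each column. The sketch below assumes that every annotated mRNA is either a known or a model RefSeq, which the table suggests but the report does not state outright; the function name `mismatched_columns` is hypothetical, and the values are copied from the mRNA rows of the table.

```python
# Column order follows the feature counts table.
COLUMNS = ("GRCh38.p13", "Primary Assembly", "All Alt Loci", "PATCHES")

# Values copied from the mRNA rows of the feature counts table.
mrnas = (112_430, 112_169, 1_917, 1_382)
known_nm = (54_069, 53_981, 1_781, 1_345)
model_xm = (58_361, 58_188, 136, 37)


def mismatched_columns(totals, *parts):
    """Return the names of columns where the parts do not sum to the total."""
    bad = []
    for i, name in enumerate(COLUMNS):
        if sum(part[i] for part in parts) != totals[i]:
            bad.append(name)
    return bad


# An empty list means NM_ + XM_ equals the mRNA count in every column.
print(mismatched_columns(mrnas, known_nm, model_xm))
```

The same check can be applied to other row groups, although sub-rows such as "partial" or "fully-supported" are overlapping categories and are not expected to add up to the group total.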

Detailed reports

The counts below do not include pseudogenes.
