Genome sequence of the model rice variety KitaakeX

BMC Genomics. 2019 Nov 27;20(1):905. doi: 10.1186/s12864-019-6262-4.

Abstract

Background: The availability of thousands of complete rice genome sequences from diverse varieties and accessions has laid the foundation for in-depth exploration of the rice genome. One drawback to these collections is that most of these rice varieties have long life cycles, and/or low transformation efficiencies, which limits their usefulness as model organisms for functional genomics studies. In contrast, the rice variety Kitaake has a rapid life cycle (9 weeks seed to seed) and is easy to transform and propagate. For these reasons, Kitaake has emerged as a model for studies of diverse monocotyledonous species.

Results: Here, we report the de novo genome sequencing and analysis of Oryza sativa ssp. japonica variety KitaakeX, a Kitaake plant carrying the rice XA21 immune receptor. Our KitaakeX sequence assembly contains 377.6 Mb, consisting of 33 scaffolds (476 contigs) with a contig N50 of 1.4 Mb. Complementing the assembly are detailed gene annotations of 35,594 protein coding genes. We identified 331,335 genomic variations between KitaakeX and Nipponbare (ssp. japonica), and 2,785,991 variations between KitaakeX and Zhenshan97 (ssp. indica). We also compared Kitaake resequencing reads to the KitaakeX assembly and identified 219 small variations. The high-quality genome of the model rice plant KitaakeX will accelerate rice functional genomics.

Conclusions: The high quality, de novo assembly of the KitaakeX genome will serve as a useful reference genome for rice and will accelerate functional genomics studies of rice and other species.

Keywords: De novo genome assembly; Kitaake; KitaakeX; Nipponbare; Rice; Whole genome sequence; XA21 immune receptor; Zhenshan97.

MeSH terms

  • Computational Biology / methods
  • Genetic Variation
  • Genome, Plant*
  • Genomics* / methods
  • Molecular Sequence Annotation
  • Oryza / classification
  • Oryza / genetics*
  • Phenotype
  • Whole Genome Sequencing*