Identification of hydroxy fatty acid and triacylglycerol metabolism-related genes in lesquerella through seed transcriptome analysis

BMC Genomics. 2015 Mar 24;16(1):230. doi: 10.1186/s12864-015-1413-8.

Abstract

Background: Castor oil is the only commercial source of hydroxy fatty acid that has industrial value. The production of castor oil is hampered by the presence of the toxin ricin in its seed. Lesquerella seed also accumulates hydroxy fatty acid and is free of ricin, and thus it is being developed as a new crop for hydroxy fatty acid production. A high-throughput, large-scale sequencing of transcripts from developing lesquerella seeds was carried out by 454 pyrosequencing to generate a database for quality improvement of seed oil and other agronomic traits. Deep mining and characterization of acyl-lipid genes were conducted to uncover candidate genes for further studies of mechanisms underlying hydroxy fatty acid and seed oil synthesis.

Results: A total of 651 megabases of raw sequences from an mRNA sample of developing seeds was acquired. Bioinformatic analysis of these sequences revealed 59,914 transcripts representing 26,995 unique genes that include nearly all known seed expressed genes. Based on sequence similarity with known plant proteins, about 74% (19,861) genes matched with annotated coding genes. Among them, 95% (18,868) showed highest sequence homology with Arabidopsis genes, which will allow translation of genomics and genetics findings from Arabidopsis to lesquerella. Using Arabidopsis acyl-lipid genes as queries, we searched the transcriptome assembly and identified 615 lesquerella genes involved in all known pathways of acyl-lipid metabolism. Further deep mining the transcriptome assembly led to identification of almost all lesquerella genes involved in fatty acid and triacylglycerol synthesis. Moreover, we characterized the spatial and temporal expression profiles of 15 key genes using the quantitative PCR assay.

Conclusions: We have built a lesquerella seed transcriptome that provides a valuable reference in addition to the castor database for discovering genes involved in the synthesis of triacylglycerols enriched with hydroxy fatty acids. The information obtained from data mining and gene expression profiling will provide a resource not only for the study of hydroxy fatty acid metabolism, but also for the biotechnological production of hydroxy fatty acids in existing oilseed crops.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acetyltransferases / classification
  • Acetyltransferases / genetics
  • Acetyltransferases / metabolism
  • Amino Acid Sequence
  • Arabidopsis / genetics
  • Brassicaceae / genetics*
  • Databases, Genetic
  • Expressed Sequence Tags
  • Fatty Acid Desaturases / classification
  • Fatty Acid Desaturases / genetics
  • Fatty Acid Desaturases / metabolism
  • Fatty Acid Elongases
  • Fatty Acids / biosynthesis
  • Gene Expression Profiling
  • Genes, Plant*
  • Lipid Metabolism / genetics
  • Mixed Function Oxygenases / genetics
  • Mixed Function Oxygenases / metabolism
  • Molecular Sequence Data
  • Plant Proteins / classification
  • Plant Proteins / genetics
  • Plant Proteins / metabolism
  • Plastids / metabolism
  • Real-Time Polymerase Chain Reaction
  • Seeds / genetics
  • Sequence Alignment
  • Transcriptome*
  • Triglycerides / biosynthesis

Substances

  • Fatty Acids
  • Plant Proteins
  • Triglycerides
  • Mixed Function Oxygenases
  • Fatty Acid Desaturases
  • Acetyltransferases
  • Fatty Acid Elongases