De novo assembly, gene annotation, and marker discovery in stored-product pest Liposcelis entomophila (Enderlein) using transcriptome sequences

PLoS One. 2013 Nov 14;8(11):e80046. doi: 10.1371/journal.pone.0080046. eCollection 2013.

Abstract

Background: As a major stored-product pest insect, Liposcelis entomophila has developed high levels of resistance to various insecticides in grain storage systems. However, the molecular mechanisms underlying resistance and environmental stress have not been characterized. To date, there is a lack of genomic information for this species. Therefore, studies aimed at profiling the L. entomophila transcriptome would provide a better understanding of the biological functions at the molecular levels.

Methodology/principal findings: We applied Illumina sequencing technology to sequence the transcriptome of L. entomophila. A total of 54,406,328 clean reads were obtained and that de novo assembled into 54,220 unigenes, with an average length of 571 bp. Through a similarity search, 33,404 (61.61%) unigenes were matched to known proteins in the NCBI non-redundant (Nr) protein database. These unigenes were further functionally annotated with gene ontology (GO), cluster of orthologous groups of proteins (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. A large number of genes potentially involved in insecticide resistance were manually curated, including 68 putative cytochrome P450 genes, 37 putative glutathione S-transferase (GST) genes, 19 putative carboxyl/cholinesterase (CCE) genes, and other 126 transcripts to contain target site sequences or encoding detoxification genes representing eight types of resistance enzymes. Furthermore, to gain insight into the molecular basis of the L. entomophila toward thermal stresses, 25 heat shock protein (Hsp) genes were identified. In addition, 1,100 SSRs and 57,757 SNPs were detected and 231 pairs of SSR primes were designed for investigating the genetic diversity in future.

Conclusions/significance: We developed a comprehensive transcriptomic database for L. entomophila. These sequences and putative molecular markers would further promote our understanding of the molecular mechanisms underlying insecticide resistance or environmental stress, and will facilitate studies on population genetics for psocids, as well as providing useful information for functional genomic research in the future.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cholinesterases / classification
  • Cholinesterases / genetics
  • Cytochrome P-450 Enzyme System / classification
  • Cytochrome P-450 Enzyme System / genetics
  • Databases, Genetic
  • Edible Grain / parasitology
  • Expressed Sequence Tags
  • Gene Expression
  • Gene Expression Profiling
  • Genetic Markers
  • Glutathione Transferase / classification
  • Glutathione Transferase / genetics
  • Heat-Shock Proteins / classification
  • Heat-Shock Proteins / genetics
  • Insect Proteins / classification
  • Insect Proteins / genetics*
  • Insecta / genetics*
  • Insecticide Resistance / genetics*
  • Microsatellite Repeats
  • Molecular Sequence Annotation*
  • Phylogeny
  • Polymorphism, Single Nucleotide
  • Sequence Analysis, DNA
  • Transcriptome*

Substances

  • Genetic Markers
  • Heat-Shock Proteins
  • Insect Proteins
  • Cytochrome P-450 Enzyme System
  • Glutathione Transferase
  • Cholinesterases

Grants and funding

This study was funded in part by the Fundamental Research Funds for the Central Universities, China (XDJK2013C051; XDJK2013A005), the Doctor Foundation Project of Southwest University (SWU112046), the National Natural Sciences Foundation (31000860), and the Specialized Research Fund for the Doctoral Program of Higher Education (20100182120022). The funders had no role in study design, data collection and analysis, the decision to publish, or the preparation of the manuscript.