Transcriptome Characterization for Non-Model Endangered Lycaenids, Protantigius superans and Spindasis takanosis, Using Illumina HiSeq 2500 Sequencing

Int J Mol Sci. 2015 Dec 16;16(12):29948-70. doi: 10.3390/ijms161226213.

Abstract

The Lycaenidae butterflies Protantigius superans and Spindasis takanosis are endangered insects in Korea known for their symbiotic association with ants. However, genomic and transcriptomic data are lacking for these species, which limits conservation efforts. In this study, the P. superans and S. takanosis transcriptomes were characterized using Illumina HiSeq 2500 sequencing. The P. superans and S. takanosis transcriptome data comprised 254,340,693 and 245,110,582 clean reads, assembled into 159,074 and 170,449 contigs and 107,950 and 121,140 unigenes, respectively. BLASTX hits (E-value of 1.0 × 10⁻⁵) against the known protein databases annotated a total of 46,754 and 51,908 transcripts for P. superans and S. takanosis, respectively. Approximately 41.25% and 38.68% of the unigenes for P. superans and S. takanosis, respectively, had homologous sequences in Protostome DB (PANM-DB). BLAST2GO analysis confirmed 18,611 unigenes representing Gene Ontology (GO) terms and a total of 5259 unigenes assigned to 116 pathways for P. superans. For S. takanosis, a total of 6697 unigenes were assigned to 119 pathways using the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database. Additionally, 382,164 and 390,516 Simple Sequence Repeats (SSRs) were compiled from the unigenes of P. superans and S. takanosis, respectively. This is the first report to record new genes and their utilization for the conservation of lycaenid populations and as reference information for closely related species.
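
As an illustration of how SSRs can be compiled from unigene sequences, the following is a minimal Python sketch of perfect-repeat detection with regular expressions. It is not the tool or the parameters used in the study, which the abstract does not state: the function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are hypothetical, chosen only for illustration.

```python
import re

# Hypothetical minimum repeat counts per motif length.
# These thresholds are an assumption, not values taken from the study.
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for size, min_rep in sorted(min_repeats.items()):
        # One motif of `size` bases followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves built from a shorter unit
            # (e.g. "AA" or "ATAT"), so each repeat is reported once.
            if any(motif == motif[:k] * (size // k)
                   for k in range(1, size) if size % k == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // size))
    return sorted(hits)


if __name__ == "__main__":
    # Toy sequence: a 7-copy AT repeat and a 5-copy GCA repeat.
    toy = "GG" + "AT" * 7 + "CC" + "GCA" * 5 + "TT"
    for start, motif, count in find_ssrs(toy):
        print(start, motif, count)
```

With the thresholds assumed here, the toy sequence yields one dinucleotide and one trinucleotide repeat. Different thresholds, or allowing mononucleotide and imperfect repeats, would change the counts considerably, so totals from a sketch like this are not comparable to the SSR numbers reported above.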

Keywords: BLAST2GO; Illumina sequencing; Protantigius superans; SSRs (simple sequence repeats); Spindasis takanosis; endangered species; transcriptome.

MeSH terms

  • Animals
  • Butterflies / genetics*
  • Cluster Analysis
  • Databases, Nucleic Acid
  • Endangered Species*
  • Gene Expression Profiling
  • Gene Ontology
  • High-Throughput Nucleotide Sequencing / methods*
  • Insect Proteins / chemistry
  • Insect Proteins / genetics
  • Microsatellite Repeats / genetics
  • Molecular Sequence Annotation
  • Nucleotide Motifs / genetics
  • Protein Structure, Tertiary
  • Sequence Homology, Nucleic Acid
  • Species Specificity
  • Transcriptome / genetics*

Substances

  • Insect Proteins