Development of a RNA-Seq Based Prognostic Signature in Lung Adenocarcinoma

J Natl Cancer Inst. 2016 Oct 5;109(1):djw200. doi: 10.1093/jnci/djw200. Print 2017 Jan.

Abstract

Background: Precision therapy for lung cancer will require comprehensive genomic testing to identify actionable targets as well as ascertain disease prognosis. RNA-seq is a robust platform that meets these requirements, but microarray-derived prognostic signatures are not optimal for RNA-seq data. Thus, we undertook the first prognostic analysis of lung adenocarcinoma RNA-seq data and generated a prognostic signature.

Methods: Lung adenocarcinoma RNA-seq and clinical data from The Cancer Genome Atlas (TCGA) were divided chronologically into training (n = 255) and validation (n = 157) cohorts. In the training cohort, prognostic association was assessed by univariate Cox analysis. A prognostic signature was built with stepwise multivariable Cox analysis. Outcomes by risk group, stage, and mutation status were analyzed with Kaplan-Meier and multivariable Cox analyses. All the statistical tests were two-sided.

Results: In the training cohort, 96 genes had prognostic association with P values of less than or equal to 1.00x10-4, including five long noncoding RNAs (lncRNAs). Stepwise regression generated a four-gene signature, including one lncRNA. Signature high-risk cases had worse overall survival (OS) in the TCGA validation cohort (hazard ratio [HR] = 3.07, 95% confidence interval [CI] = 2.00 to 14.62) and a University of Michigan institutional cohort (n = 67; HR = 2.05, 95% CI = 1.18 to 4.55), and worse metastasis-free survival in the TCGA validation cohort (HR = 3.05, 95% CI = 2.31 to 13.37). The four-gene prognostic signature also statistically significantly stratified overall survival in important clinical subsets, including stage I (HR = 2.78, 95% CI = 1.91 to 11.13), EGFR wild-type (HR = 3.01, 95% CI = 1.73 to 14.98), and EGFR mutant (HR = 8.99, 95% CI = 62.23 to 141.44). The four-gene prognostic signature also stood out on top when compared with other prognostic signatures.

Conclusions: Here, we present the first RNA-seq prognostic signature for lung adenocarcinoma that can provide a powerful prognostic tool for precision oncology as part of an integrated RNA-seq clinical sequencing program.

Publication types

  • Validation Study

MeSH terms

  • Adenocarcinoma / genetics*
  • Adenocarcinoma / secondary
  • Aged
  • Antigens, CD / genetics
  • Databases, Genetic
  • Female
  • GPI-Linked Proteins / genetics
  • GTP-Binding Proteins / genetics
  • Humans
  • Kaplan-Meier Estimate
  • Lung Neoplasms / genetics*
  • Lung Neoplasms / pathology
  • Male
  • Membrane Proteins / genetics
  • Middle Aged
  • Mutation
  • Neoplasm Proteins / genetics
  • Neoplasm Staging
  • Nerve Tissue Proteins / genetics
  • Prognosis
  • Proportional Hazards Models
  • Risk Assessment
  • Sequence Analysis, RNA*
  • Survival Rate
  • Transcriptome*

Substances

  • Antigens, CD
  • CD109 protein, human
  • FRRS1L protein, human
  • GPI-Linked Proteins
  • Membrane Proteins
  • Neoplasm Proteins
  • Nerve Tissue Proteins
  • RHOV protein, human
  • GTP-Binding Proteins