Evaluation of read count based RNAseq analysis methods

BMC Genomics. 2013;14 Suppl 8(Suppl 8):S2. doi: 10.1186/1471-2164-14-S8-S2. Epub 2013 Dec 9.

Abstract

Background: RNAseq technology is replacing microarray technology as the tool of choice for gene expression profiling. Although RNAseq provides much richer data than microarrays, its analysis has been much more challenging. To date, there is no consensus on the best approach for conducting robust RNAseq analysis.

Results: In this study, we designed a thorough experiment to evaluate six read count-based RNAseq analysis methods (DESeq, DEGseq, edgeR, NBPSeq, TSPM and baySeq) using both real and simulated data. We found that the six methods produce similar fold changes and a reasonable overlap of differentially expressed genes based on p-values. However, all six methods suffer from over-sensitivity.
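
One way to quantify the overlap of differentially expressed genes between methods is a pairwise Jaccard index over the gene sets called at a p-value cutoff. The sketch below is illustrative only: the abstract does not state which overlap measure or cutoff was used, and the function names, the 0.05 threshold, and the method and gene labels are assumptions.

```python
from itertools import combinations


def significant_genes(pvalues, alpha=0.05):
    """Return the set of genes whose p-value is below alpha.

    `pvalues` maps gene name -> p-value; alpha=0.05 is an assumed cutoff.
    """
    return {gene for gene, p in pvalues.items() if p < alpha}


def pairwise_jaccard(calls):
    """Jaccard index (|A & B| / |A | B|) for every pair of methods.

    `calls` maps method name -> set of genes called differentially expressed.
    Pairs whose union is empty get 0.0.
    """
    overlap = {}
    for a, b in combinations(sorted(calls), 2):
        union = calls[a] | calls[b]
        overlap[(a, b)] = len(calls[a] & calls[b]) / len(union) if union else 0.0
    return overlap


# Hypothetical p-values for two placeholder methods (not data from the study).
example = {
    "method_A": {"gene1": 0.001, "gene2": 0.020, "gene3": 0.400},
    "method_B": {"gene1": 0.300, "gene2": 0.010, "gene3": 0.030},
}
calls = {name: significant_genes(p) for name, p in example.items()}
jaccard = pairwise_jaccard(calls)
```

A value near 1 means two methods call almost the same genes, while a value near 0 means they largely disagree.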

Conclusions: Based on the evaluation of runtime using real data and area under the receiver operating characteristic curve (AUC-ROC) using simulated data, we found that edgeR achieves a better balance between speed and accuracy than the other methods.
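
With simulated data the truly differentially expressed genes are known, so each method's ranking of genes can be scored by AUC-ROC. The following is a minimal sketch, not the authors' evaluation code: it computes AUC through the rank-sum (Mann-Whitney U) identity, and it assumes higher scores mean stronger evidence of differential expression (for example, 1 minus the p-value). The function name and inputs are hypothetical.

```python
def auc_roc(scores, labels):
    """Area under the ROC curve via the rank-sum (Mann-Whitney U) identity.

    scores: higher means more likely differentially expressed (assumption).
    labels: truthy for truly differentially expressed genes, falsy otherwise.
    Tied scores receive the average of their ranks.
    """
    pairs = sorted(zip(scores, labels), key=lambda pair: pair[0])
    n = len(pairs)
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Find the run of tied scores starting at position i.
        j = i
        while j + 1 < n and pairs[j + 1][0] == pairs[i][0]:
            j += 1
        average_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[k] = average_rank
        i = j + 1

    n_pos = sum(1 for _, label in pairs if label)
    n_neg = n - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative")

    rank_sum_pos = sum(r for r, (_, label) in zip(ranks, pairs) if label)
    u_statistic = rank_sum_pos - n_pos * (n_pos + 1) / 2
    return u_statistic / (n_pos * n_neg)
```

An AUC of 1.0 means every truly differentially expressed gene is ranked above every null gene, and 0.5 corresponds to a random ranking.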

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Area Under Curve
  • Computer Simulation
  • Gene Expression Profiling
  • Genomics
  • Humans
  • ROC Curve
  • Sensitivity and Specificity
  • Sequence Analysis, RNA / methods*
  • Software*