A new variance stabilizing transformation for gene expression data analysis

Stat Appl Genet Mol Biol. 2013 Dec;12(6):653-66. doi: 10.1515/sagmb-2012-0030.

Abstract

In this paper, we introduce a new family of power transformations, which has the generalized logarithm as one of its members, in the same manner as the usual logarithm belongs to the family of Box-Cox power transformations. Although the new family has been developed for analyzing gene expression data, it allows a wider scope of mean-variance related data to be reached. We study the analytical properties of the new family of transformations, as well as the mean-variance relationships that are stabilized by using its members. We propose a methodology based on this new family, which includes a simple strategy for selecting the family member adequate for a data set. We evaluate the finite sample behavior of different classical and robust estimators based on this strategy by Monte Carlo simulations. We analyze real genomic data by using the proposed transformation to empirically show how the new methodology allows the variance of these data to be stabilized.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Gene Expression Profiling*
  • Humans
  • Linear Models
  • Models, Genetic
  • Monte Carlo Method
  • Oligonucleotide Array Sequence Analysis
  • Software