Detecting differential protein expression in large-scale population proteomics

Bioinformatics. 2014 Oct;30(19):2741-6. doi: 10.1093/bioinformatics/btu341. Epub 2014 Jun 12.

Abstract

Motivation: Mass spectrometry (MS)-based high-throughput quantitative proteomics shows great potential in large-scale clinical biomarker studies, identifying and quantifying thousands of proteins in biological samples. However, there are unique challenges in analyzing the quantitative proteomics data. One issue is that the quantification of a given peptide is often missing in a subset of the experiments, especially for less abundant peptides. Another issue is that different MS experiments of the same study have significantly varying numbers of peptides quantified, which can result in more missing peptide abundances in an experiment that has a smaller total number of quantified peptides. To detect as many biomarker proteins as possible, it is necessary to develop bioinformatics methods that appropriately handle these challenges.

Results: We propose a Significance Analysis for Large-scale Proteomics Studies (SALPS) that handles missing peptide intensity values caused by the two mechanisms mentioned above. Our model has a robust performance in both simulated data and proteomics data from a large clinical study. Because varying patients' sample qualities and deviating instrument performances are not avoidable for clinical studies performed over the course of several years, we believe that our approach will be useful to analyze large-scale clinical proteomics data.

Availability and implementation: R codes for SALPS are available at http://www.stanford.edu/%7eclairesr/software.html.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Computational Biology / methods
  • Computer Simulation
  • Gene Expression Regulation*
  • Humans
  • Mass Spectrometry / methods
  • Peptides / chemistry
  • Proteins / chemistry
  • Proteome / analysis*
  • Proteomics / methods*

Substances

  • Peptides
  • Proteins
  • Proteome