Multi-omic and multi-view clustering algorithms: review and cancer benchmark

Nimrod Rappoport; Ron Shamir

doi:10.1093/nar/gky889

Multi-omic and multi-view clustering algorithms: review and cancer benchmark

Nucleic Acids Res. 2018 Nov 16;46(20):10546-10562. doi: 10.1093/nar/gky889.

Authors

Nimrod Rappoport¹, Ron Shamir¹

Affiliation

¹ Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel.

Abstract

Recent high throughput experimental methods have been used to collect large biomedical omics datasets. Clustering of single omic datasets has proven invaluable for biological and medical research. The decreasing cost and development of additional high throughput methods now enable measurement of multi-omic data. Clustering multi-omic data has the potential to reveal further systems-level insights, but raises computational and biological challenges. Here, we review algorithms for multi-omics clustering, and discuss key issues in applying these algorithms. Our review covers methods developed specifically for omic data as well as generic multi-view methods developed in the machine learning community for joint clustering of multiple data types. In addition, using cancer data from TCGA, we perform an extensive benchmark spanning ten different cancer types, providing the first systematic comparison of leading multi-omics and multi-view clustering algorithms. The results highlight key issues regarding the use of single- versus multi-omics, the choice of clustering strategy, the power of generic multi-view methods and the use of approximated p-values for gauging solution quality. Due to the growing use of multi-omics data, we expect these issues to be important for future progress in the field.

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Algorithms
Bayes Theorem
Cluster Analysis
Computational Biology / methods*
Databases, Factual
Genomics / methods*
Humans
Machine Learning
Models, Statistical
Neoplasms / genetics*
Neoplasms / mortality
Probability
Prognosis
Proteomics / methods*