Integration of Online Omics-Data Resources for Cancer Research

Front Genet. 2020 Oct 23:11:578345. doi: 10.3389/fgene.2020.578345. eCollection 2020.

Abstract

The manifestations of cancerous phenotypes necessitate alterations at different levels of information-flow from genome to proteome. The molecular alterations at different information processing levels serve as the basis for the cancer phenotype to emerge. To understand the underlying mechanisms that drive the acquisition of cancer hallmarks it is required to interrogate cancer cells using multiple levels of information flow represented by different omics - such as genomics, epigenomics, transcriptomics, and proteomics. The advantage of multi-omics data integration comes with a trade-off in the form of an added layer of complexity originating from inherently diverse types of omics-datasets that may pose a challenge to integrate the omics-data in a biologically meaningful manner. The plethora of cancer-specific online omics-data resources, if able to be integrated efficiently and systematically, may facilitate the generation of new biological insights for cancer research. In this review, we provide a comprehensive overview of the online single- and multi-omics resources that are dedicated to cancer. We catalog various online omics-data resources such as The Cancer Genome Atlas (TCGA) along with various TCGA-associated data portals and tools for multi-omics analysis and visualization, the International Cancer Genome Consortium (ICGC), Catalogue of Somatic Mutations in Cancer (COSMIC), The Pathology Atlas, Gene Expression Omnibus (GEO), and PRoteomics IDEntifications (PRIDE). By comparing the strengths and limitations of the respective online resources, we aim to highlight the current biological and technological challenges and possible strategies to overcome these challenges. We outline the available schemes for the integration of the multi-omics dimensions for stratifying cancer patients and biomarker prediction based on the integrated molecular-signatures of cancer. Finally, we propose the multi-omics driven systems-biology approaches to realize the potential of precision onco-medicine as the future of cancer research. We believe this systematic review will encourage scientists and clinicians worldwide to utilize the online resources to explore and integrate the available omics datasets that may provide a window of opportunity to generate new biological insights and contribute to the advancement of the field of cancer research.

Keywords: cancer; data-integration; multi-omics; proteogenomic analysis; systems biology.

Publication types

  • Review