Machine learning methods for prediction of cancer driver genes: a survey paper. 2022

Renan Andrades, and Mariana Recamonde-Mendoza
Institute of Informatics, Universidade Federal do Rio Grande do Sul, Porto Alegre/RS, Brazil.

Identifying the genes and mutations that drive the emergence of tumors is a critical step to improving our understanding of cancer and identifying new directions for disease diagnosis and treatment. Despite the large volume of genomics data, the precise detection of driver mutations and their carrying genes, known as cancer driver genes, from the millions of possible somatic mutations remains a challenge. Computational methods play an increasingly important role in discovering genomic patterns associated with cancer drivers and developing predictive models to identify these elements. Machine learning (ML), including deep learning, has been the engine behind many of these efforts and provides excellent opportunities for tackling remaining gaps in the field. Thus, this survey aims to perform a comprehensive analysis of ML-based computational approaches to identify cancer driver mutations and genes, providing an integrated, panoramic view of the broad data and algorithmic landscape within this scientific problem. We discuss how the interactions among data types and ML algorithms have been explored in previous solutions and outline current analytical limitations that deserve further attention from the scientific community. We hope that by helping readers become more familiar with significant developments in the field brought by ML, we may inspire new researchers to address open problems and advance our knowledge towards cancer driver discovery.

UI MeSH Term Description Entries
D009154 Mutation Any detectable and heritable change in the genetic material that causes a change in the GENOTYPE and which is transmitted to daughter cells and to succeeding generations. Mutations
D009369 Neoplasms New abnormal growth of tissue. Malignant neoplasms show a greater degree of anaplasia and have the properties of invasion and metastasis, compared to benign neoplasms. Benign Neoplasm,Cancer,Malignant Neoplasm,Tumor,Tumors,Benign Neoplasms,Malignancy,Malignant Neoplasms,Neoplasia,Neoplasm,Neoplasms, Benign,Cancers,Malignancies,Neoplasias,Neoplasm, Benign,Neoplasm, Malignant,Neoplasms, Malignant
D009857 Oncogenes Genes whose gain-of-function alterations lead to NEOPLASTIC CELL TRANSFORMATION. They include, for example, genes for activators or stimulators of CELL PROLIFERATION such as growth factors, growth factor receptors, protein kinases, signal transducers, nuclear phosphoproteins, and transcription factors. A prefix of "v-" before oncogene symbols indicates oncogenes captured and transmitted by RETROVIRUSES; the prefix "c-" before the gene symbol of an oncogene indicates it is the cellular homolog (PROTO-ONCOGENES) of a v-oncogene. Transforming Genes,Oncogene,Transforming Gene,Gene, Transforming,Genes, Transforming
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000069550 Machine Learning A type of ARTIFICIAL INTELLIGENCE that enable COMPUTERS to independently initiate and execute LEARNING when exposed to new data. Transfer Learning,Learning, Machine,Learning, Transfer
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D019295 Computational Biology A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets. Bioinformatics,Molecular Biology, Computational,Bio-Informatics,Biology, Computational,Computational Molecular Biology,Bio Informatics,Bio-Informatic,Bioinformatic,Biologies, Computational Molecular,Biology, Computational Molecular,Computational Molecular Biologies,Molecular Biologies, Computational

Related Publications

Renan Andrades, and Mariana Recamonde-Mendoza
September 2019, PLoS computational biology,
Renan Andrades, and Mariana Recamonde-Mendoza
June 2021, Scientific reports,
Renan Andrades, and Mariana Recamonde-Mendoza
January 2021, Briefings in bioinformatics,
Renan Andrades, and Mariana Recamonde-Mendoza
May 2019, Nucleic acids research,
Renan Andrades, and Mariana Recamonde-Mendoza
January 2013, Methods in molecular biology (Clifton, N.J.),
Renan Andrades, and Mariana Recamonde-Mendoza
August 2015, Genetics and molecular research : GMR,
Renan Andrades, and Mariana Recamonde-Mendoza
January 2021, Briefings in bioinformatics,
Renan Andrades, and Mariana Recamonde-Mendoza
July 2021, Briefings in bioinformatics,
Renan Andrades, and Mariana Recamonde-Mendoza
January 2021, Theranostics,
Renan Andrades, and Mariana Recamonde-Mendoza
January 2008, IEEE reviews in biomedical engineering,
Copied contents to your clipboard!