Evaluating machine learning methodologies for identification of cancer driver genes. 2021

Sharaf J Malebary, and Yaser Daanial Khan
Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, P.O. Box 344, Rabigh, 21911, Saudi Arabia.

Cancer is driven by distinctive sorts of changes and basic variations in genes. Recognizing cancer driver genes is basic for accurate oncological analysis. Numerous methodologies to distinguish and identify drivers presently exist, but efficient tools to combine and optimize them on huge datasets are few. Most strategies for prioritizing transformations depend basically on frequency-based criteria. Strategies are required to dependably prioritize organically dynamic driver changes over inert passengers in high-throughput sequencing cancer information sets. This study proposes a model namely PCDG-Pred which works as a utility capable of distinguishing cancer driver and passenger attributes of genes based on sequencing data. Keeping in view the significance of the cancer driver genes an efficient method is proposed to identify the cancer driver genes. Further, various validation techniques are applied at different levels to establish the effectiveness of the model and to obtain metrics like accuracy, Mathew's correlation coefficient, sensitivity, and specificity. The results of the study strongly indicate that the proposed strategy provides a fundamental functional advantage over other existing strategies for cancer driver genes identification. Subsequently, careful experiments exhibit that the accuracy metrics obtained for self-consistency, independent set, and cross-validation tests are 91.08%., 87.26%, and 92.48% respectively.

UI MeSH Term Description Entries
D009369 Neoplasms New abnormal growth of tissue. Malignant neoplasms show a greater degree of anaplasia and have the properties of invasion and metastasis, compared to benign neoplasms. Benign Neoplasm,Cancer,Malignant Neoplasm,Tumor,Tumors,Benign Neoplasms,Malignancy,Malignant Neoplasms,Neoplasia,Neoplasm,Neoplasms, Benign,Cancers,Malignancies,Neoplasias,Neoplasm, Benign,Neoplasm, Malignant,Neoplasms, Malignant
D009857 Oncogenes Genes whose gain-of-function alterations lead to NEOPLASTIC CELL TRANSFORMATION. They include, for example, genes for activators or stimulators of CELL PROLIFERATION such as growth factors, growth factor receptors, protein kinases, signal transducers, nuclear phosphoproteins, and transcription factors. A prefix of "v-" before oncogene symbols indicates oncogenes captured and transmitted by RETROVIRUSES; the prefix "c-" before the gene symbol of an oncogene indicates it is the cellular homolog (PROTO-ONCOGENES) of a v-oncogene. Transforming Genes,Oncogene,Transforming Gene,Gene, Transforming,Genes, Transforming
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000069550 Machine Learning A type of ARTIFICIAL INTELLIGENCE that enable COMPUTERS to independently initiate and execute LEARNING when exposed to new data. Transfer Learning,Learning, Machine,Learning, Transfer
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D012372 ROC Curve A graphic means for assessing the ability of a screening test to discriminate between healthy and diseased persons; may also be used in other studies, e.g., distinguishing stimuli responses as to a faint stimuli or nonstimuli. ROC Analysis,Receiver Operating Characteristic,Analysis, ROC,Analyses, ROC,Characteristic, Receiver Operating,Characteristics, Receiver Operating,Curve, ROC,Curves, ROC,ROC Analyses,ROC Curves,Receiver Operating Characteristics
D014408 Biomarkers, Tumor Molecular products metabolized and secreted by neoplastic tissue and characterized biochemically in cells or BODY FLUIDS. They are indicators of tumor stage and grade as well as useful for monitoring responses to treatment and predicting recurrence. Many chemical groups are represented including HORMONES; ANTIGENS; amino and NUCLEIC ACIDS; ENZYMES; POLYAMINES; and specific CELL MEMBRANE PROTEINS and LIPIDS. Biochemical Tumor Marker,Cancer Biomarker,Carcinogen Markers,Markers, Tumor,Metabolite Markers, Neoplasm,Tumor Biomarker,Tumor Marker,Tumor Markers, Biochemical,Tumor Markers, Biological,Biochemical Tumor Markers,Biological Tumor Marker,Biological Tumor Markers,Biomarkers, Cancer,Marker, Biochemical Tumor,Marker, Biologic Tumor,Marker, Biological Tumor,Marker, Neoplasm Metabolite,Marker, Tumor Metabolite,Markers, Biochemical Tumor,Markers, Biological Tumor,Markers, Neoplasm Metabolite,Markers, Tumor Metabolite,Metabolite Markers, Tumor,Neoplasm Metabolite Markers,Tumor Markers, Biologic,Tumor Metabolite Marker,Biologic Tumor Marker,Biologic Tumor Markers,Biomarker, Cancer,Biomarker, Tumor,Cancer Biomarkers,Marker, Tumor,Markers, Biologic Tumor,Markers, Carcinogen,Metabolite Marker, Neoplasm,Metabolite Marker, Tumor,Neoplasm Metabolite Marker,Tumor Biomarkers,Tumor Marker, Biochemical,Tumor Marker, Biologic,Tumor Marker, Biological,Tumor Markers,Tumor Metabolite Markers
D015203 Reproducibility of Results The statistical reproducibility of measurements (often in a clinical context), including the testing of instrumentation or techniques to obtain reproducible results. The concept includes reproducibility of physiological measurements, which may be used to develop rules to assess probability or prognosis, or response to a stimulus; reproducibility of occurrence of a condition; and reproducibility of experimental results. Reliability and Validity,Reliability of Result,Reproducibility Of Result,Reproducibility of Finding,Validity of Result,Validity of Results,Face Validity,Reliability (Epidemiology),Reliability of Results,Reproducibility of Findings,Test-Retest Reliability,Validity (Epidemiology),Finding Reproducibilities,Finding Reproducibility,Of Result, Reproducibility,Of Results, Reproducibility,Reliabilities, Test-Retest,Reliability, Test-Retest,Result Reliabilities,Result Reliability,Result Validities,Result Validity,Result, Reproducibility Of,Results, Reproducibility Of,Test Retest Reliability,Validity and Reliability,Validity, Face
D060388 Support Vector Machine SUPERVISED MACHINE LEARNING algorithm which learns to assign labels to objects from a set of training examples. Examples are learning to recognize fraudulent credit card activity by examining hundreds or thousands of fraudulent and non-fraudulent credit card activity, or learning to make disease diagnosis or prognosis based on automatic classification of microarray gene expression profiles drawn from hundreds or thousands of samples. Support Vector Network,Machine, Support Vector,Machines, Support Vector,Network, Support Vector,Networks, Support Vector,Support Vector Machines,Support Vector Networks,Vector Machine, Support,Vector Machines, Support,Vector Network, Support,Vector Networks, Support
D019295 Computational Biology A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets. Bioinformatics,Molecular Biology, Computational,Bio-Informatics,Biology, Computational,Computational Molecular Biology,Bio Informatics,Bio-Informatic,Bioinformatic,Biologies, Computational Molecular,Biology, Computational Molecular,Computational Molecular Biologies,Molecular Biologies, Computational

Related Publications

Sharaf J Malebary, and Yaser Daanial Khan
July 2021, Briefings in bioinformatics,
Sharaf J Malebary, and Yaser Daanial Khan
May 2022, Briefings in bioinformatics,
Sharaf J Malebary, and Yaser Daanial Khan
May 2019, Nucleic acids research,
Sharaf J Malebary, and Yaser Daanial Khan
January 2022, iScience,
Sharaf J Malebary, and Yaser Daanial Khan
January 2023, Frontiers in immunology,
Sharaf J Malebary, and Yaser Daanial Khan
November 2014, Proceedings of the 2014 6th International Advanced Research Workshop on In Silico Oncology and Cancer Investigation : the CHIC Project Workshop (IARWISOCI) : Athens, Greece, 3-4 November 2014. International Advanced Research Workshop on...,
Sharaf J Malebary, and Yaser Daanial Khan
December 2016, Proceedings of the National Academy of Sciences of the United States of America,
Sharaf J Malebary, and Yaser Daanial Khan
September 2019, PLoS computational biology,
Sharaf J Malebary, and Yaser Daanial Khan
February 2022, Biology,
Copied contents to your clipboard!