Omics Data and Data Representations for Deep Learning-Based Predictive Modeling. 2022

Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
MLV Research Group, Department of Computer Science, International Hellenic University, 65404 Kavala, Greece.

Medical discoveries mainly depend on the capability to process and analyze biological datasets, which inundate the scientific community and are still expanding as the cost of next-generation sequencing technologies is decreasing. Deep learning (DL) is a viable method to exploit this massive data stream since it has advanced quickly with there being successive innovations. However, an obstacle to scientific progress emerges: the difficulty of applying DL to biology, and this because both fields are evolving at a breakneck pace, thus making it hard for an individual to occupy the front lines of both of them. This paper aims to bridge the gap and help computer scientists bring their valuable expertise into the life sciences. This work provides an overview of the most common types of biological data and data representations that are used to train DL models, with additional information on the models themselves and the various tasks that are being tackled. This is the essential information a DL expert with no background in biology needs in order to participate in DL-based research projects in biomedicine, biotechnology, and drug discovery. Alternatively, this study could be also useful to researchers in biology to understand and utilize the power of DL to gain better insights into and extract important information from the omics data.

UI MeSH Term Description Entries
D000077321 Deep Learning Supervised or unsupervised machine learning methods that use multiple layers of data representations generated by nonlinear transformations, instead of individual task-specific ALGORITHMS, to build and train neural network models. Hierarchical Learning,Learning, Deep,Learning, Hierarchical
D001709 Biotechnology Body of knowledge related to the use of organisms, cells or cell-derived constituents for the purpose of developing products which are technically, scientifically and clinically useful. Alteration of biologic function at the molecular level (i.e., GENETIC ENGINEERING) is a central focus; laboratory methods used include TRANSFECTION and CLONING technologies, sequence and structure analysis algorithms, computer databases, and gene and protein structure function analysis and prediction. Biotechnologies
D055808 Drug Discovery The process of finding chemicals for potential therapeutic use. Drug Prospecting,Discovery, Drug,Prospecting, Drug

Related Publications

Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
October 2023, Biology,
Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
May 2021, Briefings in bioinformatics,
Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
February 2018, Molecular omics,
Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
January 2019, IEEE journal of biomedical and health informatics,
Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
January 2020, BioData mining,
Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
August 2020, Medical image analysis,
Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
January 2021, Computational and structural biotechnology journal,
Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
January 2020, Frontiers in big data,
Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
April 2023, Computer methods and programs in biomedicine,
Stefanos Tsimenidis, and Eleni Vrochidou, and George A Papakostas
August 2022, Genome biology,
Copied contents to your clipboard!