A benchmark study of deep learning-based multi-omics data fusion methods for cancer. 2022

Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
Institute of Health Service and Transfusion Medicine, Beijing, People's Republic of China.

A fused method using a combination of multi-omics data enables a comprehensive study of complex biological processes and highlights the interrelationship of relevant biomolecules and their functions. Driven by high-throughput sequencing technologies, several promising deep learning methods have been proposed for fusing multi-omics data generated from a large number of samples. In this study, 16 representative deep learning methods are comprehensively evaluated on simulated, single-cell, and cancer multi-omics datasets. For each of the datasets, two tasks are designed: classification and clustering. The classification performance is evaluated by using three benchmarking metrics including accuracy, F1 macro, and F1 weighted. Meanwhile, the clustering performance is evaluated by using four benchmarking metrics including the Jaccard index (JI), C-index, silhouette score, and Davies Bouldin score. For the cancer multi-omics datasets, the methods' strength in capturing the association of multi-omics dimensionality reduction results with survival and clinical annotations is further evaluated. The benchmarking results indicate that moGAT achieves the best classification performance. Meanwhile, efmmdVAE, efVAE, and lfmmdVAE show the most promising performance across all complementary contexts in clustering tasks. Our benchmarking results not only provide a reference for biomedical researchers to choose appropriate deep learning-based multi-omics data fusion methods, but also suggest the future directions for the development of more effective multi-omics data fusion methods. The deep learning frameworks are available at https://github.com/zhenglinyi/DL-mo .

UI MeSH Term Description Entries
D009369 Neoplasms New abnormal growth of tissue. Malignant neoplasms show a greater degree of anaplasia and have the properties of invasion and metastasis, compared to benign neoplasms. Benign Neoplasm,Cancer,Malignant Neoplasm,Tumor,Tumors,Benign Neoplasms,Malignancy,Malignant Neoplasms,Neoplasia,Neoplasm,Neoplasms, Benign,Cancers,Malignancies,Neoplasias,Neoplasm, Benign,Neoplasm, Malignant,Neoplasms, Malignant
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000077321 Deep Learning Supervised or unsupervised machine learning methods that use multiple layers of data representations generated by nonlinear transformations, instead of individual task-specific ALGORITHMS, to build and train neural network models. Hierarchical Learning,Learning, Deep,Learning, Hierarchical
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D016000 Cluster Analysis A set of statistical methods used to group variables or observations into strongly inter-related subgroups. In epidemiology, it may be used to analyze a closely grouped series of events or cases of disease or other health-related phenomenon with well-defined distribution patterns in relation to time or place or both. Clustering,Analyses, Cluster,Analysis, Cluster,Cluster Analyses,Clusterings
D019985 Benchmarking Method of measuring performance against established standards of best practice. Benchmarking, Health Care,Benchmarks,Best Practice Analysis,Metrics,Benchmark,Benchmarking, Healthcare,Analysis, Best Practice,Health Care Benchmarking,Healthcare Benchmarking

Related Publications

Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
January 2020, BioData mining,
Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
October 2023, Biology,
Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
May 2021, Briefings in bioinformatics,
Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
October 2022, BMC bioinformatics,
Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
January 2022, Briefings in bioinformatics,
Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
June 2021, Cancers,
Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
July 2021, Computers in biology and medicine,
Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
July 2023, Briefings in bioinformatics,
Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
January 2021, Computational and structural biotechnology journal,
Dongjin Leng, and Linyi Zheng, and Yuqi Wen, and Yunhao Zhang, and Lianlian Wu, and Jing Wang, and Meihong Wang, and Zhongnan Zhang, and Song He, and Xiaochen Bo
August 2021, Bioinformatics (Oxford, England),
Copied contents to your clipboard!