Linked matrix factorization. 2019

Michael J O'Connell, and Eric F Lock
Department of Statistics, Miami University, Oxford, Ohio 45056.

Several recent methods address the dimension reduction and decomposition of linked high-content data matrices. Typically, these methods consider one dimension, rows or columns, that is shared among the matrices. This shared dimension may represent common features measured for different sample sets (horizontal integration) or a common sample set with features from different platforms (vertical integration). We introduce an approach for simultaneous horizontal and vertical integration, Linked Matrix Factorization (LMF), for the general case where some matrices share rows (e.g., features) and some share columns (e.g., samples). Our motivating application is a cytotoxicity study with accompanying genomic and molecular chemical attribute data. The toxicity matrix (cell lines chemicals) shares samples with a genotype matrix (cell lines SNPs) and shares features with a molecular attribute matrix (chemicals attributes). LMF gives a unified low-rank factorization of these three matrices, which allows for the decomposition of systematic variation that is shared and systematic variation that is specific to each matrix. This allows for efficient dimension reduction, exploratory visualization, and the imputation of missing data even when entire rows or columns are missing. We present theoretical results concerning the uniqueness, identifiability, and minimal parametrization of LMF, and evaluate it with extensive simulation studies.

UI MeSH Term Description Entries
D008956 Models, Chemical Theoretical representations that simulate the behavior or activity of chemical processes or phenomena; includes the use of mathematical equations, computers, and other electronic equipment. Chemical Models,Chemical Model,Model, Chemical
D008962 Models, Theoretical Theoretical representations that simulate the behavior or activity of systems, processes, or phenomena. They include the use of mathematical equations, computers, and other electronic equipment. Experimental Model,Experimental Models,Mathematical Model,Model, Experimental,Models (Theoretical),Models, Experimental,Models, Theoretic,Theoretical Study,Mathematical Models,Model (Theoretical),Model, Mathematical,Model, Theoretical,Models, Mathematical,Studies, Theoretical,Study, Theoretical,Theoretical Model,Theoretical Models,Theoretical Studies
D003198 Computer Simulation Computer-based representation of physical systems and phenomena such as chemical processes. Computational Modeling,Computational Modelling,Computer Models,In silico Modeling,In silico Models,In silico Simulation,Models, Computer,Computerized Models,Computer Model,Computer Simulations,Computerized Model,In silico Model,Model, Computer,Model, Computerized,Model, In silico,Modeling, Computational,Modeling, In silico,Modelling, Computational,Simulation, Computer,Simulation, In silico,Simulations, Computer
D003603 Cytotoxins Substances that are toxic to cells; they may be involved in immunity or may be contained in venoms. These are distinguished from CYTOSTATIC AGENTS in degree of effect. Some of them are used as CYTOTOXIC ANTIBIOTICS. The mechanism of action of many of these are as ALKYLATING AGENTS or MITOSIS MODULATORS. Cytolysins,Cytotoxic Agent,Cytotoxic Agents,Cytotoxin,Agent, Cytotoxic
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000818 Animals Unicellular or multicellular, heterotrophic organisms, that have sensation and the power of voluntary movement. Under the older five kingdom paradigm, Animalia was one of the kingdoms. Under the modern three domain model, Animalia represents one of the many groups in the domain EUKARYOTA. Animal,Metazoa,Animalia
D023281 Genomics The systematic study of the complete DNA sequences (GENOME) of organisms. Included is construction of complete genetic, physical, and transcript maps, and the analysis of this structural genomic information on a global scale such as in GENOME WIDE ASSOCIATION STUDIES. Functional Genomics,Structural Genomics,Comparative Genomics,Genomics, Comparative,Genomics, Functional,Genomics, Structural

Related Publications

Michael J O'Connell, and Eric F Lock
March 2022, The annals of applied statistics,
Michael J O'Connell, and Eric F Lock
February 2018, Neural networks : the official journal of the International Neural Network Society,
Michael J O'Connell, and Eric F Lock
October 2014, IEEE transactions on pattern analysis and machine intelligence,
Michael J O'Connell, and Eric F Lock
November 2023, IEEE transactions on neural networks and learning systems,
Michael J O'Connell, and Eric F Lock
January 2021, Journal of machine learning research : JMLR,
Michael J O'Connell, and Eric F Lock
March 2006, IEEE transactions on pattern analysis and machine intelligence,
Michael J O'Connell, and Eric F Lock
October 2016, IEEE transactions on neural networks and learning systems,
Michael J O'Connell, and Eric F Lock
September 2019, Physical review letters,
Michael J O'Connell, and Eric F Lock
January 2021, IEEE transactions on neural networks and learning systems,
Michael J O'Connell, and Eric F Lock
September 2017, IEEE transactions on neural networks and learning systems,
Copied contents to your clipboard!