A generalization of functional clustering for discrete multivariate longitudinal data. 2020

Yaeji Lim, and Ying Kuen Cheung, and Hee-Seok Oh
Department of Applied Statistics, Chung-Ang University, Seoul, Republic of Korea.

This paper presents a new model-based generalized functional clustering method for discrete longitudinal data, such as multivariate binomial and Poisson distributed data. For this purpose, we propose a multivariate functional principal component analysis (MFPCA)-based clustering procedure for a latent multivariate Gaussian process instead of the original functional data directly. The main contribution of this study is two-fold: modeling of discrete longitudinal data with the latent multivariate Gaussian process and developing of a clustering algorithm based on MFPCA coupled with the latent multivariate Gaussian process. Numerical experiments, including real data analysis and a simulation study, demonstrate the promising empirical properties of the proposed approach.

UI MeSH Term Description Entries
D003198 Computer Simulation Computer-based representation of physical systems and phenomena such as chemical processes. Computational Modeling,Computational Modelling,Computer Models,In silico Modeling,In silico Models,In silico Simulation,Models, Computer,Computerized Models,Computer Model,Computer Simulations,Computerized Model,In silico Model,Model, Computer,Model, Computerized,Model, In silico,Modeling, Computational,Modeling, In silico,Modelling, Computational,Simulation, Computer,Simulation, In silico,Simulations, Computer
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D015999 Multivariate Analysis A set of techniques used when variation in several variables are studied simultaneously. In statistics, multivariate analysis is interpreted as any analytic method that allows simultaneous study of two or more dependent variables. Analysis, Multivariate,Multivariate Analyses
D016000 Cluster Analysis A set of statistical methods used to group variables or observations into strongly inter-related subgroups. In epidemiology, it may be used to analyze a closely grouped series of events or cases of disease or other health-related phenomenon with well-defined distribution patterns in relation to time or place or both. Clustering,Analyses, Cluster,Analysis, Cluster,Cluster Analyses,Clusterings
D016011 Normal Distribution Continuous frequency distribution of infinite range. Its properties are as follows: 1, continuous, symmetrical distribution with both tails extending to infinity; 2, arithmetic mean, mode, and median identical; and 3, shape completely determined by the mean and standard deviation. Gaussian Distribution,Distribution, Gaussian,Distribution, Normal,Distributions, Normal,Normal Distributions

Related Publications

Yaeji Lim, and Ying Kuen Cheung, and Hee-Seok Oh
January 2016, Journal of biopharmaceutical statistics,
Yaeji Lim, and Ying Kuen Cheung, and Hee-Seok Oh
January 2022, Statistics in medicine,
Yaeji Lim, and Ying Kuen Cheung, and Hee-Seok Oh
January 2023, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America,
Yaeji Lim, and Ying Kuen Cheung, and Hee-Seok Oh
March 2017, Biometrics,
Yaeji Lim, and Ying Kuen Cheung, and Hee-Seok Oh
June 2016, Biometrika,
Yaeji Lim, and Ying Kuen Cheung, and Hee-Seok Oh
March 1967, Behavioral science,
Yaeji Lim, and Ying Kuen Cheung, and Hee-Seok Oh
January 2021, Theoretical biology forum,
Yaeji Lim, and Ying Kuen Cheung, and Hee-Seok Oh
September 2010, The annals of applied statistics,
Yaeji Lim, and Ying Kuen Cheung, and Hee-Seok Oh
April 1979, IEEE transactions on pattern analysis and machine intelligence,
Copied contents to your clipboard!