Variable selection in a flexible parametric mixture cure model with interval-censored data. 2016

Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA), Université catholique de Louvain, Louvain-la-Neuve, Belgium.

In standard survival analysis, it is generally assumed that every individual will experience someday the event of interest. However, this is not always the case, as some individuals may not be susceptible to this event. Also, in medical studies, it is frequent that patients come to scheduled interviews and that the time to the event is only known to occur between two visits. That is, the data are interval-censored with a cure fraction. Variable selection in such a setting is of outstanding interest. Covariates impacting the survival are not necessarily the same as those impacting the probability to experience the event. The objective of this paper is to develop a parametric but flexible statistical model to analyze data that are interval-censored and include a fraction of cured individuals when the number of potential covariates may be large. We use the parametric mixture cure model with an accelerated failure time regression model for the survival, along with the extended generalized gamma for the error term. To overcome the issue of non-stable and non-continuous variable selection procedures, we extend the adaptive LASSO to our model. By means of simulation studies, we show good performance of our method and discuss the behavior of estimates with varying cure and censoring proportion. Lastly, our proposed method is illustrated with a real dataset studying the time until conversion to mild cognitive impairment, a possible precursor of Alzheimer's disease.

UI MeSH Term Description Entries
D011336 Probability The study of chance processes or the relative frequency characterizing a chance process. Probabilities
D003198 Computer Simulation Computer-based representation of physical systems and phenomena such as chemical processes. Computational Modeling,Computational Modelling,Computer Models,In silico Modeling,In silico Models,In silico Simulation,Models, Computer,Computerized Models,Computer Model,Computer Simulations,Computerized Model,In silico Model,Model, Computer,Model, Computerized,Model, In silico,Modeling, Computational,Modeling, In silico,Modelling, Computational,Simulation, Computer,Simulation, In silico,Simulations, Computer
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D000544 Alzheimer Disease A degenerative disease of the BRAIN characterized by the insidious onset of DEMENTIA. Impairment of MEMORY, judgment, attention span, and problem solving skills are followed by severe APRAXIAS and a global loss of cognitive abilities. The condition primarily occurs after age 60, and is marked pathologically by severe cortical atrophy and the triad of SENILE PLAQUES; NEUROFIBRILLARY TANGLES; and NEUROPIL THREADS. (From Adams et al., Principles of Neurology, 6th ed, pp1049-57) Acute Confusional Senile Dementia,Alzheimer's Diseases,Dementia, Alzheimer Type,Dementia, Senile,Presenile Alzheimer Dementia,Senile Dementia, Alzheimer Type,Alzheimer Dementia,Alzheimer Disease, Early Onset,Alzheimer Disease, Late Onset,Alzheimer Sclerosis,Alzheimer Syndrome,Alzheimer Type Senile Dementia,Alzheimer's Disease,Alzheimer's Disease, Focal Onset,Alzheimer-Type Dementia (ATD),Dementia, Presenile,Dementia, Primary Senile Degenerative,Early Onset Alzheimer Disease,Familial Alzheimer Disease (FAD),Focal Onset Alzheimer's Disease,Late Onset Alzheimer Disease,Primary Senile Degenerative Dementia,Senile Dementia, Acute Confusional,Alzheimer Dementias,Alzheimer Disease, Familial (FAD),Alzheimer Diseases,Alzheimer Type Dementia,Alzheimer Type Dementia (ATD),Alzheimers Diseases,Dementia, Alzheimer,Dementia, Alzheimer-Type (ATD),Familial Alzheimer Diseases (FAD),Presenile Dementia,Sclerosis, Alzheimer,Senile Dementia
D012307 Risk Factors An aspect of personal behavior or lifestyle, environmental exposure, inborn or inherited characteristic, which, based on epidemiological evidence, is known to be associated with a health-related condition considered important to prevent. Health Correlates,Risk Factor Scores,Risk Scores,Social Risk Factors,Population at Risk,Populations at Risk,Correlates, Health,Factor, Risk,Factor, Social Risk,Factors, Social Risk,Risk Factor,Risk Factor Score,Risk Factor, Social,Risk Factors, Social,Risk Score,Score, Risk,Score, Risk Factor,Social Risk Factor
D013997 Time Factors Elements of limited time intervals, contributing to particular results or situations. Time Series,Factor, Time,Time Factor
D015233 Models, Statistical Statistical formulations or analyses which, when applied to data and found to fit the data, are then used to verify the assumptions and parameters used in the analysis. Examples of statistical models are the linear model, binomial model, polynomial model, two-parameter model, etc. Probabilistic Models,Statistical Models,Two-Parameter Models,Model, Statistical,Models, Binomial,Models, Polynomial,Statistical Model,Binomial Model,Binomial Models,Model, Binomial,Model, Polynomial,Model, Probabilistic,Model, Two-Parameter,Models, Probabilistic,Models, Two-Parameter,Polynomial Model,Polynomial Models,Probabilistic Model,Two Parameter Models,Two-Parameter Model
D016019 Survival Analysis A class of statistical procedures for estimating the survival function (function of time, starting with a population 100% well at a given time and providing the percentage of the population still well at later times). The survival analysis is then used for making inferences about the effects of treatments, prognostic factors, exposures, and other covariates on the function. Analysis, Survival,Analyses, Survival,Survival Analyses
D056808 Biostatistics The application of STATISTICS to biological systems and organisms involving the retrieval or collection, analysis, reduction, and interpretation of qualitative and quantitative data. Biological Statistics,Biological Statistic,Statistic, Biological,Statistics, Biological

Related Publications

Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
February 2023, Statistics in medicine,
Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
April 2024, Lifetime data analysis,
Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
April 2011, Statistics in medicine,
Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
January 2008, Statistics in medicine,
Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
January 2023, Biometrical journal. Biometrische Zeitschrift,
Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
January 2021, Lifetime data analysis,
Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
September 2013, Biometrical journal. Biometrische Zeitschrift,
Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
December 2023, Biometrics,
Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
January 2018, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America,
Sylvie Scolas, and Anouar El Ghouch, and Catherine Legrand, and Abderrahim Oulhaj
December 2009, Journal of the American Statistical Association,
Copied contents to your clipboard!