Regression analysis with missing covariate data using estimating equations. 1996

L P Zhao, and S Lipsitz, and D Lew
Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98104, USA.

In regression analysis, missing covariate data has been among the most common problems. Frequently, practitioners adopt the so-called complete-case analysis, i.e., performing the analysis on only a complete dataset after excluding records with missing covariates. Performing a complete-case analysis is convenient with existing statistical packages, but it may be inefficient since the observed outcomes and covariates on those records with missing covariates are not used. It can even give misleading statistical inference if missing is not completely at random. This paper introduces a joint estimating equation (JEE) for regression analysis in the presence of missing observations on one covariate, which may be thought of as a method in a general framework for the missing covariate data problem proposed by Robins, Rotnitzky, and Zhao (1994, Journal of the American Statistical Association 89, 846-866). A generalization of JEE to more than one such covariate is discussed. The JEE is generally applicable to estimating regression coefficients from a regression model, including linear and logistic regression. Provided that the missing covariate data is either missing completely at random or missing at random (in addition to mild regularity conditions), estimates of regression coefficients from the JEE are consistent and have an asymptotic normal distribution. Simulation results show that the asymptotic distribution of estimated coefficients performs well in finite samples. Also shown through the simulation study is that the validity of JEE estimates depends on the correct specification of the probability function that characterizes the missing mechanism, suggesting a need for further research on how to robustify the estimation from making this nuisance assumption. Finally, the JEE is illustrated with an application from a case-control study of diet and thyroid cancer.

UI MeSH Term Description Entries
D007455 Iodine A nonmetallic element of the halogen group that is represented by the atomic symbol I, atomic number 53, and atomic weight of 126.90. It is a nutritionally essential element, especially important in thyroid hormone synthesis. In solution, it has anti-infective properties and is used topically. Iodine-127,Iodine 127
D011247 Pregnancy The status during which female mammals carry their developing young (EMBRYOS or FETUSES) in utero before birth, beginning from FERTILIZATION to BIRTH. Gestation,Pregnancies
D012044 Regression Analysis Procedures for finding the mathematical function which best describes the relationship between a dependent variable and one or more independent variables. In linear regression (see LINEAR MODELS) the relationship is constrained to be a straight line and LEAST-SQUARES ANALYSIS is used to determine the best fit. In logistic regression (see LOGISTIC MODELS) the dependent variable is qualitative rather than continuously variable and LIKELIHOOD FUNCTIONS are used to find the best relationship. In multiple regression, the dependent variable is considered to depend on more than a single independent variable. Regression Diagnostics,Statistical Regression,Analysis, Regression,Analyses, Regression,Diagnostics, Regression,Regression Analyses,Regression, Statistical,Regressions, Statistical,Statistical Regressions
D003627 Data Interpretation, Statistical Application of statistical procedures to analyze specific observed or assumed facts from a particular study. Data Analysis, Statistical,Data Interpretations, Statistical,Interpretation, Statistical Data,Statistical Data Analysis,Statistical Data Interpretation,Analyses, Statistical Data,Analysis, Statistical Data,Data Analyses, Statistical,Interpretations, Statistical Data,Statistical Data Analyses,Statistical Data Interpretations
D005260 Female Females
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D001699 Biometry The use of statistical and mathematical methods to analyze biological observations and phenomena. Biometric Analysis,Biometrics,Analyses, Biometric,Analysis, Biometric,Biometric Analyses
D013964 Thyroid Neoplasms Tumors or cancer of the THYROID GLAND. Cancer of Thyroid,Thyroid Cancer,Cancer of the Thyroid,Neoplasms, Thyroid,Thyroid Adenoma,Thyroid Carcinoma,Adenoma, Thyroid,Adenomas, Thyroid,Cancer, Thyroid,Cancers, Thyroid,Carcinoma, Thyroid,Carcinomas, Thyroid,Neoplasm, Thyroid,Thyroid Adenomas,Thyroid Cancers,Thyroid Carcinomas,Thyroid Neoplasm
D014801 Vitamin A Retinol and derivatives of retinol that play an essential role in metabolic functioning of the retina, the growth of and differentiation of epithelial tissue, the growth of bone, reproduction, and the immune response. Dietary vitamin A is derived from a variety of CAROTENOIDS found in plants. It is enriched in the liver, egg yolks, and the fat component of dairy products. Retinol,11-cis-Retinol,3,7-dimethyl-9-(2,6,6-trimethyl-1-cyclohexen-1-yl)-2,4,6,8-nonatetraen-1-ol, (all-E)-Isomer,All-Trans-Retinol,Aquasol A,Vitamin A1,All Trans Retinol
D015233 Models, Statistical Statistical formulations or analyses which, when applied to data and found to fit the data, are then used to verify the assumptions and parameters used in the analysis. Examples of statistical models are the linear model, binomial model, polynomial model, two-parameter model, etc. Probabilistic Models,Statistical Models,Two-Parameter Models,Model, Statistical,Models, Binomial,Models, Polynomial,Statistical Model,Binomial Model,Binomial Models,Model, Binomial,Model, Polynomial,Model, Probabilistic,Model, Two-Parameter,Models, Probabilistic,Models, Two-Parameter,Polynomial Model,Polynomial Models,Probabilistic Model,Two Parameter Models,Two-Parameter Model

Related Publications

L P Zhao, and S Lipsitz, and D Lew
August 2002, Statistics in medicine,
L P Zhao, and S Lipsitz, and D Lew
September 2010, Journal of the American Statistical Association,
L P Zhao, and S Lipsitz, and D Lew
September 1999, Biometrics,
L P Zhao, and S Lipsitz, and D Lew
March 2011, Statistics in medicine,
L P Zhao, and S Lipsitz, and D Lew
October 2016, Lifetime data analysis,
L P Zhao, and S Lipsitz, and D Lew
April 2013, Biostatistics (Oxford, England),
L P Zhao, and S Lipsitz, and D Lew
September 1997, Biometrics,
L P Zhao, and S Lipsitz, and D Lew
April 2014, Statistica Sinica,
L P Zhao, and S Lipsitz, and D Lew
November 1997, Statistics in medicine,
Copied contents to your clipboard!