Some results on extensions and modifications of the Theil-Sen regression estimator. 2004

Rand R Wilcox
University of Southern California, Los Angeles, CA 90089-1061, USA. rwilcox@usc.edu

Many robust regression estimators have been proposed that have a high, finite-sample breakdown point, roughly meaning that a large proportion of points must be altered to drive the value of an estimator to infinity. But despite this, many of them can be inordinately influenced by two properly placed outliers. With one predictor, an estimator that appears to correct this problem to a fair degree, and simultaneously maintain good efficiency when standard assumptions are met, consists of checking for outliers using a projection-type method, removing any that are found, and applying the Theil-Sen estimator to the data that remain. When dealing with multiple predictors, there are two generalizations of the Theil-Sen estimator that might be used, but nothing is known about how their small-sample properties compare. Also, there are no results on testing the hypothesis of zero slopes, and there is no information about the effect on efficiency when outliers are removed. In terms of hypothesis testing, using the more obvious percentile bootstrap method in conjunction with a slight modification of Mahalanobis distance was found to avoid Type I error probabilities above the nominal level, but in some situations the actual Type I error probabilities can be substantially smaller than intended when the sample size is small. An alternative method is found to be more satisfactory.

UI MeSH Term Description Entries
D008962 Models, Theoretical Theoretical representations that simulate the behavior or activity of systems, processes, or phenomena. They include the use of mathematical equations, computers, and other electronic equipment. Experimental Model,Experimental Models,Mathematical Model,Model, Experimental,Models (Theoretical),Models, Experimental,Models, Theoretic,Theoretical Study,Mathematical Models,Model (Theoretical),Model, Mathematical,Model, Theoretical,Models, Mathematical,Studies, Theoretical,Study, Theoretical,Theoretical Model,Theoretical Models,Theoretical Studies
D011584 Psychology The science dealing with the study of mental processes and behavior in man and animals. Factors, Psychological,Psychological Factors,Psychological Side Effects,Psychologists,Psychosocial Factors,Side Effects, Psychological,Factor, Psychological,Factor, Psychosocial,Factors, Psychosocial,Psychological Factor,Psychological Side Effect,Psychologist,Psychosocial Factor,Side Effect, Psychological
D012044 Regression Analysis Procedures for finding the mathematical function which best describes the relationship between a dependent variable and one or more independent variables. In linear regression (see LINEAR MODELS) the relationship is constrained to be a straight line and LEAST-SQUARES ANALYSIS is used to determine the best fit. In logistic regression (see LOGISTIC MODELS) the dependent variable is qualitative rather than continuously variable and LIKELIHOOD FUNCTIONS are used to find the best relationship. In multiple regression, the dependent variable is considered to depend on more than a single independent variable. Regression Diagnostics,Statistical Regression,Analysis, Regression,Analyses, Regression,Diagnostics, Regression,Regression Analyses,Regression, Statistical,Regressions, Statistical,Statistical Regressions
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
Copied contents to your clipboard!