Large margin nearest neighbor classifiers. 2005

Carlotta Domeniconi, Dimitrios Gunopulos, and Jing Peng
Computer Science Department, University of California, Riverside, CA 92521, USA. carlotta@ise.gmu.edu

The nearest neighbor technique is a simple and appealing approach to addressing classification problems. It relies on the assumption of locally constant class conditional probabilities. This assumption becomes invalid in high dimensions with a finite number of examples due to the curse of dimensionality. Severe bias can be introduced under these conditions when using the nearest neighbor rule. The employment of a locally adaptive metric becomes crucial in order to keep class conditional probabilities close to uniform, thereby minimizing the bias of estimates. We propose a technique that computes a locally flexible metric by means of support vector machines (SVMs). The decision function constructed by SVMs is used to determine the most discriminant direction in a neighborhood around the query. Such a direction provides a local feature weighting scheme. We formally show that our method increases the margin in the weighted space where classification takes place. Moreover, our method has the important advantage of online computational efficiency over competing locally adaptive techniques for nearest neighbor classification. We demonstrate the efficacy of our method using both real and simulated data.
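
The local weighting idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a scalar decision function `f` is already available (for example from a trained SVM), approximates the most discriminant direction by the gradient of `f` at the query itself, and turns that direction into feature weights with an exponential scheme. The names `numerical_gradient`, `local_weights`, `weighted_knn`, and the `sharpness` parameter are hypothetical, and the linear `f` in the demo is only a stand-in for a real SVM decision function.

```python
import math
from collections import Counter


def numerical_gradient(f, x, eps=1e-5):
    """Central-difference gradient of a scalar decision function f at x."""
    grad = []
    for j in range(len(x)):
        hi, lo = list(x), list(x)
        hi[j] += eps
        lo[j] -= eps
        grad.append((f(hi) - f(lo)) / (2 * eps))
    return grad


def local_weights(f, query, sharpness=5.0):
    """Feature weights from the gradient direction of f at the query.

    Assumption: relevance of feature j is |g_j| / ||g||, mapped to weights
    with an exponential scheme so that the weights sum to 1.
    """
    grad = numerical_gradient(f, query)
    norm = math.sqrt(sum(g * g for g in grad))
    if norm == 0.0:
        # No discriminant direction found: fall back to uniform weights.
        return [1.0 / len(query)] * len(query)
    relevance = [abs(g) / norm for g in grad]
    exps = [math.exp(sharpness * r) for r in relevance]
    total = sum(exps)
    return [e / total for e in exps]


def weighted_knn(f, X, y, query, k=3, sharpness=5.0):
    """Majority vote among the k nearest points under the weighted distance."""
    w = local_weights(f, query, sharpness)

    def dist(p):
        return math.sqrt(sum(wj * (pj - qj) ** 2 for wj, pj, qj in zip(w, p, query)))

    nearest = sorted(range(len(X)), key=lambda i: dist(X[i]))[:k]
    return Counter(y[i] for i in nearest).most_common(1)[0][0]


# Toy data: the class depends only on feature 0; feature 1 is irrelevant.
X = [(-1.0, 0.0), (-1.2, 5.0), (-0.8, -5.0), (1.0, 0.2), (1.1, 5.0), (0.9, -5.0)]
y = [-1, -1, -1, 1, 1, 1]
f = lambda x: x[0]  # stand-in for a trained SVM decision function
query = (0.3, 5.0)

print(local_weights(f, query))
print(weighted_knn(f, X, y, query, k=3))
```

In this toy setting nearly all of the weight falls on feature 0, so distances along the irrelevant feature are shrunk before the neighbors are selected. With a real SVM, `f` would be replaced by the trained model's decision function.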

UI | MeSH Term | Description | Entries
D008954 | Models, Biological | Theoretical representations that simulate the behavior or activity of biological processes or diseases. For disease models in living animals, DISEASE MODELS, ANIMAL is available. Biological models include the use of mathematical equations, computers, and other electronic equipment. | Biological Model,Biological Models,Model, Biological,Models, Biologic,Biologic Model,Biologic Models,Model, Biologic
D009716 | Numerical Analysis, Computer-Assisted | Computer-assisted study of methods for obtaining useful quantitative solutions to problems that have been expressed mathematically. | Analysis, Computer-Assisted Numerical,Computer-Assisted Numerical Analysis,Analyses, Computer-Assisted Numerical,Analysis, Computer Assisted Numerical,Computer Assisted Numerical Analysis,Computer-Assisted Numerical Analyses,Numerical Analyses, Computer-Assisted,Numerical Analysis, Computer Assisted
D010363 | Pattern Recognition, Automated | In INFORMATION RETRIEVAL, machine-sensing or identification of visible patterns (shapes, forms, and configurations). (Harrod's Librarians' Glossary, 7th ed) | Automated Pattern Recognition,Pattern Recognition System,Pattern Recognition Systems
D001943 | Breast Neoplasms | Tumors or cancer of the human BREAST. | Breast Cancer,Breast Tumors,Cancer of Breast,Breast Carcinoma,Cancer of the Breast,Human Mammary Carcinoma,Malignant Neoplasm of Breast,Malignant Tumor of Breast,Mammary Cancer,Mammary Carcinoma, Human,Mammary Neoplasm, Human,Mammary Neoplasms, Human,Neoplasms, Breast,Tumors, Breast,Breast Carcinomas,Breast Malignant Neoplasm,Breast Malignant Neoplasms,Breast Malignant Tumor,Breast Malignant Tumors,Breast Neoplasm,Breast Tumor,Cancer, Breast,Cancer, Mammary,Cancers, Mammary,Carcinoma, Breast,Carcinoma, Human Mammary,Carcinomas, Breast,Carcinomas, Human Mammary,Human Mammary Carcinomas,Human Mammary Neoplasm,Human Mammary Neoplasms,Mammary Cancers,Mammary Carcinomas, Human,Neoplasm, Breast,Neoplasm, Human Mammary,Neoplasms, Human Mammary,Tumor, Breast
D003198 | Computer Simulation | Computer-based representation of physical systems and phenomena such as chemical processes. | Computational Modeling,Computational Modelling,Computer Models,In silico Modeling,In silico Models,In silico Simulation,Models, Computer,Computerized Models,Computer Model,Computer Simulations,Computerized Model,In silico Model,Model, Computer,Model, Computerized,Model, In silico,Modeling, Computational,Modeling, In silico,Modelling, Computational,Simulation, Computer,Simulation, In silico,Simulations, Computer
D003205 | Computing Methodologies | Computer-assisted analysis and processing of problems in a particular area. | High Performance Computing,Methodologies, Computing,Computing Methodology,Computing, High Performance,Methodology, Computing,Performance Computing, High
D003661 | Decision Support Techniques | Mathematical or statistical procedures used as aids in making a decision. They are frequently used in medical decision-making. | Decision Analysis,Decision Modeling,Models, Decision Support,Analysis, Decision,Decision Aids,Decision Support Technics,Aid, Decision,Aids, Decision,Analyses, Decision,Decision Aid,Decision Analyses,Decision Support Model,Decision Support Models,Decision Support Technic,Decision Support Technique,Model, Decision Support,Modeling, Decision,Technic, Decision Support,Technics, Decision Support,Technique, Decision Support,Techniques, Decision Support
D003920 | Diabetes Mellitus | A heterogeneous group of disorders characterized by HYPERGLYCEMIA and GLUCOSE INTOLERANCE. |
D003936 | Diagnosis, Computer-Assisted | Application of computer programs designed to assist the physician in solving a diagnostic problem. | Computer-Assisted Diagnosis,Computer Assisted Diagnosis,Computer-Assisted Diagnoses,Diagnoses, Computer-Assisted,Diagnosis, Computer Assisted
D006801 | Humans | Members of the species Homo sapiens. | Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
