1 Class Conditional Nearest Neighbor and Large Margin Instance Selection
BibTeX
@MISC{Marchiori_1class,
author = {Elena Marchiori},
title = {1 Class Conditional Nearest Neighbor and Large Margin Instance Selection},
year = {}
}
OpenURL
Abstract
Abstract—The one nearest neighbor (1-NN) rule uses instance proximity followed by class labeling information for classifying new instances. This paper presents a framework for studying properties of the training set related to proximity and labeling information, in order to improve the performance of the 1-NN rule. To this aim, a so-called class conditional nearest neighbor (c.c.n.n.) relation is introduced, consisting of those pairs of training instances (a, b) such that b is the nearest neighbor of a among those instances (excluded a) in one of the classes of the training set. A graph-based representation of c.c.n.n. is used for a comparative analysis of c.c.n.n. and of other interesting proximity-based concepts. In particular, a scoring function on instances is introduced, which measures the effect of removing one instance on the hypothesis-margin of other instances. This scoring function is employed to develop an effective large margin instance selection algorithm, which is empirically demonstrated to improve storage and accuracy performance of the 1-NN rule on artificial and real-life data sets.







