A Note on Support Vector Machines Degeneracy
Author(s)
Rifkin, Ryan; Pontil, Massimiliano; Verri, Alessandro
DownloadAIM-1661.ps (1.065Mb)
Additional downloads
Metadata
Show full item recordAbstract
When training Support Vector Machines (SVMs) over non-separable data sets, one sets the threshold $b$ using any dual cost coefficient that is strictly between the bounds of $0$ and $C$. We show that there exist SVM training problems with dual optimal solutions with all coefficients at bounds, but that all such problems are degenerate in the sense that the "optimal separating hyperplane" is given by ${f w} = {f 0}$, and the resulting (degenerate) SVM will classify all future points identically (to the class that supplies more training data). We also derive necessary and sufficient conditions on the input data for this to occur. Finally, we show that an SVM training problem can always be made degenerate by the addition of a single data point belonging to a certain unboundedspolyhedron, which we characterize in terms of its extreme points and rays.
Date issued
1999-08-11Other identifiers
AIM-1661
CBCL-177
Series/Report no.
AIM-1661CBCL-177
Keywords
AI, MIT, Artificial Intelligence, Support Vector Machines, Scale Sensitive Loss Function, Statistical Learning Theory.