9.6 Mathematics of the SVC

  • The SVC classifies a test observation based on which side of the hyperplane it lies.

maxβ0...βp,ϵ1...ϵn,M M subject to pj=1β2j=1 yi(β0+β1Xi1+β2Xi2...+βpXip)M(1ϵi) ϵi0,ni=1ϵiC

  • C is a nonnegative tuning parameter, typically chosen through cross-validation, and can be thought of as the budget for margin violation by the observations.

  • The ϵi are slack variables that allow individual observations to be on the wrong side of the margin or hyperplane. The ϵi indicates where the ith observation is located with regards to the margin and hyperplane.

    • If ϵi=0, the observation is on the correct side of the margin.
    • If ϵi>0, the observation is on the wrong side of margin
    • If ϵi>1, the observation is on the wrong side of the hyperplane.
  • Since C constrains the sum of the ϵi, it determines the number and magnitude of violations to the margin. If C=0, there is no margin for violation, thus all the ϵ1,...,ϵn=0.

  • Note that if C>0, no more than C observations can be on wrong side of hyperplane, since in these cases ϵi>1.