9.2 Separating Hyperplane

  • Consider a matrix X of dimensions np, and a yi{1,1}. We have a new observation, x, which is a vector x=(x1...xp)T which we wish to classify to one of two groups.
  • We will use a separating hyperplane to classify the observation.

  • We can label the blue observations as yi=1 and the pink observations as yi=1.
  • Thus, a separating hyperplane has the property s.t. β0+β1Xi1+β2Xi2...+βpXip>0 if yi=1 and β0+β1Xi1+β2Xi2...+βpXip<0 if yi=1.
  • In other words, a separating hyperplane has the property s.t. yi(β0+β1Xi1+β2Xi2...+βpXip)>0 for all i=1...n.
  • Consider also the magnitude of f(x). If it is far from zero, we are confident in its classification, whereas if it is close to 0, then x is located near the hyperplane, and we are less confident about its classification.