Processing math: 100%
Separating Hyperplane
- Consider a matrix X of dimensions n∗p, and a yi∈{−1,1}. We have a new observation, x∗, which is a vector x∗=(x∗1...x∗p)T which we wish to classify to one of two groups.
- We will use a separating hyperplane to classify the observation.

- We can label the blue observations as yi=1 and the pink observations as yi=−1.
- Thus, a separating hyperplane has the property s.t. β0+β1Xi1+β2Xi2...+βpXip>0 if yi=1 and β0+β1Xi1+β2Xi2...+βpXip<0 if yi=−1.
- In other words, a separating hyperplane has the property s.t. yi(β0+β1Xi1+β2Xi2...+βpXip)>0 for all i=1...n.
- Consider also the magnitude of f(x∗). If it is far from zero, we are confident in its classification, whereas if it is close to 0, then x∗ is located near the hyperplane, and we are less confident about its classification.