Probit Approximation
- avoid long computation time
- assume likelihood is also normally distributed
\[p(w|D) = \text{N}(w|\mu, \Sigma)\]
Approximation of Posterior
- sigmoid is similar to normal CDF
\[\sigma(a) \approx \Phi\left(\frac{a\sqrt{\pi}}{\sqrt{8}}\right)\]
- approx posterior via sigmoid
\[\begin{array}{rcccl} p(y=1|x,D) & = & \sigma\left(\frac{m}{\sqrt{1+\frac{\pi v}{8}}}\right) \\ m & = & \text{E}[a] & = & x^{T}\mu \\ v & = & \text{V}[a] & = & x^{T}\Sigma x \\ a & = & x^{T}w \end{array}\]
- produces estimates that are closer to the decision boundary