Bayesian Logistic Regression
In addition to point estimates, we may want to measure uncertainty
- need to approximate posterior distribution (MAP: \(\hat{w}\))
\[p(y|x,D) \approx \displaystyle\int \! p(y|x,w)\delta(w - \hat{w}) \, dw = p(y|x,\hat{w})\] * comparative advantage with smaller data sets
Laplace Approximation
For a unique solution, we employ a spherical Gaussian prior
\[\text{N}(w|0, \sigma^{2}I)\]
- informative prior (small \(\sigma^{2}\)): better sensitivity
- vague prior (large \(\sigma^{2}\)): better specificity