Bayesian Interpretation

\[X = (X_1, ..., X_p)\] \[\beta = (\beta_0, \beta_1, ..., \beta_p)^T\] \[P(\beta|X, Y) \propto f(Y|X,\beta)P(\beta|X) = f(Y|X, \beta)P(\beta)\]

Often the prior takes the form:

\[P(\beta) = \prod_{j=1}^p{g(\beta_j)}\]

  • Gaussian prior for each \(\beta\) corresponds to ridge regression.
  • Double exponential prior for each \(\beta\) corresponds to lasso.