Bayesian Interpretation

X=(X1,...,Xp) β=(β0,β1,...,βp)T P(β|X,Y)f(Y|X,β)P(β|X)=f(Y|X,β)P(β)

Often the prior takes the form:

P(β)=pj=1g(βj)

  • Gaussian prior for each β corresponds to ridge regression.
  • Double exponential prior for each β corresponds to lasso.