Bayesian Interpretation
X=(X1,...,Xp) β=(β0,β1,...,βp)T
P(β|X,Y)∝f(Y|X,β)P(β|X)=f(Y|X,β)P(β)
Often the prior takes the form:
P(β)=p∏j=1g(βj)
- Gaussian prior for each β corresponds to ridge regression.
- Double exponential prior for each β corresponds to lasso.
