4.4 Generative Models for Classification
Why Logistic Regression is not ideal?
When there is substantial separation between the two classes, the parameter estimates for the logistic regression model are surprisingly unstable.
If the distribution of the predictors X is approximately normal in each of the classes and the sample size is small, then the generative modelling may be more accurate than logistic regression.
Generative modelling can be naturally extended to the case of more than two response classes.
Common notations:
- K \(\Longrightarrow\) response class
\(π_k \Longrightarrow\) overall or prior probability that a randomly chosen observation comes from the prior kth class; can be obtained from the random sample from the population
\(f_k(X) ≡ Pr(X|Y = k)^1 \Longrightarrow\) the density function of X density for an observation that comes from the kth class; requires some underlying assumption to estimate
Bayes’ theorem states that
\[Pr(Y = k|X = x) = \frac {π_k f_k(x)}{\sum_{l =1}^{k} π_lf_l(x)}\]
- \(p_k(x) = Pr(Y = k|X = x) \Longrightarrow\) posterior probability that an observation posterior X = x belongs to the kth class; computed from \(f_k(X)\)