3.8 Qualitative Predictors
- Dummy variables: if there are \(k\) levels, introduce \(k-1\) dummy variables which are equal to one (“one hot”) when the underlying qualitative predictor takes that value. For example if there are 3 levels, introduce two new dummy variables and fit the model:
\[y_i = \beta_0 + \beta_1 x_{i1} + \beta_2 x_{i2} + \epsilon_i\]
Qualitative Predicitor | \(x_{i1}\) | \(x_{i2}\) |
---|---|---|
level 0 (baseline) | 0 | 0 |
level 1 | 1 | 0 |
level 2 | 0 | 1 |
Coefficients are interpreted the average effect relative to the baseline.
Alternative is to use index variables, a different coefficient for each level:
\[y_i = \beta_{0 1} + \beta_{0 2} +\beta_{0 3} + \epsilon_i\]