NBC Math

MLEs

  • binary features

$$\hat{\theta}_{dc} = \frac{N_{dc}}{N_c}$$

  • discrete features

$$\hat{\theta}_{dck} = \frac{N_{dck}}{N_c}$$

  • numerical features

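$$\hat{\mu}_{dc} = \frac{1}{N_c} \sum_{n : y_n = c} x_{nd}, \qquad \hat{\sigma}^2_{dc} = \frac{1}{N_c} \sum_{n : y_n = c} (x_{nd} - \hat{\mu}_{dc})^2$$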

  • MAP: add-one smoothing

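$$\bar{\theta}_{dc} = \frac{1 + N_{dc1}}{2 + N_{dc}}$$

Here $N_{dc1}$ is the number of class-$c$ examples with $x_d = 1$, and $N_{dc} = N_{dc0} + N_{dc1}$ is the total count for feature $d$ in class $c$.

$$p(y = c \mid x, \mathcal{D}) \propto \bar{\pi}_c \prod_d \prod_k \bar{\theta}_{dck}^{\mathbb{I}(x_d = k)}$$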
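The estimates above can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation: the function names `fit_binary` and `fit_gaussian` and the toy data are hypothetical, and it assumes no feature values are missing, so the smoothed denominator count equals $N_c$.

```python
from collections import Counter


def fit_binary(X, y, smooth=False):
    """Estimate p(x_d = 1 | y = c) for binary features.

    smooth=False: MLE, N_dc / N_c.
    smooth=True:  add-one smoothing, (1 + N_dc) / (2 + N_c).
    Assumes every feature is observed in every example.
    """
    n_features = len(X[0])
    n_c = Counter(y)                              # N_c: examples per class
    n_dc = {c: [0] * n_features for c in n_c}     # N_dc: times feature d is on in class c
    for row, label in zip(X, y):
        for d, value in enumerate(row):
            n_dc[label][d] += value
    a, b = (1, 2) if smooth else (0, 0)
    return {
        c: [(a + n_dc[c][d]) / (b + n_c[c]) for d in range(n_features)]
        for c in n_c
    }


def fit_gaussian(X, y):
    """Per-class, per-feature MLE of the mean and the (biased) variance."""
    params = {}
    for c in set(y):
        rows = [row for row, label in zip(X, y) if label == c]
        n = len(rows)                             # N_c
        mu = [sum(col) / n for col in zip(*rows)]
        var = [
            sum((v - m) ** 2 for v in col) / n
            for col, m in zip(zip(*rows), mu)
        ]
        params[c] = (mu, var)
    return params


# Hypothetical toy data: two binary features, classes 0 and 1.
X = [[1, 0], [1, 1], [0, 0], [0, 1]]
y = [0, 0, 0, 1]

# Class 1 never has feature 0 switched on, so its MLE is exactly zero;
# add-one smoothing moves that estimate away from zero.
print(fit_binary(X, y))
print(fit_binary(X, y, smooth=True))
print(fit_gaussian([[1.0], [3.0], [10.0]], [0, 0, 1]))
```

The zero estimate in the unsmoothed case is what motivates smoothing: a single zero factor would force the whole class posterior to zero for any test point with that feature on.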

Imputation

Suppose that we are missing the value of $x_j$.

  • Gaussian discriminant analysis

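$$p(y = c \mid x_{i \neq j}, \theta) \propto p(y = c) \sum_{x_j} p(x_j, x_{i \neq j} \mid y = c, \theta)$$

where $x_{i \neq j}$ denotes every feature except $x_j$; the sum becomes an integral when $x_j$ is continuous.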

  • Naive Bayes classifier

$$\sum_{x_j} p(x_j, x_{i \neq j} \mid y = c, \theta) = \prod_{i \neq j}^{D} p(x_i \mid \theta_{ic})$$
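The naive Bayes identity above says that a missing feature can simply be skipped, because its factor sums to one. A minimal sketch for binary features follows; the function names, priors, and parameter values are hypothetical, and `None` is an assumed convention for marking a missing value. The second function marginalizes by explicit summation so the two results can be compared.

```python
def unnormalized(x, c, priors, theta):
    """p(y = c) times the product of p(x_d | theta_dc) over observed features."""
    score = priors[c]
    for d, value in enumerate(x):
        if value is None:
            continue  # the missing feature's factor sums to 1, so it drops out
        p_on = theta[c][d]
        score *= p_on if value == 1 else 1.0 - p_on
    return score


def posterior(x, priors, theta):
    """Class posterior given only the observed entries of x."""
    scores = {c: unnormalized(x, c, priors, theta) for c in priors}
    total = sum(scores.values())
    return {c: s / total for c, s in scores.items()}


def posterior_by_summation(x, j, priors, theta):
    """Same posterior, computed by summing the joint over both values of x_j."""
    scores = {}
    for c in priors:
        scores[c] = sum(
            unnormalized(x[:j] + [v] + x[j + 1:], c, priors, theta)
            for v in (0, 1)
        )
    total = sum(scores.values())
    return {c: s / total for c, s in scores.items()}


# Hypothetical parameters: theta[c][d] = p(x_d = 1 | y = c).
priors = {0: 0.5, 1: 0.5}
theta = {0: [0.9, 0.2], 1: [0.3, 0.7]}

x = [1, None]  # feature 1 is missing
print(posterior(x, priors, theta))
print(posterior_by_summation(x, 1, priors, theta))
```

For Gaussian discriminant analysis the marginalization is an integral rather than a sum; for a Gaussian class-conditional density it amounts to dropping the entry for $x_j$ from the class mean and the matching row and column from the covariance.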