Given a set of labelled data $\mathcal{D} = \{(\mathbf{x}_n, t_n)\}_{n=1}^{N}$ (a data set of observations $\mathbf{x}_n$, each with a class label $t_n = k$ if $\mathbf{x}_n$ belongs to class $\mathcal{C}_k$).

We wish to construct the posterior probabilities, or class probabilities, $p(\mathcal{C}_k \mid \mathbf{x})$, from the given data $\mathcal{D}$. There are two methods to do this:

  • Generative - by means of Naive Bayes
    1. Estimate the class-conditional probabilities $p(\mathbf{x} \mid \mathcal{C}_k)$ (together with the class priors $p(\mathcal{C}_k)$) - these can be used to generate new data points, hence the name ‘generative model’
    2. Calculate the posterior probability using Bayes’ Theorem (a sketch of this route follows the list):
       $$p(\mathcal{C}_k \mid \mathbf{x}) = \frac{p(\mathbf{x} \mid \mathcal{C}_k)\, p(\mathcal{C}_k)}{p(\mathbf{x})}$$
  • Discriminative - by means of Logistic Regression
    1. Directly compute $p(\mathcal{C}_k \mid \mathbf{x})$ without first calculating the class-conditionals
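
To make the generative route concrete, here is a minimal sketch. It assumes Gaussian class-conditionals with a diagonal covariance (i.e. Gaussian Naive Bayes) and a toy 1-D data set; each class's mean, standard deviation and prior are estimated from that class's data alone, and Bayes' Theorem then turns the class-conditionals into posteriors.

```python
# A minimal sketch of the generative route: fit a Gaussian with diagonal
# covariance to each class independently (Gaussian Naive Bayes), then turn
# the class-conditionals into posteriors via Bayes' Theorem.
import numpy as np
from scipy.stats import norm

def fit_generative(X, t):
    """Estimate p(x | C_k) and p(C_k) separately for each class k."""
    params = {}
    for k in np.unique(t):
        X_k = X[t == k]                     # only this class's data is used
        params[k] = {
            "prior": len(X_k) / len(X),     # p(C_k)
            "mean": X_k.mean(axis=0),       # per-feature Gaussian mean
            "std": X_k.std(axis=0) + 1e-9,  # per-feature Gaussian std
        }
    return params

def posterior(params, x):
    """Apply Bayes' Theorem: p(C_k | x) is proportional to p(x | C_k) p(C_k)."""
    joint = np.array([
        p["prior"] * np.prod(norm.pdf(x, p["mean"], p["std"]))
        for p in params.values()
    ])
    return joint / joint.sum()              # normalise by p(x)

# Toy 1-D example: two classes with different means.
rng = np.random.default_rng(0)
X = np.concatenate([rng.normal(0, 1, (50, 1)), rng.normal(3, 1, (50, 1))])
t = np.array([0] * 50 + [1] * 50)
print(posterior(fit_generative(X, t), np.array([2.0])))
```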

So which one do I choose?

Well, here’s a quick comparison:

| Generative | Discriminative |
| --- | --- |
| More flexible | Less flexible |
| Less efficient for classification | More efficient for classification |
| Simpler training (per class) | Harder training |
| Uses each class’s data | Uses all data |
| Models each class | Focusses on class differences |

In other words: The generative model is trained per class and ignores the properties of the other classes, while the discriminative model considers all data during training.
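
As a quick usage-level contrast, the sketch below trains both kinds of model on the same toy data, using scikit-learn's GaussianNB and LogisticRegression purely as convenient stand-ins for the two families: GaussianNB fits each class separately and applies Bayes' Theorem, while LogisticRegression maximises the likelihood of the labels over all of the training data at once. Both end up exposing posteriors through predict_proba.

```python
# Side-by-side of the two routes on the same toy data, with scikit-learn's
# GaussianNB (generative) and LogisticRegression (discriminative) as examples.
import numpy as np
from sklearn.naive_bayes import GaussianNB
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = np.concatenate([rng.normal(0, 1, (50, 2)), rng.normal(3, 1, (50, 2))])
t = np.array([0] * 50 + [1] * 50)

# Generative: fits a Gaussian to each class separately, posteriors via Bayes.
gen = GaussianNB().fit(X, t)
# Discriminative: fits p(C_k | x) directly from all of the training data.
dis = LogisticRegression().fit(X, t)

x_new = np.array([[2.0, 2.0]])
print("generative posterior:    ", gen.predict_proba(x_new))
print("discriminative posterior:", dis.predict_proba(x_new))
```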