Supervised Learning
Template Matching
Nearest Mean

Note:
- the Mahalanobis distance is invariant under linear feature transforms (robust to feature scaling)
- multi-class classifier
- fails on datasets that are not linearly separable.
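A minimal sketch of a nearest-mean classifier with the Mahalanobis distance, using NumPy only; the toy data, function names, and the simple shared covariance estimate are assumptions made purely for illustration:

```python
import numpy as np

def fit_nearest_mean(X, y):
    """Estimate one mean per class and a shared covariance matrix."""
    classes = np.unique(y)
    means = np.array([X[y == c].mean(axis=0) for c in classes])
    cov = np.cov(X, rowvar=False)        # covariance of the pooled training data (sketch)
    return classes, means, np.linalg.inv(cov)

def predict_nearest_mean(x, classes, means, cov_inv):
    """Assign x to the class whose mean is closest in Mahalanobis distance."""
    diffs = means - x                    # shape: (n_classes, d)
    d2 = np.einsum('ij,jk,ik->i', diffs, cov_inv, diffs)
    return classes[np.argmin(d2)]

# toy example with two classes
X = np.array([[0., 0.], [1., 0.], [0., 1.], [5., 5.], [6., 5.], [5., 6.]])
y = np.array([0, 0, 0, 1, 1, 1])
params = fit_nearest_mean(X, y)
print(predict_nearest_mean(np.array([0.5, 0.5]), *params))  # -> 0
```

With the identity matrix in place of $C^{-1}$ this reduces to the ordinary Euclidean nearest-mean rule.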
K-Nearest Neighbor

Note:
- simple and very robust
- k = 1 tends to overfit (zero training error); larger k smooths the decision boundary
- multi-class classifier
- nearest-neighbor search is expensive; special data structures such as k-d trees or octrees are needed to speed it up.
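A hedged sketch with scikit-learn's KNeighborsClassifier, which can build a k-d tree internally to speed up the neighbor search; the synthetic two-cluster data and the choice k = 5 are illustrative assumptions:

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(4, 1, (50, 2))])
y = np.array([0] * 50 + [1] * 50)

# k > 1 smooths the decision boundary; the k-d tree accelerates the neighbor search
knn = KNeighborsClassifier(n_neighbors=5, algorithm='kd_tree')
knn.fit(X, y)
print(knn.predict([[0.0, 0.0], [4.0, 4.0]]))   # -> [0 1]
```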
Bayes plug-in
Gaussian Classifier

Note:
- reduces to the nearest mean classifier if all classes share the same covariance matrix C (classification is then by distance to the class means $\mu_j$)
- needs a lot of training data, usually around 10 times the feature-vector length d
- fails on datasets that are not linearly separable.
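The plug-in rule can be sketched with scikit-learn's QuadraticDiscriminantAnalysis, which estimates one mean $\mu_j$ and one covariance $C_j$ per class; the synthetic data below is only for illustration:

```python
import numpy as np
from sklearn.discriminant_analysis import QuadraticDiscriminantAnalysis

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 1, (200, 2)), rng.normal(3, 2, (200, 2))])
y = np.array([0] * 200 + [1] * 200)

# plug-in rule: estimate mu_j and C_j per class, then pick the class
# with the largest posterior p(j | x) ~ p(x | mu_j, C_j) * P(j)
gauss = QuadraticDiscriminantAnalysis(store_covariance=True)
gauss.fit(X, y)
print(gauss.predict([[0.0, 0.0], [4.0, 4.0]]))  # typically -> [0 1]
```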
Naive Gaussian
assume the covariance matrix C is diagonal; this reduces training time and the amount of training data required.
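This corresponds, for example, to scikit-learn's GaussianNB; a minimal sketch on made-up toy data:

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB

X = np.array([[0., 0.], [1., 0.], [0., 1.], [5., 5.], [6., 5.], [5., 6.]])
y = np.array([0, 0, 0, 1, 1, 1])

# diagonal covariance: only d variances per class are estimated,
# instead of the full d*(d+1)/2 entries of C
nb = GaussianNB()
nb.fit(X, y)
print(nb.predict([[0.5, 0.5], [5.5, 5.5]]))   # -> [0 1]
```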
Gaussian Mixture Model (GMM)


Note:
- versatile, can approximate many real-life distributions
- sensitive to the choice or estimation of the model orders $M_j$
- non-convex optimization: EM may converge to a local optimum and is sensitive to initialization
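A sketch with scikit-learn's GaussianMixture, which fits the mixture by EM; the model order, covariance type, and number of EM restarts below are illustrative assumptions:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(2)
# synthetic two-component data for a single class
X = np.vstack([rng.normal(0, 1, (300, 2)), rng.normal(5, 1, (300, 2))])

# n_components is the model order M_j; n_init restarts EM several times
# to reduce the risk of ending up in a poor local optimum
gmm = GaussianMixture(n_components=2, covariance_type='full', n_init=5, random_state=0)
gmm.fit(X)
print(gmm.means_)                 # estimated component means
print(gmm.score_samples(X[:3]))   # per-sample log-likelihoods
```

For classification, one GMM would be fitted per class and the class with the highest likelihood (times prior) chosen.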
Discriminant Function
This approach can be considered as directly modeling the posterior without estimating the likelihood.
Neural Network
see the Deep Learning course
Support Vector Machine
basic ideas:
- binary classifier
- use a linear discriminant function
new ideas:
- non-linear feature mapping (kernel functions)
- maximum margin (instead of least squares)
- convex optimization
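As an example of such a kernel function, the RBF kernel computes inner products in an implicit (infinite-dimensional) feature space without ever computing the mapping explicitly:

$$
k(x, x') = \exp\!\left(-\gamma \,\lVert x - x'\rVert^2\right)
$$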
Hard margin SVM:

Note:
- the dataset must be linearly separable
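For reference, the standard hard-margin primal problem, a convex quadratic program that maximizes the margin $2/\lVert w\rVert$:

$$
\min_{w,\,b}\ \tfrac{1}{2}\lVert w\rVert^2 \quad \text{s.t.}\quad y_i\,(w^\top x_i + b) \ge 1 \quad \forall i
$$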
Soft margin SVM:

Note:
- can handle a significantly larger class of problems
- sensitive to the choice of the hyperparameters $\gamma$ and $C$; they have to be tuned, e.g. on a validation set (see the sketch below)
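In the soft-margin formulation, slack variables $\xi_i \ge 0$ relax the constraints to $y_i(w^\top x_i + b) \ge 1 - \xi_i$ and the objective gains a penalty term $C \sum_i \xi_i$. A sketch of tuning $C$ and $\gamma$ by grid search with scikit-learn; the synthetic, non-linearly-separable data and the grid values are assumptions for illustration:

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import GridSearchCV

rng = np.random.default_rng(3)
X = rng.normal(0, 1, (200, 2))
y = (X[:, 0] ** 2 + X[:, 1] ** 2 > 1).astype(int)   # not linearly separable

# C controls the margin/slack trade-off, gamma the RBF kernel width;
# both are tuned by cross-validated grid search
param_grid = {'C': [0.1, 1, 10, 100], 'gamma': [0.01, 0.1, 1]}
search = GridSearchCV(SVC(kernel='rbf'), param_grid, cv=5)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```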
Multi-class SVM:
one against one
- need to train $\tbinom{c}{2}$ SVMs, one for each pair of classes
- the class that wins the most pairwise comparisons is chosen
one against the rest
- need to train c SVMs
- choose the class with the highest $f(x)$
hierarchical
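Both reduction schemes are available as meta-estimators in scikit-learn; a hedged sketch on the 3-class Iris data (the kernel and hyperparameter values are illustrative):

```python
from sklearn.datasets import load_iris
from sklearn.multiclass import OneVsOneClassifier, OneVsRestClassifier
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)   # c = 3 classes

ovo = OneVsOneClassifier(SVC(kernel='rbf', C=1.0, gamma='scale'))   # trains C(3, 2) = 3 SVMs
ovr = OneVsRestClassifier(SVC(kernel='rbf', C=1.0, gamma='scale'))  # trains 3 SVMs

print(ovo.fit(X, y).predict(X[:5]))
print(ovr.fit(X, y).predict(X[:5]))
```

Note that scikit-learn's SVC already applies the one-against-one scheme internally when given more than two classes.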
Validation
k-fold cross-validation: (k − 2)/1/1 folds for training/validation/testing, rotating the folds so that each one serves once as the test set
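A sketch of this rotating split using scikit-learn's KFold; treating the fold after the test fold as the validation fold is one possible convention, assumed here, and the dummy data is only for illustration:

```python
import numpy as np
from sklearn.model_selection import KFold

X = np.arange(40).reshape(20, 2)   # dummy data
y = np.arange(20) % 2

k = 5
folds = list(KFold(n_splits=k, shuffle=True, random_state=0).split(X))
fold_indices = [test_idx for _, test_idx in folds]

for i in range(k):
    test_idx = fold_indices[i]
    val_idx = fold_indices[(i + 1) % k]            # next fold used for validation
    train_idx = np.hstack([fold_indices[j] for j in range(k)
                           if j not in (i, (i + 1) % k)])
    # train on (k - 2) folds, tune hyperparameters on val_idx, report on test_idx
    print(len(train_idx), len(val_idx), len(test_idx))
```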