Processing math: 100%

What is an hyperparameter?

Nov 01, 2018

An hyperparameter is a parameter of the machine-learning algorithm. While the parameters are learned by the machine-learning algorithm, an hyperparameter dictates how the algorithm learns.

For instance, recall the training objective for a ridge regression given a train-set Strain:

G(fw,Strain)=LMSE(fw,Strain)+λw22

The variable λ regulates the weight of the regularizer term in the training objective. Changing λ will change the learned parameter w. It is an hyperparameter.

Other hyperparameters often used are the depth and width of a neural network, of the maximal degree of a polynomial feature augmentation.

Choosing the best hyperparameters

The process of choosing the best hyperparameters is called model selection.