Gradient Boosted Decision Tree

GBDT is an additive regression algorithm consisting of an ensemble of trees, fitted to current residuals, gradients of the loss function, in a forward step-wise manner. It iteratively fits an additive model as

$f_{t}(x)=T_{t}(x;\Theta )+\lambda {\overset {T}{\underset {t=1}{\sum }}}\beta _{t}T_{t}(x;\Theta _{t})$

such that a certain loss function $L(y_{i},f_{T}(x+i))$ is minimized, where $T_{t}(x;\Theta _{t})$ is a tree at iteration $t$ , weighted by parameter $\sigma _{t}$ , with a finite number of parameters, $\Theta _{t}$ and $\lambda$ is the learning rate. At iteration $t$ , tree $T_{t}(x;\sigma )$ is induced to fit the negative gradient by least squares. That is