
claytonjy 8y ago

I've half-convinced myself it's because we're talking about GBMs and not Random Forests (where my mind goes first). One of the smart things about XGBoost is that it parallelizes training by multithreading the variable selection at each node, but that doesn't apply to inference; I imagine you gotta predict trees sequentially, since each takes the previous output as an input? Now I wonder what those extra threads were even doing...

volker48 8y ago

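Each tree can be traversed independently. The dependence between trees only exists during training, where each new tree is fit against the output of the ones before it; at inference the trees are fixed and no tree takes another tree's output as input. Each tree is traversed until a leaf node is reached, and those leaf values are summed across all the trees. The sum of all the leaves plus a base score is then returned. In the case of binary classification that sum is passed through a sigmoid function. For regression the sum is returned as is.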
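
Here's a rough sketch of that prediction path, in case it helps. The dict-based tree layout and the function names (`traverse`, `predict_margin`, `predict_proba`) are made up for illustration and are not XGBoost's actual data structures or API. The optional thread pool is only there to show that the per-tree traversals don't depend on each other; in pure Python it won't buy you real speed.

```python
import math
from concurrent.futures import ThreadPoolExecutor

# Hypothetical tree layout (an assumption, not XGBoost's internal format):
#   internal node: {"feature": int, "threshold": float, "left": node, "right": node}
#   leaf node:     {"leaf": float}

def traverse(tree, x):
    """Walk one tree from the root until a leaf is reached; return the leaf value."""
    node = tree
    while "leaf" not in node:
        go_left = x[node["feature"]] < node["threshold"]
        node = node["left"] if go_left else node["right"]
    return node["leaf"]

def predict_margin(trees, x, base_score=0.0, workers=None):
    """Sum the leaf values of all trees and add the base score.

    Each traversal only reads x and its own tree, so the order
    (or whether they run concurrently) does not matter.
    """
    if workers:
        with ThreadPoolExecutor(max_workers=workers) as pool:
            leaves = list(pool.map(lambda t: traverse(t, x), trees))
    else:
        leaves = [traverse(t, x) for t in trees]
    return base_score + sum(leaves)

def predict_proba(trees, x, base_score=0.0):
    """Binary classification: pass the summed margin through a sigmoid."""
    margin = predict_margin(trees, x, base_score)
    return 1.0 / (1.0 + math.exp(-margin))

# Two toy trees and one input row.
trees = [
    {"feature": 0, "threshold": 1.0, "left": {"leaf": -0.5}, "right": {"leaf": 0.5}},
    {"feature": 1, "threshold": 2.0, "left": {"leaf": 0.25}, "right": {"leaf": -0.25}},
]
x = [1.5, 0.0]

print(predict_margin(trees, x))             # regression-style output: the raw sum
print(predict_margin(trees, x, workers=2))  # same result when trees run concurrently
print(predict_proba(trees, x))              # binary classification output
```

For regression you'd use `predict_margin` directly; for binary classification, `predict_proba`.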
