How boosting differs from a single tree and from random forests, how gradient boosting fits residuals for regression and classification, and the loss-and-gradient idea behind every boosting library.
Welcome to Boosting Foundations, the first module of the Gradient Boosting & XGBoost course. Before you touch XGBoost, you’ll build a rock-solid understanding of what gradient boosting actually does. You’ll see why boosting — training trees one after another, each fixing the last one’s mistakes — differs from the parallel averaging of a random forest, and why it so often wins on tabular data.
You’ll learn the mechanics step by step: the additive model that keeps adding shrunken trees to fit the leftover residuals, how the same idea works for classification in log-odds space, and the unifying principle that ties it all together — every tree fits the negative gradient of a loss function. The module ends with a guided project where you build a complete gradient boosting regressor from scratch in NumPy and scikit-learn, then check it against the real thing.
Every model here is trained for real on the California Housing and Adult Income datasets. Start with Lesson 1, where you’ll see exactly why a crowd of weak trees beats a single strong one.
Complete all 5 lessons to finish the Boosting Foundations module.