<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Capstone on DATATWEETS</title><link>https://datatweets.com/courses/gradient-boosting/capstone/</link><description>Recent content in Capstone on DATATWEETS</description><generator>Hugo</generator><language>en</language><copyright>Copyright (c) 2025 Datatweets</copyright><lastBuildDate>Sun, 05 Jul 2026 09:00:00 +0200</lastBuildDate><atom:link href="https://datatweets.com/courses/gradient-boosting/capstone/index.xml" rel="self" type="application/rss+xml"/><item><title>Lesson 1 - Guided Project: An Honest, Tuned, Explained Model</title><link>https://datatweets.com/courses/gradient-boosting/capstone/lesson-1-guided-project-an-honest-tuned-explained-model/</link><pubDate>Sun, 05 Jul 2026 09:00:00 +0200</pubDate><guid>https://datatweets.com/courses/gradient-boosting/capstone/lesson-1-guided-project-an-honest-tuned-explained-model/</guid><description>The course capstone takes one real dataset, Adult Income (48,842 rows, ~24 percent earn &amp;gt;50K), from raw data to a trustworthy model using skills from all four modules. A naive XGBClassifier baseline scores 0.8650 accuracy and 0.9183 ROC AUC but a weak 0.6506 positive-class recall. Adding scale_pos_weight and early stopping, then a 25-trial Optuna study, and finally SHAP explanations produces a model that lifts recall to 0.8563 and ROC AUC to 0.9299 while average precision climbs from 0.8100 to 0.8336. SHAP ranks marital-status, age, and capital-gain as the top drivers and decomposes a single high-earner prediction exactly in log-odds space.</description></item></channel></rss>