NovaBank built a new gradient-boosted classifier to predict 30-day credit card default at account origination. The team wants to know whether the model has real predictive signal or is only showing weak lift over the current scorecard, especially because approvals affect both loss rates and customer growth.
| Metric | Existing Scorecard | New Classifier | Notes |
|---|---|---|---|
| AUC-ROC | 0.681 | 0.742 | Evaluated on 120,000 holdout applicants |
| Log Loss | 0.412 | 0.356 | Lower is better |
| Precision @ top 10% risk | 0.214 | 0.318 | Default rate in population = 0.082 |
| Recall @ threshold 0.20 | 0.46 | 0.61 | Same operating threshold for comparison |
| F1 @ threshold 0.20 | 0.29 | 0.39 | Positive class = default |
| Brier Score | 0.071 | 0.064 | Probability quality |
| KS Statistic | 0.27 | 0.39 | Rank separation |
Leadership is asking whether these results indicate strong enough signal to replace the current scorecard and how much of the gain is due to ranking power versus threshold choice or calibration. You need to assess whether the classifier is genuinely useful, where it still fails, and what changes would improve deployment readiness.