LendWise uses a gradient boosted binary classifier to predict whether a personal loan applicant will default within 90 days. In the last model review, the team found that the new model improves overall ranking performance versus the production scorecard, but default rates in one applicant segment appear materially underestimated.
| Metric | Production Scorecard | New Model | Change |
|---|---|---|---|
| AUC-ROC | 0.74 | 0.81 | +0.07 |
| Precision @ threshold 0.40 | 0.61 | 0.68 | +0.07 |
| Recall @ threshold 0.40 | 0.58 | 0.49 | -0.09 |
| F1 Score | 0.59 | 0.57 | -0.02 |
| Log Loss | 0.49 | 0.43 | -0.06 |
| Brier Score | 0.162 | 0.191 | +0.029 |
| Approval Rate | 72% | 76% | +4 pts |
| Observed 90-day default rate | 8.4% | 8.4% | — |
| Predicted default rate | 8.1% | 6.7% | -1.4 pts |
You need to decide what validation steps are required before presenting these findings to the Head of Risk and recommending rollout. The ranking metrics look better, but threshold metrics, calibration, and segment behavior raise concerns about whether the model is truly safer for deployment.