LendWise uses a gradient boosting classifier to approve or decline unsecured personal loan applications. The model performed well in offline validation, but after 8 weeks in production the risk team observed that default rates are higher than expected in some applicant segments, raising concerns about real-world reliability.
| Metric | Offline Validation | Production (8 weeks) | Change |
|---|---|---|---|
| AUC-ROC | 0.84 | 0.79 | -0.05 |
| Precision (default class) | 0.61 | 0.58 | -0.03 |
| Recall (default class) | 0.74 | 0.49 | -0.25 |
| F1 Score | 0.67 | 0.53 | -0.14 |
| Log Loss | 0.41 | 0.52 | +0.11 |
| Approval Rate | 68% | 71% | +3 pts |
| 60-day Default Rate | 4.8% | 7.1% | +2.3 pts |
| Avg predicted default risk for approved loans | 3.9% | 3.7% | -0.2 pts |
The model appears to be underestimating risk in production and missing a meaningful share of future defaulters. Leadership wants a validation plan that goes beyond a single holdout score and demonstrates whether the model is reliable enough for continued deployment.