LendWise uses a binary classification model to predict whether a personal loan applicant will default within 12 months. The team is unsure how to interpret a seemingly strong ROC-AUC score because business results at the current decision threshold are only moderate.
| Metric | Validation Set | Notes |
|---|---|---|
| ROC-AUC | 0.86 | Based on 120,000 applications |
| Precision | 0.58 | At threshold = 0.50 |
| Recall | 0.41 | At threshold = 0.50 |
| F1 Score | 0.48 | At threshold = 0.50 |
| Accuracy | 0.89 | Default rate is 9% |
| Log Loss | 0.31 | Probability quality is mixed |
| True Positives | 4,428 | Correctly flagged defaults |
| False Positives | 3,206 | Good applicants incorrectly flagged |
| False Negatives | 6,372 | Missed defaults |
| True Negatives | 105,994 | Correctly approved non-defaults |
The risk team sees ROC-AUC = 0.86 and calls the model “strong,” but loan officers point out that recall is low and many defaults are still being approved. You need to explain what ROC-AUC actually means here, what it does not guarantee, and whether the model is good enough for deployment at the current threshold.