LendWise built a binary classification model to predict whether a personal loan applicant will default within 12 months. The team wants to deploy the model to support underwriting decisions, but risk leadership wants a clear validation plan before launch because false approvals increase credit losses while false declines reduce revenue and create customer friction.
The model was trained on 180,000 historical applications and evaluated on a holdout set of 20,000 recent applications.
| Metric | Training | Validation | Holdout Test |
|---|---|---|---|
| Accuracy | 0.86 | 0.82 | 0.81 |
| Precision | 0.74 | 0.66 | 0.64 |
| Recall | 0.69 | 0.58 | 0.55 |
| F1 Score | 0.71 | 0.62 | 0.59 |
| AUC-ROC | 0.88 | 0.81 | 0.80 |
| Default Rate in Data | 0.18 | 0.19 | 0.20 |
The model looks directionally useful, but performance drops from training to test, and recall is materially lower than precision. The lending team needs to know whether the model is reliable enough for deployment, what additional validation is required, and whether the current threshold is appropriate for business use.