NorthStar Bank uses a supervised learning model to predict whether a personal loan applicant will default within 12 months. The current model performs well on training data but degrades noticeably after deployment, so the team wants a more robust approach that explicitly addresses overfitting.
You are given a historical loan dataset used for binary classification.
| Feature Group | Count | Examples |
|---|---|---|
| Applicant demographics | 6 | age, employment_length, home_ownership |
| Financial attributes | 9 | annual_income, debt_to_income, revolving_utilization |
| Credit history | 7 | fico_score, delinquencies_2y, inquiries_6m |
| Loan attributes | 5 | loan_amount, interest_rate, term_months |
| Engineered behavior flags | 5 | high_utilization_flag, recent_inquiry_ratio |
A good solution should improve generalization, not just training accuracy. Target performance is ROC-AUC >= 0.82 on a held-out test set, with a train-test AUC gap <= 0.03 and stable cross-validation results.