NorthStar Bank scores small-business loan applications and wants a model that predicts 90-day default risk before underwriting review. The current model performs well in training but degrades noticeably on new applications, so the team suspects high variance and wants a structured diagnosis and mitigation plan.
| Feature Group | Count | Examples |
|---|---|---|
| Applicant financials | 12 | annual_revenue, debt_to_income, cash_reserves, credit_utilization |
| Loan attributes | 8 | loan_amount, term_months, interest_rate, collateral_flag |
| Behavioral / bureau | 10 | prior_delinquencies, inquiries_6m, avg_payment_delay_days |
| Categorical profile | 6 | industry, region, business_type, channel |
| Engineered history | 6 | revenue_trend_3m, utilization_change_90d, missed_payment_ratio |
A good solution should clearly explain the bias-variance tradeoff, diagnose whether the model is suffering from high variance using train/validation behavior, and improve generalization enough to reach ROC-AUC >= 0.82 and PR-AUC >= 0.46 on a held-out test set. The final recommendation should also identify which interventions reduce variance without making the model too biased.