NorthStar Bank uses a binary classification model to predict whether a personal loan applicant will default within 12 months. The model is already in production for pre-approval screening, and the hiring team wants to assess whether you can identify and mitigate common model vulnerabilities before they become business, compliance, or reliability issues.
You are given a historical underwriting dataset used to train the current model.
| Feature Group | Count | Examples |
|---|---|---|
| Applicant financials | 12 | annual_income, debt_to_income, revolving_utilization, credit_history_length |
| Credit bureau signals | 9 | delinquencies_2y, inquiries_6m, public_records, fico_band |
| Loan attributes | 6 | loan_amount, term_months, interest_rate, purpose |
| Demographics / geography | 5 | state, employment_type, home_ownership, age_band |
| Operational fields | 4 | application_channel, broker_flag, submission_hour, prior_customer |
default_12m (1 if defaulted within 12 months, else 0)A strong solution should identify the main vulnerabilities of the model, explain how to test for them, and propose practical mitigations. “Good enough” means the model maintains stable out-of-time performance, avoids leakage, is reasonably interpretable for risk review, and does not materially degrade for key applicant segments.