LendWise, a mid-size digital lending platform, wants to use machine learning in its credit risk analysis workflow. The goal is to predict whether a newly approved loan applicant will default within 90 days so the risk team can prioritize manual review and adjust approval thresholds.
You are given a historical training dataset of past loan applications and repayment outcomes.
| Feature Group | Count | Examples |
|---|---|---|
| Applicant demographics | 6 | age, employment_status, education_level, residential_status |
| Financial profile | 10 | annual_income, debt_to_income_ratio, existing_loans, credit_utilization |
| Application details | 7 | loan_amount, loan_term_months, interest_rate, purpose |
| Behavioral / bureau signals | 9 | prior_delinquencies, credit_score, inquiries_6m, account_age_months |
default_90d — whether the borrower defaults within 90 days of originationA good solution should achieve strong ranking quality and usable recall for the risk team. Target performance is ROC-AUC >= 0.82 and F1 >= 0.55 at an operational threshold, while keeping the model explainable enough for analysts to review decisions.
default_90d.