NovaCredit, a mid-market consumer lending fintech, wants to improve its pre-approval underwriting model for personal loans. The current rules-based system approves too many risky applicants and rejects some creditworthy ones, so the team needs a machine learning model that predicts 90-day default risk at application time.
You are given a historical application dataset built from loan originations over the last 24 months.
| Feature Group | Count | Examples |
|---|---|---|
| Applicant demographics | 6 | age, employment_status, residential_status |
| Credit bureau signals | 14 | fico_band, revolving_utilization, delinquencies_12m, inquiries_6m |
| Financial attributes | 11 | annual_income, debt_to_income, requested_amount, existing_monthly_debt |
| Application behavior | 7 | device_type, application_hour, session_length_sec, document_upload_count |
| Derived temporal features | 8 | days_since_last_delinquency, income_to_loan_ratio, recent_credit_velocity |
default_90d — whether the borrower missed payments severely enough to be classified as default within 90 days of originationA good solution should improve risk ranking enough to support underwriting decisions and achieve:
default_90d.