TalentMatch, a recruiting platform processing ~120K internship applications per hiring cycle, wants a model to predict whether a candidate will pass the first-round technical screen. Recruiters need a practical baseline model to prioritize reviews while keeping the process explainable and fair.
You are given historical candidate records from internships and student projects. Each row represents one application.
| Feature Group | Count | Examples |
|---|---|---|
| Education | 6 | degree_level, major, GPA, graduation_year |
| Experience | 8 | internship_count, project_count, months_experience, hackathon_count |
| Skills | 10 | python, sql, sklearn, pytorch, cloud, statistics |
| Assessment | 5 | resume_score, coding_test_score, communication_score |
| Metadata | 4 | university_tier, region, referral_flag, application_source |
A good solution should achieve ROC-AUC >= 0.82, F1 >= 0.68, and recall >= 0.75 for the positive class at an operational threshold. The model should also provide interpretable feature importance for recruiters.