Royal Cyber wants to improve retention for its managed commerce and digital transformation clients by predicting which customer accounts are likely to churn in the next 90 days. The current model performs inconsistently across training and validation data, and your task is to identify whether the issue is high bias, high variance, or both, then improve the model without making it too slow or opaque for the customer success team.
You are given an account-level dataset built from Royal Cyber CRM, support, billing, and product usage systems.
| Feature Group | Count | Examples |
|---|---|---|
| Usage metrics | 18 | weekly_active_users, feature_adoption_rate, session_depth, api_calls_30d |
| Support signals | 9 | open_tickets_30d, escalations_90d, avg_resolution_hours |
| Billing & contract | 8 | arr, payment_delay_days, renewal_in_60d, contract_term_months |
| Account profile | 7 | industry, region, account_age_days, implementation_partner |
| Derived trends | 10 | usage_drop_30d_vs_90d, ticket_growth_rate, login_volatility |
A strong solution should clearly diagnose bias vs. variance using train/validation behavior, improve generalization, and achieve ROC-AUC >= 0.84, PR-AUC >= 0.46, and F1 >= 0.58 on the held-out test set.