Diagnose Bias-Variance in Churn Models

Business Context

Royal Cyber wants to improve retention for its managed commerce and digital transformation clients by predicting which customer accounts are likely to churn in the next 90 days. The current model performs inconsistently across training and validation data, and your task is to identify whether the issue is high bias, high variance, or both, then improve the model without making it too slow or opaque for the customer success team.

Dataset

You are given an account-level dataset built from Royal Cyber CRM, support, billing, and product usage systems.

Feature Group	Count	Examples
Usage metrics	18	weekly_active_users, feature_adoption_rate, session_depth, api_calls_30d
Support signals	9	open_tickets_30d, escalations_90d, avg_resolution_hours
Billing & contract	8	arr, payment_delay_days, renewal_in_60d, contract_term_months
Account profile	7	industry, region, account_age_days, implementation_partner
Derived trends	10	usage_drop_30d_vs_90d, ticket_growth_rate, login_volatility

Size: 92,000 account-month records from the last 24 months, 52 features
Target: Binary label indicating whether the account churns within the next 90 days
Class balance: 11.4% churn, 88.6% retained
Missing data: 12% missing in support fields, 6% missing in usage fields for newly onboarded accounts

Success Criteria

A strong solution should clearly diagnose bias vs. variance using train/validation behavior, improve generalization, and achieve ROC-AUC >= 0.84, PR-AUC >= 0.46, and F1 >= 0.58 on the held-out test set.

Constraints

Predictions run nightly in batch for ~40K active accounts
The customer success team needs understandable drivers behind risk scores
Retraining should fit within a 2-hour Royal Cyber data science workflow

Deliverables

Build a baseline model and quantify train vs. validation performance.
Diagnose whether the baseline suffers from high bias, high variance, or both.
Propose and implement fixes using model complexity, regularization, feature engineering, or data strategy.
Compare at least two model families and justify the final choice.
Report final metrics, threshold choice, and top features influencing churn risk.

Business Context

Dataset

You are given an account-level dataset built from Royal Cyber CRM, support, billing, and product usage systems.

Feature Group	Count	Examples
Usage metrics	18	weekly_active_users, feature_adoption_rate, session_depth, api_calls_30d
Support signals	9	open_tickets_30d, escalations_90d, avg_resolution_hours
Billing & contract	8	arr, payment_delay_days, renewal_in_60d, contract_term_months
Account profile	7	industry, region, account_age_days, implementation_partner
Derived trends	10	usage_drop_30d_vs_90d, ticket_growth_rate, login_volatility

Size: 92,000 account-month records from the last 24 months, 52 features
Target: Binary label indicating whether the account churns within the next 90 days
Class balance: 11.4% churn, 88.6% retained
Missing data: 12% missing in support fields, 6% missing in usage fields for newly onboarded accounts

Success Criteria

Constraints

Predictions run nightly in batch for ~40K active accounts
The customer success team needs understandable drivers behind risk scores
Retraining should fit within a 2-hour Royal Cyber data science workflow

Deliverables

Build a baseline model and quantify train vs. validation performance.
Diagnose whether the baseline suffers from high bias, high variance, or both.
Propose and implement fixes using model complexity, regularization, feature engineering, or data strategy.
Compare at least two model families and justify the final choice.
Report final metrics, threshold choice, and top features influencing churn risk.

Business Context

Dataset

You are given an account-level dataset built from Royal Cyber CRM, support, billing, and product usage systems.

Feature Group	Count	Examples
Usage metrics	18	weekly_active_users, feature_adoption_rate, session_depth, api_calls_30d
Support signals	9	open_tickets_30d, escalations_90d, avg_resolution_hours
Billing & contract	8	arr, payment_delay_days, renewal_in_60d, contract_term_months
Account profile	7	industry, region, account_age_days, implementation_partner
Derived trends	10	usage_drop_30d_vs_90d, ticket_growth_rate, login_volatility

Size: 92,000 account-month records from the last 24 months, 52 features
Target: Binary label indicating whether the account churns within the next 90 days
Class balance: 11.4% churn, 88.6% retained
Missing data: 12% missing in support fields, 6% missing in usage fields for newly onboarded accounts

Success Criteria

Constraints

Predictions run nightly in batch for ~40K active accounts
The customer success team needs understandable drivers behind risk scores
Retraining should fit within a 2-hour Royal Cyber data science workflow

Deliverables

Build a baseline model and quantify train vs. validation performance.
Diagnose whether the baseline suffers from high bias, high variance, or both.
Propose and implement fixes using model complexity, regularization, feature engineering, or data strategy.
Compare at least two model families and justify the final choice.
Report final metrics, threshold choice, and top features influencing churn risk.

Business Context

Dataset

You are given an account-level dataset built from Royal Cyber CRM, support, billing, and product usage systems.

Feature Group	Count	Examples
Usage metrics	18	weekly_active_users, feature_adoption_rate, session_depth, api_calls_30d
Support signals	9	open_tickets_30d, escalations_90d, avg_resolution_hours
Billing & contract	8	arr, payment_delay_days, renewal_in_60d, contract_term_months
Account profile	7	industry, region, account_age_days, implementation_partner
Derived trends	10	usage_drop_30d_vs_90d, ticket_growth_rate, login_volatility

Size: 92,000 account-month records from the last 24 months, 52 features
Target: Binary label indicating whether the account churns within the next 90 days
Class balance: 11.4% churn, 88.6% retained
Missing data: 12% missing in support fields, 6% missing in usage fields for newly onboarded accounts

Success Criteria

Constraints

Predictions run nightly in batch for ~40K active accounts
The customer success team needs understandable drivers behind risk scores
Retraining should fit within a 2-hour Royal Cyber data science workflow

Deliverables

Build a baseline model and quantify train vs. validation performance.
Diagnose whether the baseline suffers from high bias, high variance, or both.
Propose and implement fixes using model complexity, regularization, feature engineering, or data strategy.
Compare at least two model families and justify the final choice.
Report final metrics, threshold choice, and top features influencing churn risk.

Interview Guides

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Diagnose Bias-Variance in Churn Models

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Your Answer

Diagnose Bias-Variance in Churn Models

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Diagnose Bias-Variance in Churn Models

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Your Answer