Detect Overfitting in Churn Model

Context

StreamWave uses a gradient boosting classifier to predict which subscribers are likely to cancel within 30 days so the retention team can send discounts. The model performed well during training, but leadership is concerned that offline results may not generalize because recent campaign performance has been weaker than expected.

Current Performance

Metric	Training Set	Validation Set	Gap
Accuracy	0.94	0.81	-0.13
Precision	0.91	0.68	-0.23
Recall	0.89	0.61	-0.28
F1 Score	0.90	0.64	-0.26
AUC-ROC	0.97	0.76	-0.21
Log Loss	0.18	0.49	+0.31
Positive rate (churn)	0.22	0.21	-0.01

The Problem

The data science manager wants to know whether this model is overfitting, how confident you are in that diagnosis, and what should be changed before the next deployment. You should focus on interpreting the gap between training and validation performance rather than proposing a brand-new modeling approach.

Requirements

Determine whether the model is overfitting and explain which metrics support your conclusion.
Identify at least 3 likely causes of the performance gap.
Explain what additional validation checks you would run to confirm the diagnosis.
Recommend 3-5 concrete changes to reduce overfitting.
Discuss the business impact of deploying the current model as-is.

Constraints

Retention budget allows offers to only 8% of active subscribers each week.
False positives waste discount spend; false negatives miss preventable churn.
Retraining can happen weekly, but feature engineering changes take one sprint.

Context

Current Performance

Metric	Training Set	Validation Set	Gap
Accuracy	0.94	0.81	-0.13
Precision	0.91	0.68	-0.23
Recall	0.89	0.61	-0.28
F1 Score	0.90	0.64	-0.26
AUC-ROC	0.97	0.76	-0.21
Log Loss	0.18	0.49	+0.31
Positive rate (churn)	0.22	0.21	-0.01

The Problem

Requirements

Determine whether the model is overfitting and explain which metrics support your conclusion.
Identify at least 3 likely causes of the performance gap.
Explain what additional validation checks you would run to confirm the diagnosis.
Recommend 3-5 concrete changes to reduce overfitting.
Discuss the business impact of deploying the current model as-is.

Constraints

Retention budget allows offers to only 8% of active subscribers each week.
False positives waste discount spend; false negatives miss preventable churn.
Retraining can happen weekly, but feature engineering changes take one sprint.

Context

Current Performance

Metric	Training Set	Validation Set	Gap
Accuracy	0.94	0.81	-0.13
Precision	0.91	0.68	-0.23
Recall	0.89	0.61	-0.28
F1 Score	0.90	0.64	-0.26
AUC-ROC	0.97	0.76	-0.21
Log Loss	0.18	0.49	+0.31
Positive rate (churn)	0.22	0.21	-0.01

The Problem

Requirements

Determine whether the model is overfitting and explain which metrics support your conclusion.
Identify at least 3 likely causes of the performance gap.
Explain what additional validation checks you would run to confirm the diagnosis.
Recommend 3-5 concrete changes to reduce overfitting.
Discuss the business impact of deploying the current model as-is.

Constraints

Retention budget allows offers to only 8% of active subscribers each week.
False positives waste discount spend; false negatives miss preventable churn.
Retraining can happen weekly, but feature engineering changes take one sprint.

Context

Current Performance

Metric	Training Set	Validation Set	Gap
Accuracy	0.94	0.81	-0.13
Precision	0.91	0.68	-0.23
Recall	0.89	0.61	-0.28
F1 Score	0.90	0.64	-0.26
AUC-ROC	0.97	0.76	-0.21
Log Loss	0.18	0.49	+0.31
Positive rate (churn)	0.22	0.21	-0.01

The Problem

Requirements

Determine whether the model is overfitting and explain which metrics support your conclusion.
Identify at least 3 likely causes of the performance gap.
Explain what additional validation checks you would run to confirm the diagnosis.
Recommend 3-5 concrete changes to reduce overfitting.
Discuss the business impact of deploying the current model as-is.

Constraints

Retention budget allows offers to only 8% of active subscribers each week.
False positives waste discount spend; false negatives miss preventable churn.
Retraining can happen weekly, but feature engineering changes take one sprint.

Interview Guides

Context

Current Performance

The Problem

Requirements

Constraints

Detect Overfitting in Churn Model

Context

Current Performance

The Problem

Requirements

Constraints

Your Answer

Detect Overfitting in Churn Model

Context

Current Performance

The Problem

Requirements

Constraints

Detect Overfitting in Churn Model

Context

Current Performance

The Problem

Requirements

Constraints

Your Answer