StreamBox uses a binary classification model to predict which subscribers are likely to cancel in the next 30 days. Subscribers whose score exceeds a threshold receive a retention offer that costs $8 to fulfill, but offers sent to users who would not have churned anyway reduce margin, so the team wants to tune the decision threshold rather than retrain the model.
Validation set size: 20,000 users, of whom 2,000 (10%) churned. Positive class = churn. Current threshold = 0.50.
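Moving the threshold changes only the decision rule applied to the model's scores, not the model itself. A minimal sketch (the scores below are made up for illustration, not taken from the validation set):

```python
def flag_for_offer(scores, threshold=0.50):
    """Convert churn scores into offer decisions.

    The model and its scores are unchanged; only the cutoff that
    turns a score into an action moves, so no retraining is needed.
    """
    return [score >= threshold for score in scores]

# Hypothetical scores for four subscribers.
scores = [0.12, 0.35, 0.55, 0.81]
print(flag_for_offer(scores, 0.30))  # a lower cutoff flags more users
print(flag_for_offer(scores, 0.70))  # a higher cutoff flags fewer
```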
| Metric | Threshold 0.30 | Threshold 0.50 | Threshold 0.70 |
|---|---|---|---|
| Precision | 0.41 | 0.55 | 0.65 |
| Recall | 0.86 | 0.58 | 0.31 |
| F1 Score | 0.55 | 0.57 | 0.42 |
| Accuracy | 0.86 | 0.91 | 0.91 |
| Users flagged for offer | 4,200 | 2,100 | 950 |
| True churners caught | 1,720 | 1,160 | 620 |
| False positives | 2,480 | 940 | 330 |
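Recomputing the derived metrics from the raw count rows is a quick consistency check on the table. The only extra input needed is the total number of churners, which the counts imply: 1,720 caught at 0.86 recall means 2,000 churners overall.

```python
TOTAL_USERS = 20_000
TOTAL_CHURNERS = 2_000  # implied by 1,720 caught / 0.86 recall

# threshold: (users flagged, true positives, false positives)
counts = {
    0.30: (4_200, 1_720, 2_480),
    0.50: (2_100, 1_160, 940),
    0.70: (950, 620, 330),
}

for threshold, (flagged, tp, fp) in counts.items():
    fn = TOTAL_CHURNERS - tp            # churners the model missed
    tn = TOTAL_USERS - tp - fp - fn     # everyone correctly left alone
    precision = tp / flagged
    recall = tp / TOTAL_CHURNERS
    f1 = 2 * precision * recall / (precision + recall)
    accuracy = (tp + tn) / TOTAL_USERS
    print(f"{threshold:.2f}: P={precision:.2f} R={recall:.2f} "
          f"F1={f1:.2f} Acc={accuracy:.2f}")
```

Note how flat accuracy is across thresholds: with only 10% of users churning, a classifier can score high accuracy while missing most churners, which is why precision and recall drive the decision here.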
The marketing lead is asking why the team should change the threshold if the model itself has not changed. The answer is that the scores have not changed; what changes is the cutoff that converts a score into an action, and moving it trades recall (churners caught) against precision (offer spend wasted on non-churners). The lead also wants to know which threshold best balances churn prevention against offer cost.
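Offer spend at each threshold follows directly from the table (flagged users x $8), but "best balance" also depends on two quantities the scenario does not give: how often an offer actually saves a flagged churner, and how much revenue a save retains. A sketch with those two parameters as clearly labeled assumptions:

```python
OFFER_COST = 8  # dollars per flagged user (from the scenario)

# threshold: (users flagged, true churners caught), from the validation table
thresholds = {0.30: (4_200, 1_720), 0.50: (2_100, 1_160), 0.70: (950, 620)}

# ASSUMPTIONS for illustration only, not from the scenario:
SAVE_RATE = 0.30       # hypothetical fraction of offered churners who stay
VALUE_PER_SAVE = 60.0  # hypothetical dollars retained per saved subscriber

for t, (flagged, caught) in thresholds.items():
    spend = flagged * OFFER_COST
    net = caught * SAVE_RATE * VALUE_PER_SAVE - spend
    print(f"{t:.2f}: spend=${spend:,}  "
          f"cost per churner reached=${spend / caught:.2f}  "
          f"net value under assumed params=${net:,.0f}")
```

Under these illustrative parameters the 0.50 threshold maximizes net value, but the ranking is sensitive to the assumed save rate and retained revenue; the team should measure both (for example via a holdout group that receives no offers) before committing to a threshold.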