StreamBox uses a binary classification model to predict which paid subscribers are likely to cancel in the next 30 days so the CRM team can send retention offers. A gradient boosting model replaced a logistic regression baseline six weeks ago, but retention lift from the campaign has been weaker than expected.
| Metric | Validation at Launch | Current Holdout | Change |
|---|---|---|---|
| Accuracy | 0.84 | 0.81 | -0.03 |
| Precision | 0.61 | 0.48 | -0.13 |
| Recall | 0.72 | 0.39 | -0.33 |
| F1 Score | 0.66 | 0.43 | -0.23 |
| AUC-ROC | 0.81 | 0.77 | -0.04 |
| Log Loss | 0.41 | 0.53 | +0.12 |
| Positive rate (actual churn) | 0.18 | 0.19 | +0.01 |
| Users flagged / week | 42,000 | 24,000 | -18,000 |
The model still shows acceptable overall accuracy, but it is missing a large share of churners and the retention team believes too few at-risk users are being targeted. Leadership wants to know whether the issue is thresholding, calibration, segment drift, feature quality, or model overfitting.