StreamBox uses a binary classification model to predict which subscribers are likely to cancel in the next 30 days so the retention team can send targeted offers. A recently deployed logistic regression model looks strong on overall accuracy, but churn has continued to rise and the CRM team says too many at-risk users are being missed.
| Metric | Validation Set | Previous Model |
|---|---|---|
| Accuracy | 0.91 | 0.87 |
| Precision | 0.64 | 0.58 |
| Recall | 0.38 | 0.55 |
| F1 Score | 0.48 | 0.56 |
| AUC-ROC | 0.79 | 0.76 |
| Churn rate in data | 0.12 | 0.12 |
| Customers flagged for outreach | 7,200 / 60,000 | 11,400 / 60,000 |
The new model improved accuracy and precision, but recall dropped sharply. As a result, many customers who actually churn are not being targeted by retention campaigns. The Head of Growth wants to know whether this model is truly better and what changes should be made before the next monthly campaign.