StreamHive is evaluating two binary classification models to predict 30-day subscriber churn so the CRM team can send retention offers. A logistic regression model is currently in production, and a gradient-boosted tree model was trained as a challenger.
| Metric | Simple Model: Logistic Regression | Complex Model: Gradient Boosting | Change |
|---|---|---|---|
| Accuracy | 0.842 | 0.856 | +0.014 |
| Precision | 0.611 | 0.648 | +0.037 |
| Recall | 0.462 | 0.571 | +0.109 |
| F1 Score | 0.526 | 0.607 | +0.081 |
| AUC-ROC | 0.781 | 0.826 | +0.045 |
| Log Loss | 0.438 | 0.401 | -0.037 |
| Inference latency (p95) | 12 ms | 95 ms | +83 ms |
| Weekly retraining time | 8 min | 2.4 hrs | +2.3 hrs |
The complex model performs better on offline metrics, but it is slower, harder to explain, and more expensive to maintain. The retention team can contact at most 120,000 users per week, and unnecessary offers reduce margin.