StreamBox has deployed a binary classification model to predict whether a subscriber will churn in the next 30 days. The product team wants to use the model to trigger retention offers, but leadership is concerned that the team is reporting only accuracy while the business cost of missed churners is much higher than the cost of sending an unnecessary offer.
| Metric | Current Model | Previous Model | Change |
|---|---|---|---|
| Accuracy | 0.91 | 0.89 | +0.02 |
| Precision | 0.58 | 0.54 | +0.04 |
| Recall | 0.36 | 0.49 | -0.13 |
| F1 Score | 0.44 | 0.51 | -0.07 |
| AUC-ROC | 0.79 | 0.77 | +0.02 |
| Log Loss | 0.29 | 0.33 | -0.04 |
| Churn rate in evaluation set | 0.12 | 0.12 | 0.00 |
The current model looks better on accuracy and AUC-ROC, but it is catching fewer actual churners. The retention team has budget to contact only 25,000 users per month, so the company needs to decide which metrics should drive model selection and thresholding.