BetterUp uses a binary classification model to predict whether a newly proposed coach-member match will lead to a successful first 30 days, defined as the member attending the first session and giving a post-session rating of 4 or 5. The model is used in the matching workflow inside BetterUp Care to prioritize recommended coach options.
A new version of the model was deployed last month. Leadership sees slightly higher overall accuracy, but member complaints about poor-fit recommendations have increased, especially for enterprise members in their first week on the platform.
| Metric | Previous Model | Current Model | Change |
|---|---|---|---|
| Accuracy | 0.74 | 0.78 | +0.04 |
| Precision | 0.69 | 0.81 | +0.12 |
| Recall | 0.76 | 0.52 | -0.24 |
| F1 Score | 0.72 | 0.63 | -0.09 |
| AUC-ROC | 0.80 | 0.79 | -0.01 |
| Positive prediction rate | 0.41 | 0.24 | -0.17 |
| Successful matches in eval set | 3,600 | 3,600 | 0 |
The current model is more conservative: it recommends fewer matches as likely successful, and those recommendations are more often correct, but it misses many matches that would have succeeded. You need to assess whether the new model is actually better for BetterUp's matching experience.