PulseApp, a consumer productivity app, noticed that users who receive more push notifications tend to have higher 7-day retention. The product manager wants to know whether sending more notifications actually causes retention to increase, or whether the relationship is driven by more engaged users naturally receiving more notifications.
You are given both observational data and a follow-up randomized experiment. Use the numbers below to explain why correlation alone is not enough for causal claims, and determine whether the experiment provides evidence of a causal effect.
| Segment | Users | Avg notifications/user | 7-day retained users | Retention rate |
|---|---|---|---|---|
| Low baseline activity users | 8,000 | 2.1 | 1,440 | 18.0% |
| High baseline activity users | 4,000 | 6.4 | 1,920 | 48.0% |
| All users combined | 12,000 | 3.53 | 3,360 | 28.0% |
A simple user-level regression on the observational sample estimated that each additional notification is associated with a 4.0 percentage point increase in probability of 7-day retention.
| Group | Users | Avg notifications/user | 7-day retained users | Retention rate |
|---|---|---|---|---|
| Control | 5,000 | 3.0 | 1,350 | 27.0% |
| Treatment (extra notification policy) | 5,000 | 4.0 | 1,425 | 28.5% |
Use a two-sided test with significance level .