PixelCraft, a design collaboration app, tested a new onboarding illustration intended to increase first-session completion. A product designer saw that the experiment result was “not significant” and asked what the p-value means, and why the team also cares about statistical power.
Use the experiment data below to explain both p-value and statistical power in plain language, while also computing the formal test result. Then assess whether the test was large enough to reliably detect the target improvement.
| Metric | Value |
|---|---|
| Control users | 4,800 |
| Control completions | 1,200 |
| Control completion rate | 25.0% |
| Treatment users | 4,900 |
| Treatment completions | 1,274 |
| Treatment completion rate | 26.0% |
| Observed lift | 1.0 percentage point |
| Significance level | 0.05 |
| Desired power | 80% |
| Minimum detectable effect (absolute) | 2.0 percentage points |
Assume a two-sided test comparing two independent proportions.