StreamCart, a grocery delivery app, wants to test a faster checkout flow. The product manager argues for shipping after one week to increase experimentation velocity, while the data science team wants enough sample size to maintain statistical power.
You need to quantify the trade-off between running a short experiment and waiting for a statistically rigorous result. Assume the team cares about detecting a meaningful lift in checkout conversion.
| Metric | Value |
|---|---|
| Baseline checkout conversion rate | 18.0% |
| Minimum detectable effect (absolute) | 1.5 percentage points |
| Significance level | 0.05 |
| Desired power | 80% |
| Daily eligible users | 40,000 |
| Traffic split | 50% control / 50% treatment |
| One-week observed control users | 140,000 |
| One-week observed treatment users | 140,000 |
| One-week observed control conversions | 25,200 |
| One-week observed treatment conversions | 27,020 |
The one-week result implies treatment conversion of 19.3% versus 18.0% in control.