A
StreamCart is testing a new personalized recommendation widget on its mobile home screen. Product expects only a small improvement in purchase conversion, so the main question is whether the experiment can detect that lift with reasonable power.
You are planning an A/B test on purchase conversion rate. The baseline conversion rate is 8.0%, and the team wants to detect a minimum meaningful lift of 0.4 percentage points (from 8.0% to 8.4%). Assume a two-sided test with significance level 5% and target power 80%.
| Metric | Value |
|---|---|
| Baseline conversion rate | 0.080 |
| Expected treatment conversion rate | 0.084 |
| Absolute lift | 0.004 |
| Relative lift | 5.0% |
| Significance level | 0.05 |
| Target power $1-\beta$ | 0.80 |
| Daily eligible users | 120,000 |
| Traffic split | 50% / 50% |
{"alpha":0.05,"power_target":0.8,"absolute_lift":0.004,"baseline_rate":0.08,"relative_lift":0.05,"treatment_rate":0.084,"daily_eligible_users":120000,"traffic_split_control":0.5,"traffic_split_treatment":0.5,"planned_days_for_power_check":7}Output(none)