Interview Guides

Experiment Speed vs Rigor Tradeoff

Medium

Statistics & Probability

Business Context

StreamCart, a grocery delivery app, wants to test a faster checkout flow. The product manager argues for shipping after one week to increase experimentation velocity, while the data science team wants enough sample size to maintain statistical power.

Problem Statement

You need to quantify the trade-off between running a short experiment and waiting for a statistically rigorous result. Assume the team cares about detecting a meaningful lift in checkout conversion.

Given Data

Metric	Value
Baseline checkout conversion rate	18.0%
Minimum detectable effect (absolute)	1.5 percentage points
Significance level	0.05
Desired power	80%
Daily eligible users	40,000
Traffic split	50% control / 50% treatment
One-week observed control users	140,000
One-week observed treatment users	140,000
One-week observed control conversions	25,200
One-week observed treatment conversions	27,020

The one-week result implies treatment conversion of 19.3% versus 18.0% in control.

Requirements

Compute the required sample size per group to detect the target 1.5 percentage point lift with 80% power.
Convert that sample size into the approximate number of days needed at the given traffic level.
Using the one-week observed data, run a two-proportion z-test.
Compute a 95% confidence interval for the observed lift.
Decide whether shipping after one week is justified.
Explain how you would balance experimentation speed and statistical rigor in practice.

Assumptions

Users are independently randomized at the user level.
No major novelty effect or traffic-quality shift occurs during the test.
The normal approximation for proportions is valid.
Use a two-sided test.

Experiment Speed vs Rigor Tradeoff

Medium

Statistics & Probability

Business Context

Problem Statement

You need to quantify the trade-off between running a short experiment and waiting for a statistically rigorous result. Assume the team cares about detecting a meaningful lift in checkout conversion.

Given Data

Metric	Value
Baseline checkout conversion rate	18.0%
Minimum detectable effect (absolute)	1.5 percentage points
Significance level	0.05
Desired power	80%
Daily eligible users	40,000
Traffic split	50% control / 50% treatment
One-week observed control users	140,000
One-week observed treatment users	140,000
One-week observed control conversions	25,200
One-week observed treatment conversions	27,020

The one-week result implies treatment conversion of 19.3% versus 18.0% in control.

Requirements

Compute the required sample size per group to detect the target 1.5 percentage point lift with 80% power.
Convert that sample size into the approximate number of days needed at the given traffic level.
Using the one-week observed data, run a two-proportion z-test.
Compute a 95% confidence interval for the observed lift.
Decide whether shipping after one week is justified.
Explain how you would balance experimentation speed and statistical rigor in practice.

Assumptions

Users are independently randomized at the user level.
No major novelty effect or traffic-quality shift occurs during the test.
The normal approximation for proportions is valid.
Use a two-sided test.

Your Answer

Experiment Speed vs Rigor Tradeoff

Medium

Statistics & Probability

Business Context

Problem Statement

You need to quantify the trade-off between running a short experiment and waiting for a statistically rigorous result. Assume the team cares about detecting a meaningful lift in checkout conversion.

Given Data

Metric	Value
Baseline checkout conversion rate	18.0%
Minimum detectable effect (absolute)	1.5 percentage points
Significance level	0.05
Desired power	80%
Daily eligible users	40,000
Traffic split	50% control / 50% treatment
One-week observed control users	140,000
One-week observed treatment users	140,000
One-week observed control conversions	25,200
One-week observed treatment conversions	27,020

The one-week result implies treatment conversion of 19.3% versus 18.0% in control.

Requirements

Compute the required sample size per group to detect the target 1.5 percentage point lift with 80% power.
Convert that sample size into the approximate number of days needed at the given traffic level.
Using the one-week observed data, run a two-proportion z-test.
Compute a 95% confidence interval for the observed lift.
Decide whether shipping after one week is justified.
Explain how you would balance experimentation speed and statistical rigor in practice.

Assumptions

Users are independently randomized at the user level.
No major novelty effect or traffic-quality shift occurs during the test.
The normal approximation for proportions is valid.
Use a two-sided test.

Experiment Speed vs Rigor Tradeoff

Medium

Statistics & Probability

Business Context

Problem Statement

You need to quantify the trade-off between running a short experiment and waiting for a statistically rigorous result. Assume the team cares about detecting a meaningful lift in checkout conversion.

Given Data

Metric	Value
Baseline checkout conversion rate	18.0%
Minimum detectable effect (absolute)	1.5 percentage points
Significance level	0.05
Desired power	80%
Daily eligible users	40,000
Traffic split	50% control / 50% treatment
One-week observed control users	140,000
One-week observed treatment users	140,000
One-week observed control conversions	25,200
One-week observed treatment conversions	27,020

The one-week result implies treatment conversion of 19.3% versus 18.0% in control.

Requirements

Compute the required sample size per group to detect the target 1.5 percentage point lift with 80% power.
Convert that sample size into the approximate number of days needed at the given traffic level.
Using the one-week observed data, run a two-proportion z-test.
Compute a 95% confidence interval for the observed lift.
Decide whether shipping after one week is justified.
Explain how you would balance experimentation speed and statistical rigor in practice.

Assumptions

Users are independently randomized at the user level.
No major novelty effect or traffic-quality shift occurs during the test.
The normal approximation for proportions is valid.
Use a two-sided test.