Non-Significant A/B Test Launch Decision

Business Context

StreamCart tested a new mobile checkout button intended to improve purchase conversion. After a 14-day A/B test, the PM wants to launch because the treatment looks directionally better, but the experiment did not reach statistical significance.

Problem Statement

Assess whether the observed lift is strong enough to justify launch, and quantify what the current test result does and does not tell us.

Given Data

Group	Users	Purchases	Conversion Rate
Control	52,400	6,026	11.50%
Treatment	52,100	6,105	11.72%

Additional inputs:

Parameter	Value
Test duration	14 days
Significance level	0.05
Desired power	0.80
PM's minimum practical lift to launch	0.50 percentage points

Requirements

State the null and alternative hypotheses for a two-sided test.
Run a two-proportion z-test and compute the pooled proportion, standard error, z-statistic, and p-value.
Compute a 95% confidence interval for the treatment-control difference.
Determine whether the result is statistically significant at the 5% level.
Compare the observed lift with the PM's minimum practical threshold.
Estimate the required sample size per group to detect a 0.50 percentage point absolute lift with 80% power at 5% significance.
Based on the evidence, explain what you would recommend to the PM: launch, hold, or continue testing.

Assumptions

Users were randomly assigned and counted once.
No sample ratio mismatch or major instrumentation issues.
Conversion is a Bernoulli outcome and normal approximation is valid.
The primary metric was pre-specified before the test started.

Business Context

Problem Statement

Assess whether the observed lift is strong enough to justify launch, and quantify what the current test result does and does not tell us.

Given Data

Group	Users	Purchases	Conversion Rate
Control	52,400	6,026	11.50%
Treatment	52,100	6,105	11.72%

Additional inputs:

Parameter	Value
Test duration	14 days
Significance level	0.05
Desired power	0.80
PM's minimum practical lift to launch	0.50 percentage points

Requirements

State the null and alternative hypotheses for a two-sided test.
Run a two-proportion z-test and compute the pooled proportion, standard error, z-statistic, and p-value.
Compute a 95% confidence interval for the treatment-control difference.
Determine whether the result is statistically significant at the 5% level.
Compare the observed lift with the PM's minimum practical threshold.
Estimate the required sample size per group to detect a 0.50 percentage point absolute lift with 80% power at 5% significance.
Based on the evidence, explain what you would recommend to the PM: launch, hold, or continue testing.

Assumptions

Users were randomly assigned and counted once.
No sample ratio mismatch or major instrumentation issues.
Conversion is a Bernoulli outcome and normal approximation is valid.
The primary metric was pre-specified before the test started.

Business Context

Problem Statement

Assess whether the observed lift is strong enough to justify launch, and quantify what the current test result does and does not tell us.

Given Data

Group	Users	Purchases	Conversion Rate
Control	52,400	6,026	11.50%
Treatment	52,100	6,105	11.72%

Additional inputs:

Parameter	Value
Test duration	14 days
Significance level	0.05
Desired power	0.80
PM's minimum practical lift to launch	0.50 percentage points

Requirements

State the null and alternative hypotheses for a two-sided test.
Run a two-proportion z-test and compute the pooled proportion, standard error, z-statistic, and p-value.
Compute a 95% confidence interval for the treatment-control difference.
Determine whether the result is statistically significant at the 5% level.
Compare the observed lift with the PM's minimum practical threshold.
Estimate the required sample size per group to detect a 0.50 percentage point absolute lift with 80% power at 5% significance.
Based on the evidence, explain what you would recommend to the PM: launch, hold, or continue testing.

Assumptions

Users were randomly assigned and counted once.
No sample ratio mismatch or major instrumentation issues.
Conversion is a Bernoulli outcome and normal approximation is valid.
The primary metric was pre-specified before the test started.

Business Context

Problem Statement

Assess whether the observed lift is strong enough to justify launch, and quantify what the current test result does and does not tell us.

Given Data

Group	Users	Purchases	Conversion Rate
Control	52,400	6,026	11.50%
Treatment	52,100	6,105	11.72%

Additional inputs:

Parameter	Value
Test duration	14 days
Significance level	0.05
Desired power	0.80
PM's minimum practical lift to launch	0.50 percentage points

Requirements

State the null and alternative hypotheses for a two-sided test.
Run a two-proportion z-test and compute the pooled proportion, standard error, z-statistic, and p-value.
Compute a 95% confidence interval for the treatment-control difference.
Determine whether the result is statistically significant at the 5% level.
Compare the observed lift with the PM's minimum practical threshold.
Estimate the required sample size per group to detect a 0.50 percentage point absolute lift with 80% power at 5% significance.
Based on the evidence, explain what you would recommend to the PM: launch, hold, or continue testing.

Assumptions

Users were randomly assigned and counted once.
No sample ratio mismatch or major instrumentation issues.
Conversion is a Bernoulli outcome and normal approximation is valid.
The primary metric was pre-specified before the test started.

Interview Guides

Business Context

Problem Statement

Given Data

Requirements

Assumptions

Non-Significant A/B Test Launch Decision

Business Context

Problem Statement

Given Data

Requirements

Assumptions

Your Answer

Non-Significant A/B Test Launch Decision

Business Context

Problem Statement

Given Data

Requirements

Assumptions

Non-Significant A/B Test Launch Decision

Business Context

Problem Statement

Given Data

Requirements

Assumptions

Your Answer