Power Instagram Reels CTR Test

Context

The Instagram Reels team is testing a new feed ranking tweak that adds a stronger "save-likelihood" feature to the Reels candidate ranker. The PM believes the change could slightly improve top-of-feed engagement by showing more relevant Reels, but wants a rigorous experiment before shipping.

Hypothesis Seed

The proposed change is expected to increase Reels click-through rate (CTR) from the Reels tray into a Reel watch session. Because the model explicitly optimizes for downstream engagement, the team also wants to ensure it does not hurt watch time, IG Save behavior, or user experience in the broader AARRR funnel.

Constraints

Eligible traffic: 12,000,000 Reels tray impressions per day globally
Maximum experiment window: 14 days, after which ranking resources will be reallocated
Randomization must happen in Meta's experimentation platform at the user_id level
The team wants 80% power at a 5% two-sided alpha level to detect a 1% relative lift in CTR
Baseline Reels tray CTR is 20.0%
False positives are costly because a bad ranker can degrade long-term session quality; false negatives are also costly because ranking launches are limited this quarter
You may use CUPED with 7-day pre-experiment user-level CTR as a variance reduction covariate

Deliverables

State the null and alternative hypotheses, and define the primary metric, secondary metrics, and guardrails.
Calculate the required sample size per arm to detect a 1% relative lift in CTR with 80% power, and translate that into experiment duration given the traffic constraint.
Choose the unit of randomization, explain the unit of analysis, and describe how you would use CUPED in the analysis.
Pre-register the analysis plan: statistical test, peeking policy, multiple-comparisons policy, and SRM checks.
Give a clear ship / don't-ship recommendation rule that respects guardrails and addresses novelty effect risk on IG Reels.

Hypothesis Seed

Constraints

Eligible traffic: 12,000,000 Reels tray impressions per day globally

Maximum experiment window: 14 days, after which ranking resources will be reallocated

Randomization must happen in Meta's experimentation platform at the user_id level

The team wants 80% power at a 5% two-sided alpha level to detect a 1% relative lift in CTR

Baseline Reels tray CTR is 20.0%

False positives are costly because a bad ranker can degrade long-term session quality; false negatives are also costly because ranking launches are limited this quarter

You may use CUPED with 7-day pre-experiment user-level CTR as a variance reduction covariate

Deliverables

State the null and alternative hypotheses, and define the primary metric, secondary metrics, and guardrails.

Calculate the required sample size per arm to detect a 1% relative lift in CTR with 80% power, and translate that into experiment duration given the traffic constraint.

Choose the unit of randomization, explain the unit of analysis, and describe how you would use CUPED in the analysis.

Pre-register the analysis plan: statistical test, peeking policy, multiple-comparisons policy, and SRM checks.

Give a clear ship / don't-ship recommendation rule that respects guardrails and addresses novelty effect risk on IG Reels.

Hypothesis Seed

Constraints

Eligible traffic: 12,000,000 Reels tray impressions per day globally

Maximum experiment window: 14 days, after which ranking resources will be reallocated

Randomization must happen in Meta's experimentation platform at the user_id level

The team wants 80% power at a 5% two-sided alpha level to detect a 1% relative lift in CTR

Baseline Reels tray CTR is 20.0%

False positives are costly because a bad ranker can degrade long-term session quality; false negatives are also costly because ranking launches are limited this quarter

You may use CUPED with 7-day pre-experiment user-level CTR as a variance reduction covariate

Deliverables

State the null and alternative hypotheses, and define the primary metric, secondary metrics, and guardrails.

Calculate the required sample size per arm to detect a 1% relative lift in CTR with 80% power, and translate that into experiment duration given the traffic constraint.

Choose the unit of randomization, explain the unit of analysis, and describe how you would use CUPED in the analysis.

Pre-register the analysis plan: statistical test, peeking policy, multiple-comparisons policy, and SRM checks.

Give a clear ship / don't-ship recommendation rule that respects guardrails and addresses novelty effect risk on IG Reels.

Hypothesis Seed

Constraints

Eligible traffic: 12,000,000 Reels tray impressions per day globally

Maximum experiment window: 14 days, after which ranking resources will be reallocated

Randomization must happen in Meta's experimentation platform at the user_id level

The team wants 80% power at a 5% two-sided alpha level to detect a 1% relative lift in CTR

Baseline Reels tray CTR is 20.0%

False positives are costly because a bad ranker can degrade long-term session quality; false negatives are also costly because ranking launches are limited this quarter

You may use CUPED with 7-day pre-experiment user-level CTR as a variance reduction covariate

Deliverables

State the null and alternative hypotheses, and define the primary metric, secondary metrics, and guardrails.

Calculate the required sample size per arm to detect a 1% relative lift in CTR with 80% power, and translate that into experiment duration given the traffic constraint.

Choose the unit of randomization, explain the unit of analysis, and describe how you would use CUPED in the analysis.

Pre-register the analysis plan: statistical test, peeking policy, multiple-comparisons policy, and SRM checks.

Give a clear ship / don't-ship recommendation rule that respects guardrails and addresses novelty effect risk on IG Reels.

Interview Guides

Context

Hypothesis Seed

Constraints

Deliverables

Power Instagram Reels CTR Test

Context

Hypothesis Seed

Constraints

Deliverables

Your Answer

Power Instagram Reels CTR Test

Context

Hypothesis Seed

Constraints

Deliverables

Power Instagram Reels CTR Test

Context

Hypothesis Seed

Constraints

Deliverables

Your Answer