Meta is preparing a major launch of a new Threads feature: AI-generated post summaries shown in the main feed for English-speaking users in the US, UK, and Canada. You are the Operations Manager supporting the cross-functional launch team of 18 people across Engineering, Data Science, Product, Infra, Integrity, and Support. Leadership wants the feature live before a high-profile creator event in 10 weeks, and expects a clear capacity plan for both launch week and the first 30 days after launch.
The Threads Product Director wants maximum reach at launch to drive engagement. The Infra engineering manager is concerned about serving cost and latency on existing inference clusters. Integrity wants additional review capacity because summaries could misrepresent sensitive content. Customer Support wants staffing plans in place before public rollout. Finance has approved only limited incremental spend and expects the team to stay within budget.
The launch deadline is fixed: 10 weeks from today. The approved incremental budget is $420,000 for temporary vendor moderation, on-call coverage, and extra compute. Current inference capacity can support 22 million summary requests per day at p95 latency under 350 ms; forecasted demand ranges from 18 million to 32 million daily requests depending on rollout size. Only 6 backend engineers and 2 ML engineers are available part-time because the team is also supporting Instagram Feed ranking incidents. Integrity can add at most 35 vendor reviewers, but onboarding takes 3 weeks. Support can staff only 12 additional agents for the first month.