Google is preparing a quality-focused testing initiative for a new Gemini feature in Google Workspace: AI-generated email summaries in Gmail on web and Android. You are the QA Engineer responsible for defining what "success" means for the test initiative before launch readiness is reviewed. The cross-functional team includes 6 engineers, 2 QA engineers, 1 product manager, 1 UX researcher, and 1 site reliability engineer, and leadership wants a recommendation in 4 weeks because the feature is targeted for a limited rollout next quarter.
The Gmail PM wants broad coverage and fast launch confidence. Engineering wants to minimize test maintenance and avoid delaying code freeze. The SRE lead is focused on production stability and rollback readiness. Legal and Responsible AI reviewers want evidence that harmful or misleading summaries are rare before any external rollout.
The team has a testing budget of $90,000 for vendor-based manual evaluation and test data setup. Only 2 QA engineers are available, and one is shared 50% with another Workspace release. The feature must support English only at launch, cover 3 major user flows, and integrate with existing GoogleTest-based backend tests and Android Espresso UI tests. A launch recommendation is due in 28 days, with no headcount increase.