

AEAcmeCloud is deploying a customer support copilot that answers questions about product features, billing, and API behavior from internal documentation. The main risk is hallucination: the model may generate confident but unsupported answers, which can mislead users and increase support escalations.
grounded, partially grounded, or hallucinatedA good solution should reduce hallucinated responses by at least 40% relative to the current baseline, achieve ≥0.85 F1 on hallucination detection, and keep end-to-end response latency under 1.5 seconds at p95.
AcmeCloud is deploying a customer support copilot that answers questions about product features, billing, and API behavior from internal documentation. The main risk is hallucination: the model may generate confident but unsupported answers, which can mislead users and increase support escalations.
grounded, partially grounded, or hallucinatedA good solution should reduce hallucinated responses by at least 40% relative to the current baseline, achieve ≥0.85 F1 on hallucination detection, and keep end-to-end response latency under 1.5 seconds at p95.