
AYou are the engineering manager for a digital product platform that supports customer sign-in, order placement, and content delivery across web and mobile. Leadership wants a clear view of operational health, but the team currently reports a mix of uptime, error counts, and incident notes without a shared framework. You need to define the metrics that best reflect reliability, user impact, and early warning signals.