You are responsible for a critical production system that supports internal operations and customer-facing workflows. You have just been paged because the service is unavailable, downstream requests are timing out, and the on-call dashboard shows the failure is spreading across dependent components. You do not yet know whether this is a software defect, infrastructure issue, or security event.