E


You're responsible for a production data pipeline, and something starts going wrong after it has been running normally. You need a clear way to isolate the issue, limit downstream impact, and restore confidence in the data before making broader changes.
How do you approach debugging issues in production systems?