
You have shipped a model that is making decisions in production, and the business wants to know how you will track whether it stays reliable over time. The team is asking how you would monitor performance once labels arrive, how you would notice drift early, and how you would tell if the current threshold is still right.
How would you monitor model performance in production?