



You've shipped a model that is live in production, and the team wants to improve its real-world performance. You need a practical way to evaluate what is limiting the model and decide which optimization steps to take first.
What techniques would you use to optimize the performance of an AI model in production?