You've launched a new model or feature and early signals look mixed. Some stakeholders point to usage gains, while others worry the quality may be drifting after the initial rollout. You need a clear way to tell whether the launch is actually working as intended once real users interact with it over time.
How would you evaluate whether a model or feature is working as intended after launch?