You are building an LLM feature for a product team and need to decide how to handle different request types with the least complexity. Some answers can come from the model directly, some need grounding in external knowledge, and some need behavior changes that are hard to get from prompting alone.
When would you use a simple prompt, when would you use retrieval-augmented generation (RAG), and when would you fine-tune a model?
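
One way to frame the decision is as a small routing function that starts from the cheapest option and adds complexity only when a request needs it. The sketch below is illustrative only: `RequestProfile`, its flags, and `choose_approaches` are hypothetical names, and it assumes you can already tell which of the three categories a request falls into.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Approach(Enum):
    PROMPT = "simple prompt"
    RAG = "retrieval-augmented generation"
    FINE_TUNE = "fine-tuning"


@dataclass
class RequestProfile:
    # Hypothetical flags; a real system would have to derive these itself.
    needs_external_knowledge: bool = False  # answer depends on data outside the model
    needs_behavior_change: bool = False     # style, format, or task behavior must shift
    prompting_already_tried: bool = False   # prompt-only attempts did not get there


def choose_approaches(profile: RequestProfile) -> List[Approach]:
    """Return the approaches to combine, least complex first."""
    # Every option still starts from a prompt.
    approaches = [Approach.PROMPT]

    # Ground the answer in retrieved documents when the model alone cannot know it.
    if profile.needs_external_knowledge:
        approaches.append(Approach.RAG)

    # Reach for fine-tuning only after prompting has failed to change behavior.
    if profile.needs_behavior_change and profile.prompting_already_tried:
        approaches.append(Approach.FINE_TUNE)

    return approaches
```

The function returns a list rather than a single choice because the options are not mutually exclusive: a request can need both grounding and a behavior change. Gating fine-tuning on `prompting_already_tried` reflects the "least complexity" constraint in the question, not a hard rule.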