You are speaking with a customer team that is new to LLM-powered products. They have heard terms like context window and retrieval-augmented generation, but they are mixing them together and are not sure when each one matters.
How would you explain the mechanics of an LLM's context window and retrieval-augmented generation to that team, in a way that is technically accurate but easy to follow?