Evaluate an LLM System

MediumGenerative AI & LLMs00:00

Delta Electronics Americas

Your interviewer · AI Engineer

In session

Interviewer

Welcome to your interview for the AI Engineer role at Delta Electronics Americas.

The question is on your right: Evaluate an LLM System. Take a moment with it first.

Talk your thinking through with me if you like - when you're confident, submit your answer and I'll grade it like a real screen. You have three graded attempts to score 7/10 or better.

Only Submit answer is graded - discussion is free practice.

Problem

Scenario

You are working on an LLM-powered product feature and need a clear way to judge whether the model is good enough to ship and improve over time. The outputs are open-ended, so simple accuracy is not enough.

Question

How do you evaluate the performance of a generative model?