Design Online and Batch ML Serving

Hard

System Design, Feature Store, Retrieval, Model Serving

Asked 2 months ago

ElevenLabs

Asked 3 times

Problem

Scenario

You are building an AI voice platform with personalization across discovery, voice selection, and content recommendations. Some predictions must react to fresh user behavior, while others can be precomputed and served cheaply.
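One way to make the split in the scenario concrete is to compare how stale a prediction may be against how often a batch job runs. The sketch below is only an illustration: the `PredictionSurface` type, the surface names, the staleness budgets, and the nightly batch interval are all assumptions, not details given in the problem.

```python
from dataclasses import dataclass

# Assumed cadence of the batch precomputation job (nightly); not given in the problem.
BATCH_INTERVAL_S = 24 * 3600


@dataclass(frozen=True)
class PredictionSurface:
    """Hypothetical description of one place in the product that shows predictions."""
    name: str
    max_staleness_s: float  # how old a prediction may be before it stops being useful


def serving_mode(surface: PredictionSurface, batch_interval_s: float = BATCH_INTERVAL_S) -> str:
    """Pick 'batch' when the surface tolerates results at least one batch interval old."""
    if surface.max_staleness_s >= batch_interval_s:
        return "batch"
    return "online"


# Illustrative surfaces with made-up staleness budgets.
weekly_digest = PredictionSurface("weekly_voice_digest", max_staleness_s=7 * 24 * 3600)
session_rerank = PredictionSurface("in_session_voice_rerank", max_staleness_s=60)

print(serving_mode(weekly_digest))
print(serving_mode(session_rerank))
```

A real decision would also weigh cost per request, latency budget, and the size of the key space to precompute. Staleness tolerance is just the simplest first cut.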

Question

How do you design online versus batch serving for an AI product?

What this tests

Choosing between batch precomputation and online inference
Designing retrieval, ranking, and re-ranking stages
Using a feature store to avoid training-serving skew
Handling feature drift, cold start, and fallback paths
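The topics above can be tied together in a small sketch: candidates come from a batch-precomputed table, an online step re-ranks them with fresh features, and a popularity list covers cold-start users. Everything here is a hypothetical stand-in. The dictionaries play the role of a batch key-value store and an online feature store, and the voice IDs and scores are invented for illustration.

```python
from typing import Dict, List

# Hypothetical batch output: user -> precomputed candidate voices (e.g. from a nightly job).
BATCH_CANDIDATES: Dict[str, List[str]] = {"u1": ["voice_a", "voice_b", "voice_c"]}

# Hypothetical online feature store: user -> fresh per-voice affinity from recent behavior.
ONLINE_FEATURES: Dict[str, Dict[str, float]] = {"u1": {"voice_c": 0.9, "voice_a": 0.2}}

# Hypothetical global fallback for users with no batch row (cold start).
POPULAR_FALLBACK: List[str] = ["voice_x", "voice_y"]


def retrieve(user_id: str) -> List[str]:
    """Retrieval stage: cheap lookup of batch-precomputed candidates."""
    return BATCH_CANDIDATES.get(user_id, [])


def rerank(user_id: str, candidates: List[str]) -> List[str]:
    """Re-ranking stage: reorder by fresh features; keep batch order if none exist."""
    fresh = ONLINE_FEATURES.get(user_id)
    if not fresh:
        return list(candidates)
    # Candidates without a fresh score default to 0.0; sorted() is stable for ties.
    return sorted(candidates, key=lambda c: fresh.get(c, 0.0), reverse=True)


def recommend(user_id: str, k: int = 2) -> List[str]:
    """Serve top-k: batch retrieval, online re-rank, popularity fallback on cold start."""
    candidates = retrieve(user_id)
    if not candidates:
        return POPULAR_FALLBACK[:k]
    return rerank(user_id, candidates)[:k]


print(recommend("u1"))        # ['voice_c', 'voice_a']
print(recommend("new_user"))  # ['voice_x', 'voice_y']
```

In a fuller design, the same feature definitions would feed both training and the online lookup to limit training-serving skew, and the fallback path would also trigger on feature-store timeouts, not only on missing rows.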

Practicing as: Software Engineer interview at ElevenLabs

