Dataford
Interview Guides
Upgrade
All questions/NLP/Choose Fine-Tuning for Support Routing

Choose Fine-Tuning for Support Routing

Easy
NLP
Word EmbeddingsText ClassificationLanguage Models

Problem

Business Context

ZendeskPro wants to automate routing of inbound enterprise support tickets. The team is deciding whether to deploy a prompt-based large language model using in-context learning or fine-tune a smaller transformer for stable production classification.

Data

You have 420,000 historical English support tickets labeled into 6 routing queues: Billing, Technical Bug, Account Access, Feature Request, Compliance, and Sales. Ticket bodies range from 10 to 900 words (median 120), often include pasted logs, order IDs, URLs, and product names. Class distribution is moderately imbalanced: Technical Bug 34%, Billing 22%, Account Access 18%, Feature Request 14%, Compliance 7%, Sales 5%. About 3% of labels are noisy due to manual reassignment after first triage.

Success Criteria

A production-ready recommendation should achieve at least 0.88 macro-F1, keep p95 inference latency below 150 ms per ticket, and produce consistent predictions across repeated runs. The solution should also explain when in-context learning is still preferable despite lower consistency.

Constraints

  • Daily volume: ~60,000 tickets
  • Deployment on a single A10G GPU or CPU batch inference
  • Customer data cannot be sent to third-party APIs
  • Weekly label refreshes are available

Requirements

  1. Build a baseline in-context learning approach and a fine-tuned transformer classifier.
  2. Define a realistic preprocessing pipeline for noisy support text.
  3. Compare the two approaches on accuracy, latency, cost, and operational stability.
  4. Explain when you would choose fine-tuning versus in-context learning for this task.
  5. Provide Python code for preprocessing, training, inference, and evaluation.
  6. Recommend a final production approach and justify trade-offs clearly.

Problem

Business Context

ZendeskPro wants to automate routing of inbound enterprise support tickets. The team is deciding whether to deploy a prompt-based large language model using in-context learning or fine-tune a smaller transformer for stable production classification.

Data

You have 420,000 historical English support tickets labeled into 6 routing queues: Billing, Technical Bug, Account Access, Feature Request, Compliance, and Sales. Ticket bodies range from 10 to 900 words (median 120), often include pasted logs, order IDs, URLs, and product names. Class distribution is moderately imbalanced: Technical Bug 34%, Billing 22%, Account Access 18%, Feature Request 14%, Compliance 7%, Sales 5%. About 3% of labels are noisy due to manual reassignment after first triage.

Success Criteria

A production-ready recommendation should achieve at least 0.88 macro-F1, keep p95 inference latency below 150 ms per ticket, and produce consistent predictions across repeated runs. The solution should also explain when in-context learning is still preferable despite lower consistency.

Constraints

  • Daily volume: ~60,000 tickets
  • Deployment on a single A10G GPU or CPU batch inference
  • Customer data cannot be sent to third-party APIs
  • Weekly label refreshes are available

Requirements

  1. Build a baseline in-context learning approach and a fine-tuned transformer classifier.
  2. Define a realistic preprocessing pipeline for noisy support text.
  3. Compare the two approaches on accuracy, latency, cost, and operational stability.
  4. Explain when you would choose fine-tuning versus in-context learning for this task.
  5. Provide Python code for preprocessing, training, inference, and evaluation.
  6. Recommend a final production approach and justify trade-offs clearly.
Your answer
Try one AI text evaluation on us
Get structured feedback, scored against a 4-axis rubric. Premium unlocks unlimited.
0 wordstarget ~200
Up next
Fine-Tune Support Intent ClassifierMediumCompare NLP Algorithms for Ticket RoutingEasySynechronChoose Prompting or Fine-TuningHard
Next question