ShopFlow, an e-commerce operations platform processing 2M support tickets per year, wants to automatically route incoming tickets to the correct queue (billing, shipping, returns, technical issue, or account access). The team wants an applied ML solution using Hugging Face libraries that can be trained quickly, evaluated rigorously, and deployed for low-latency inference.
You are given a historical dataset of customer support tickets labeled by the operations team.
| Feature Group | Count | Examples |
|---|---|---|
| Text | 2 | subject, message_body |
| Categorical | 3 | channel, customer_tier, region |
| Numeric | 4 | account_age_days, prior_ticket_count, order_value, attachment_count |
| Metadata | 2 | created_at, language |
billing, shipping, returns, technical, account_accesssubject, 6% missing numeric fields, and occasional empty message bodies after redactionA production-ready solution should achieve macro F1 >= 0.84 on a held-out test set and p95 inference latency < 120 ms per ticket on CPU batch inference. The routing team also wants per-class precision/recall so they can understand failure modes for minority classes.
datasets, transformers, and evaluate