Alibaba Group wants to better understand user feedback from Taobao product reviews and buyer-seller chat logs. Build an NLP solution that classifies short Chinese text into actionable categories so operations teams can route issues faster and summarize customer pain points.
You are given 1.5 million labeled text records collected from Taobao over the last 12 months. Each record is a short user-generated message or review, typically 8-120 Chinese characters (median 34), with occasional emojis, SKU codes, seller shorthand, and mixed Chinese-English tokens. Labels are moderately imbalanced across four classes:
The corpus is primarily Simplified Chinese, with some dialect terms, misspellings, and repeated characters for emphasis.
A good solution should achieve macro-F1 ≥ 0.84, with recall ≥ 0.88 on the Logistics/Delivery class because delayed shipment complaints must be escalated quickly. The model should support batch scoring of 5 million texts/day and provide interpretable outputs for analysts.