AbbVie is collecting internal survey responses from business teams about their experience with Artificial Intelligence and Large Language Models. The analytics team wants an NLP system that classifies each response into a primary capability area so leaders can quantify adoption patterns and identify where enablement is needed.
You are given 85,000 historical free-text responses from employees across commercial, medical, R&D, and operations teams submitted through an AbbVie internal feedback workflow. Responses are in English, range from 10 to 220 words (median 48), and are labeled into 5 classes: No Practical Experience (18%), Prompting / Summarization (27%), Information Retrieval / Search (21%), Text Analytics / Classification (19%), and Workflow Automation / Copilot Usage (15%). About 6% of responses contain abbreviations, product names, or pharma-specific terminology.
A good solution should achieve macro-F1 >= 0.82, with recall >= 0.88 for No Practical Experience and Workflow Automation / Copilot Usage so downstream reporting is reliable.