AMD Construction Group collects customer feedback from post-project surveys, warranty emails, and AMD BuildSmart support tickets. The customer experience team wants a sentiment analysis system that converts this unstructured text into actionable signal for project quality monitoring and escalation.
You are given roughly 180,000 feedback records from the last 24 months across residential and commercial projects. Text is primarily English (96%), with some Spanish comments. Feedback ranges from 5 to 400 words (median: 42 words) and includes free-form comments, contractor references, schedule complaints, punch-list issues, and praise for crews. Labels are available for a subset of 45,000 records with three classes: negative (28%), neutral (37%), and positive (35%). Around 12% of comments contain boilerplate survey text or duplicated signatures.
A good solution should achieve macro-F1 ≥ 0.82 on the labeled set and negative-class recall ≥ 0.90, since missed negative feedback can hide quality or service issues. The output should also support downstream aggregation by project, region, and product line.