ShopLens, an e-commerce analytics company, is building a sentiment classifier for product reviews. The team wants to know whether stop-word removal should be part of the preprocessing pipeline, and under what conditions it may reduce model quality.
You are given 420,000 English product reviews collected over 18 months from electronics, home, and beauty categories. Reviews range from 3 to 250 words (median: 38 words). Labels are positive (62%), negative (24%), and neutral (14%). The text contains contractions, negations, emojis, misspellings, and short phrases such as "not worth it", "never again", and "I do recommend it".
A good solution should clearly explain stop-word removal, implement at least two preprocessing variants, and show when removing stop words helps or hurts performance. Target macro-F1 ≥ 0.84 on the held-out test set, with special attention to errors involving negation and short reviews.