You are working on an NLP system and need to turn raw text into features that a model can learn from. The task could be something like document classification, routing, or tagging, and the main challenge is choosing representations that capture useful signal without adding unnecessary complexity.
How do you approach the problem of feature extraction in natural language processing?