
You are training a binary classifier where the positive class is rare, as is common in fraud detection. A model with high accuracy can still be useless if it misses most true positives or floods investigators with false alarms.
How do you handle highly imbalanced datasets, which are common in insurance fraud detection?