

You have a classifier that outputs a score for each case, and the team needs a decision threshold before launch. Different thresholds change the balance between false positives and false negatives, so the right choice depends on the cost of each error and the volume the downstream team can handle.
How do you choose an appropriate threshold for a classifier?