Set Threshold for High-Risk Routing

Context

StreamSafe uses a binary classifier to score user-generated posts for policy risk and route the highest-risk content to a human safety queue. The current threshold was set six months ago, but policy operations now report that too many harmful posts are slipping through while reviewers are also near capacity.

Current Performance

Validation set size: 200,000 posts. Harmful content prevalence: 2.5% (5,000 posts).

Threshold	Precision	Recall	F1	FPR	Posts Routed/Day	Harmful Posts Caught/Day
0.80	0.91	0.42	0.57	0.2%	1,150	525
0.65	0.78	0.61	0.68	0.5%	2,050	763
0.50 (current)	0.64	0.76	0.69	1.1%	3,400	950
0.35	0.46	0.88	0.60	2.8%	6,100	1,100
0.20	0.29	0.95	0.44	6.4%	10,900	1,188

The Problem

Leadership wants a threshold recommendation for routing high-risk content. Missing truly harmful content has regulatory and brand risk, but false positives consume reviewer bandwidth and delay benign posts.

Requirements

Recommend the most appropriate threshold and justify it using the table above.
Explain the precision-recall tradeoff in this routing setting.
Assess whether the model appears well calibrated enough for threshold-based operations.
Describe what additional offline and online validation you would run before changing the threshold.
Propose follow-up improvements if no single threshold cleanly satisfies business needs.

Constraints

Human review capacity: 4,000 posts/day
Estimated cost of a false negative: 20x a false positive
Benign posts sent to review add moderation delay and creator friction
Threshold changes can be deployed immediately; retraining takes 10 days

Context

Current Performance

Validation set size: 200,000 posts. Harmful content prevalence: 2.5% (5,000 posts).

Threshold	Precision	Recall	F1	FPR	Posts Routed/Day	Harmful Posts Caught/Day
0.80	0.91	0.42	0.57	0.2%	1,150	525
0.65	0.78	0.61	0.68	0.5%	2,050	763
0.50 (current)	0.64	0.76	0.69	1.1%	3,400	950
0.35	0.46	0.88	0.60	2.8%	6,100	1,100
0.20	0.29	0.95	0.44	6.4%	10,900	1,188

The Problem

Requirements

Recommend the most appropriate threshold and justify it using the table above.
Explain the precision-recall tradeoff in this routing setting.
Assess whether the model appears well calibrated enough for threshold-based operations.
Describe what additional offline and online validation you would run before changing the threshold.
Propose follow-up improvements if no single threshold cleanly satisfies business needs.

Constraints

Human review capacity: 4,000 posts/day
Estimated cost of a false negative: 20x a false positive
Benign posts sent to review add moderation delay and creator friction
Threshold changes can be deployed immediately; retraining takes 10 days

Context

Current Performance

Validation set size: 200,000 posts. Harmful content prevalence: 2.5% (5,000 posts).

Threshold	Precision	Recall	F1	FPR	Posts Routed/Day	Harmful Posts Caught/Day
0.80	0.91	0.42	0.57	0.2%	1,150	525
0.65	0.78	0.61	0.68	0.5%	2,050	763
0.50 (current)	0.64	0.76	0.69	1.1%	3,400	950
0.35	0.46	0.88	0.60	2.8%	6,100	1,100
0.20	0.29	0.95	0.44	6.4%	10,900	1,188

The Problem

Requirements

Recommend the most appropriate threshold and justify it using the table above.
Explain the precision-recall tradeoff in this routing setting.
Assess whether the model appears well calibrated enough for threshold-based operations.
Describe what additional offline and online validation you would run before changing the threshold.
Propose follow-up improvements if no single threshold cleanly satisfies business needs.

Constraints

Human review capacity: 4,000 posts/day
Estimated cost of a false negative: 20x a false positive
Benign posts sent to review add moderation delay and creator friction
Threshold changes can be deployed immediately; retraining takes 10 days

Context

Current Performance

Validation set size: 200,000 posts. Harmful content prevalence: 2.5% (5,000 posts).

Threshold	Precision	Recall	F1	FPR	Posts Routed/Day	Harmful Posts Caught/Day
0.80	0.91	0.42	0.57	0.2%	1,150	525
0.65	0.78	0.61	0.68	0.5%	2,050	763
0.50 (current)	0.64	0.76	0.69	1.1%	3,400	950
0.35	0.46	0.88	0.60	2.8%	6,100	1,100
0.20	0.29	0.95	0.44	6.4%	10,900	1,188

The Problem

Requirements

Recommend the most appropriate threshold and justify it using the table above.
Explain the precision-recall tradeoff in this routing setting.
Assess whether the model appears well calibrated enough for threshold-based operations.
Describe what additional offline and online validation you would run before changing the threshold.
Propose follow-up improvements if no single threshold cleanly satisfies business needs.

Constraints

Human review capacity: 4,000 posts/day
Estimated cost of a false negative: 20x a false positive
Benign posts sent to review add moderation delay and creator friction
Threshold changes can be deployed immediately; retraining takes 10 days

Interview Guides

Context

Current Performance

The Problem

Requirements

Constraints

Set Threshold for High-Risk Routing

Context

Current Performance

The Problem

Requirements

Constraints

Your Answer

Set Threshold for High-Risk Routing

Context

Current Performance

The Problem

Requirements

Constraints

Set Threshold for High-Risk Routing

Context

Current Performance

The Problem

Requirements

Constraints

Your Answer