LexiFlow uses a binary classifier to decide whether incoming insurance claim documents can be auto-approved or must go to human reviewers. Today, all claims above a model score threshold are still manually reviewed, and leadership wants to know when the workflow has enough signal to safely automate a subset of decisions.
| Metric | Manual Review Queue (Current Threshold 0.70) | Candidate Auto-Approve Band (Score 0.92) |
|---|---|---|
| Volume share | 18.0% of claims | 6.5% of claims |
| Precision | 0.91 | 0.985 |
| Recall | 0.54 | 0.31 |
| F1 Score | 0.68 | 0.47 |
| AUC-ROC | 0.93 | 0.93 |
| False positive rate | 0.8% | 0.12% |
| Calibration error (ECE) | 0.041 | 0.018 |
| Avg claims/day | 120,000 | 7,800 |
The operations team can manually review only 22,000 claims per day, and backlog has grown 19% month over month. Auto-approving the highest-confidence claims would reduce cost and latency, but an incorrect approval creates regulatory and financial risk.