You’re the analytics lead at ResolveAI, a fintech/legaltech platform that helps insurers and claimants settle auto injury and property damage claims. The product serves 3.5M claims/year across the US and UK, and ~40% of claims now receive an AI-generated initial settlement offer that a human adjuster can accept, edit, or override. ResolveAI’s customers care about cycle time and cost, but they are also under increasing scrutiny from regulators and plaintiffs’ attorneys for disparate outcomes across protected classes.
Over the last quarter, ResolveAI expanded the model to recommend offers for more complex injury claims. Customer success reports that cycle time improved (median time-to-offer down from 3.2 days to 1.1 days), but a large insurer paused rollout after an internal audit suggested the AI’s offers were “systematically lower” for certain demographic groups after controlling for claim severity. The CEO asks you to design a single, reportable fairness metric (with supporting cuts) that can be used in:
Stakeholders disagree on what “fair” means:
You must propose a metric that is (a) hard to game, (b) decomposable into root causes, (c) stable enough to track weekly, and (d) aligned with business outcomes (retention, expansion, regulatory risk).
| Source | What it contains | Grain |
|---|---|---|
| claims | claim_id, insurer_id, jurisdiction, claim_type, injury_severity_score, policy_limits, claimant_age_band, claimant_gender (optional), protected_class_proxy (optional), attorney_representation, prior_claims_count | claim |
| ai_offers | claim_id, model_version, offer_amount, offer_timestamp, confidence_score, explanation_features (top-K), guardrail_flags | claim-offer |
| adjuster_actions | claim_id, adjuster_id, accepted_ai_offer (Y/N), edited_amount, override_reason, time_to_offer | claim |
| settlements | claim_id, final_settlement_amount, settlement_date, litigation_filed (Y/N), time_to_close | claim |
| customer_complaints | claim_id, complaint_type (bias/unfair/other), channel, resolution | complaint |
| external_benchmarks | jurisdiction-level typical payout ranges by claim_type/severity (industry data) | jurisdiction x segment |
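
To make "systematically lower after controlling for claim severity" concrete, here is a minimal sketch of one possible severity-adjusted comparison built from the sources above. It is an illustration under stated assumptions, not a recommended final metric. The field `benchmark_mid` (a midpoint of the external payout range), the severity band edges, the choice of reference group, and the min-count weighting are all hypothetical choices made for this example. The rows are assumed to be the result of joining `claims`, `ai_offers`, and `external_benchmarks` beforehand.

```python
from collections import defaultdict
from statistics import mean


def severity_band(score, edges=(3, 6)):
    """Bucket a severity score into low/mid/high. The edges are assumed, not from the case."""
    if score < edges[0]:
        return "low"
    if score < edges[1]:
        return "mid"
    return "high"


def stratified_offer_gap(rows, reference_group):
    """Weighted mean gap in offer/benchmark ratio relative to a reference group.

    Claims are compared only within the same (jurisdiction, claim_type,
    severity band) stratum. A negative gap means the group's AI offers run
    lower, relative to benchmark, than the reference group's.
    """
    # stratum -> group -> list of offer/benchmark ratios
    strata = defaultdict(lambda: defaultdict(list))
    for r in rows:
        # protected_class_proxy is optional in the source data; skip when missing
        if r["protected_class_proxy"] is None or r["benchmark_mid"] <= 0:
            continue
        key = (r["jurisdiction"], r["claim_type"],
               severity_band(r["injury_severity_score"]))
        ratio = r["offer_amount"] / r["benchmark_mid"]
        strata[key][r["protected_class_proxy"]].append(ratio)

    num, den = defaultdict(float), defaultdict(float)
    for groups in strata.values():
        if reference_group not in groups:
            continue  # no within-stratum comparison is possible
        ref = groups[reference_group]
        for group, ratios in groups.items():
            if group == reference_group:
                continue
            # Assumed weighting: the smaller of the two cell sizes
            weight = min(len(ratios), len(ref))
            num[group] += weight * (mean(ratios) - mean(ref))
            den[group] += weight
    return {g: num[g] / den[g] for g in num if den[g] > 0}


def _row(jur, ctype, sev, group, offer, bench):
    return {"jurisdiction": jur, "claim_type": ctype,
            "injury_severity_score": sev, "protected_class_proxy": group,
            "offer_amount": offer, "benchmark_mid": bench}


# Toy, made-up rows purely to exercise the function
rows = [
    _row("TX", "injury", 7, "A", 10000, 10000),
    _row("TX", "injury", 8, "A", 10000, 10000),
    _row("TX", "injury", 7, "B", 9000, 10000),
    _row("TX", "injury", 9, "B", 9000, 10000),
    _row("UK", "property", 2, "A", 2000, 2000),
    _row("UK", "property", 1, "B", 2000, 2000),
    _row("UK", "property", 1, None, 1500, 2000),  # dropped: no proxy recorded
]
gaps = stratified_offer_gap(rows, reference_group="A")
print(gaps)
```

In the toy data, group B sits about 10 points below group A in the high-severity Texas injury stratum and level with it in the UK property stratum, so the weighted gap lands between the two. Because the result is a sum over strata, it can be broken back down by jurisdiction, claim type, or severity band, which is one way to approach the "decomposable" requirement.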
Constraints: