Compare Bagging and Boosting for Claims Risk

Business Context

BrightShield Insurance wants a simple, evidence-based explanation of when to use bagging vs boosting for a tabular risk model. The analytics team currently uses single decision trees to predict whether an auto insurance claim will become high-cost, but performance is unstable across retraining runs.

Dataset

You are given a historical claims dataset for binary classification.

Feature Group	Count	Examples
Policyholder profile	6	age, tenure_months, region, vehicle_age
Claim details	8	claim_amount_initial, accident_type, repair_estimate, police_report_flag
Driving history	5	prior_claims_3y, violations_2y, annual_mileage
Policy attributes	4	coverage_type, deductible, premium_amount, channel
Derived operational fields	3	days_to_report, claim_to_premium_ratio, weekend_accident_flag

Size: 120,000 claims (rows), 26 features
Target: high_cost_claim = 1 if final payout exceeds $15,000, else 0
Class balance: 18% positive, 82% negative
Missing data: ~7% missing in repair_estimate, ~4% missing in annual_mileage, <1% elsewhere
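
For local experimentation before the real extract is loaded, a synthetic stand-in with the same overall shape may be useful. The sketch below is only a placeholder under several assumptions: it uses scikit-learn's make_classification to produce 26 numeric features with roughly 18% positives, then blanks out values in two columns at the stated missing rates. The helper name make_stand_in_claims, the reduced row count, and the column positions standing in for repair_estimate and annual_mileage are hypothetical. The real table also contains categorical fields such as region and coverage_type, which would need encoding.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split


def make_stand_in_claims(n_rows=5000, seed=42):
    """Synthetic placeholder for the claims table (not real data)."""
    X, y = make_classification(
        n_samples=n_rows,
        n_features=26,
        n_informative=10,
        n_redundant=4,
        weights=[0.82, 0.18],  # roughly 18% high-cost claims
        flip_y=0.0,
        random_state=seed,
    )
    rng = np.random.default_rng(seed)
    # Hypothetical column positions for the two features with notable gaps.
    repair_estimate_col, annual_mileage_col = 8, 16
    X[rng.random(n_rows) < 0.07, repair_estimate_col] = np.nan
    X[rng.random(n_rows) < 0.04, annual_mileage_col] = np.nan
    return X, y


X, y = make_stand_in_claims()

# Stratify so the train and test sets keep the same share of positives.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)
print(f"positive rate: train={y_train.mean():.3f}, test={y_test.mean():.3f}")
print(f"missing in column 8: {np.isnan(X[:, 8]).mean():.3f}")
print(f"missing in column 16: {np.isnan(X[:, 16]).mean():.3f}")
```

Stratifying the split matters with an 18/82 class balance, since an unstratified split can shift the positive rate between train and test by chance.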

Success Criteria

A strong solution should:

clearly explain the practical difference between bagging and boosting in simple terms,
implement and compare at least one bagging model and one boosting model,
improve over a single decision tree on out-of-sample performance,
provide a recommendation for production given interpretability and latency constraints.

Constraints

Batch scoring runs hourly on up to 50,000 claims
Underwriting stakeholders want a simple explanation of model behavior
Training must be reproducible and must complete on a standard laptop or small VM
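
One way to check the reproducibility and batch-scoring constraints is sketched below. It is an illustration on synthetic data, not a benchmark: the model choice (GradientBoostingClassifier), its settings, the SEED value, and the helper fit_model are all assumptions, and timings depend on the machine and on the real feature pipeline.

```python
import time

import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier

SEED = 42
BATCH_ROWS = 50_000  # upper bound on one hourly batch

# Synthetic placeholder data; the real claims features would replace this.
X, y = make_classification(n_samples=3000, n_features=26, n_informative=10,
                           weights=[0.82, 0.18], random_state=SEED)
batch = np.random.default_rng(SEED).normal(size=(BATCH_ROWS, 26))


def fit_model(seed):
    # Fixing random_state pins the randomness inside the estimator.
    model = GradientBoostingClassifier(n_estimators=50, max_depth=3,
                                       random_state=seed)
    return model.fit(X, y)


model_a, model_b = fit_model(SEED), fit_model(SEED)

# Time scoring for one full-size batch.
start = time.perf_counter()
scores = model_a.predict_proba(batch)[:, 1]
elapsed = time.perf_counter() - start

# Two fits with the same seed and data should score identically.
same = np.allclose(scores, model_b.predict_proba(batch)[:, 1])
print(f"scored {len(scores):,} rows in {elapsed:.2f}s; identical refit: {same}")
```

The same check can be repeated for each candidate model, so the latency comparison uses the same batch size as production scoring.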

Deliverables

Explain bagging vs boosting in plain language for a non-technical stakeholder.
Train a baseline decision tree, a bagging model, and a boosting model.
Compare models using cross-validated and test-set metrics.
Summarize bias-variance tradeoffs and failure modes for each approach.
Recommend which approach to deploy for this claims dataset and why.
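
A possible starting point for the training and comparison deliverables is sketched below. It is a minimal illustration, not a reference solution: the data is a synthetic placeholder, and the estimator choices (BaggingClassifier over decision trees, GradientBoostingClassifier), hyperparameters, median imputation, and the helper with_imputer are assumptions that would need tuning on the real claims table.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier, GradientBoostingClassifier
from sklearn.impute import SimpleImputer
from sklearn.metrics import average_precision_score, roc_auc_score
from sklearn.model_selection import (StratifiedKFold, cross_val_score,
                                     train_test_split)
from sklearn.pipeline import make_pipeline
from sklearn.tree import DecisionTreeClassifier

SEED = 42

# Synthetic placeholder with ~18% positives; swap in the real claims table.
X, y = make_classification(n_samples=2000, n_features=26, n_informative=10,
                           weights=[0.82, 0.18], random_state=SEED)
X[np.random.default_rng(SEED).random(X.shape) < 0.02] = np.nan  # add gaps
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=SEED)


def with_imputer(estimator):
    # Imputation sits inside the pipeline so each CV fold fits it only on
    # that fold's training portion.
    return make_pipeline(SimpleImputer(strategy="median"), estimator)


models = {
    "single_tree": with_imputer(DecisionTreeClassifier(random_state=SEED)),
    # Bagging: many deep trees on bootstrap samples, predictions averaged.
    "bagging": with_imputer(BaggingClassifier(
        DecisionTreeClassifier(random_state=SEED),
        n_estimators=25, random_state=SEED)),
    # Boosting: shallow trees added one at a time to correct earlier errors.
    "boosting": with_imputer(GradientBoostingClassifier(
        n_estimators=60, max_depth=3, learning_rate=0.1, random_state=SEED)),
}

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=SEED)
results = {}
for name, model in models.items():
    cv_auc = cross_val_score(model, X_train, y_train, cv=cv, scoring="roc_auc")
    proba = model.fit(X_train, y_train).predict_proba(X_test)[:, 1]
    results[name] = {
        "cv_auc_mean": cv_auc.mean(),
        "cv_auc_std": cv_auc.std(),  # spread across folds hints at stability
        "test_auc": roc_auc_score(y_test, proba),
        "test_pr_auc": average_precision_score(y_test, proba),
    }
    print(name, {k: round(float(v), 3) for k, v in results[name].items()})
```

PR AUC is reported alongside ROC AUC because positives are the minority class. Rankings on synthetic data say nothing about the real claims, so the comparison should be rerun on the actual dataset before any recommendation is made.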
