Detect Card Fraud with Imbalanced Labels

Business Context

PayWave processes millions of card transactions per day and wants a fraud detection model to score transactions before approval. Fraud is rare, but missed fraud is expensive, while too many false positives create customer friction and declined legitimate payments.

Dataset

You are given a historical transaction dataset for supervised binary classification.

Feature Group	Count	Examples
Transaction attributes	12	amount, merchant_category, entry_mode, card_present, currency
Customer behavior	9	avg_txn_amount_7d, txn_count_24h, chargeback_count_90d
Merchant risk	6	merchant_fraud_rate_30d, merchant_country, terminal_age_days
Temporal & geo	7	hour_of_day, day_of_week, billing_shipping_distance_km
Device & channel	8	device_id_hash, browser_type, app_version, ip_risk_score

Size: 2.4M transactions, 42 features
Target: is_fraud where 1 = fraudulent transaction, 0 = legitimate transaction
Class balance: 0.9% positive, 99.1% negative
Missing data: ~12% missing in device fields, ~4% missing in geo features, some unseen categories in production

Success Criteria

A strong solution should improve fraud capture materially over a majority-class baseline and be evaluated with metrics appropriate for severe class imbalance. Good enough means achieving recall >= 70% at precision >= 20% on the fraud class, with PR-AUC > 0.18 on a held-out test set.

Constraints

Inference must complete in under 50 ms per transaction.
The fraud operations team needs feature-level explanations for flagged transactions.
Training can run offline daily; scoring happens online.
Avoid data leakage from post-transaction or chargeback-resolution fields.

Deliverables

Propose a modeling approach for highly imbalanced binary classification.
Define preprocessing and feature engineering for mixed data types and missing values.
Choose evaluation metrics and threshold selection strategy appropriate for fraud.
Explain how you would validate the model without leakage.
Provide production-ready Python code for training, evaluation, and threshold tuning.

Business Context

Dataset

You are given a historical transaction dataset for supervised binary classification.

Feature Group	Count	Examples
Transaction attributes	12	amount, merchant_category, entry_mode, card_present, currency
Customer behavior	9	avg_txn_amount_7d, txn_count_24h, chargeback_count_90d
Merchant risk	6	merchant_fraud_rate_30d, merchant_country, terminal_age_days
Temporal & geo	7	hour_of_day, day_of_week, billing_shipping_distance_km
Device & channel	8	device_id_hash, browser_type, app_version, ip_risk_score

Size: 2.4M transactions, 42 features
Target: is_fraud where 1 = fraudulent transaction, 0 = legitimate transaction
Class balance: 0.9% positive, 99.1% negative
Missing data: ~12% missing in device fields, ~4% missing in geo features, some unseen categories in production

Success Criteria

Constraints

Inference must complete in under 50 ms per transaction.
The fraud operations team needs feature-level explanations for flagged transactions.
Training can run offline daily; scoring happens online.
Avoid data leakage from post-transaction or chargeback-resolution fields.

Deliverables

Propose a modeling approach for highly imbalanced binary classification.
Define preprocessing and feature engineering for mixed data types and missing values.
Choose evaluation metrics and threshold selection strategy appropriate for fraud.
Explain how you would validate the model without leakage.
Provide production-ready Python code for training, evaluation, and threshold tuning.

Business Context

Dataset

You are given a historical transaction dataset for supervised binary classification.

Feature Group	Count	Examples
Transaction attributes	12	amount, merchant_category, entry_mode, card_present, currency
Customer behavior	9	avg_txn_amount_7d, txn_count_24h, chargeback_count_90d
Merchant risk	6	merchant_fraud_rate_30d, merchant_country, terminal_age_days
Temporal & geo	7	hour_of_day, day_of_week, billing_shipping_distance_km
Device & channel	8	device_id_hash, browser_type, app_version, ip_risk_score

Size: 2.4M transactions, 42 features
Target: is_fraud where 1 = fraudulent transaction, 0 = legitimate transaction
Class balance: 0.9% positive, 99.1% negative
Missing data: ~12% missing in device fields, ~4% missing in geo features, some unseen categories in production

Success Criteria

Constraints

Inference must complete in under 50 ms per transaction.
The fraud operations team needs feature-level explanations for flagged transactions.
Training can run offline daily; scoring happens online.
Avoid data leakage from post-transaction or chargeback-resolution fields.

Deliverables

Propose a modeling approach for highly imbalanced binary classification.
Define preprocessing and feature engineering for mixed data types and missing values.
Choose evaluation metrics and threshold selection strategy appropriate for fraud.
Explain how you would validate the model without leakage.
Provide production-ready Python code for training, evaluation, and threshold tuning.

Business Context

Dataset

You are given a historical transaction dataset for supervised binary classification.

Feature Group	Count	Examples
Transaction attributes	12	amount, merchant_category, entry_mode, card_present, currency
Customer behavior	9	avg_txn_amount_7d, txn_count_24h, chargeback_count_90d
Merchant risk	6	merchant_fraud_rate_30d, merchant_country, terminal_age_days
Temporal & geo	7	hour_of_day, day_of_week, billing_shipping_distance_km
Device & channel	8	device_id_hash, browser_type, app_version, ip_risk_score

Size: 2.4M transactions, 42 features
Target: is_fraud where 1 = fraudulent transaction, 0 = legitimate transaction
Class balance: 0.9% positive, 99.1% negative
Missing data: ~12% missing in device fields, ~4% missing in geo features, some unseen categories in production

Success Criteria

Constraints

Inference must complete in under 50 ms per transaction.
The fraud operations team needs feature-level explanations for flagged transactions.
Training can run offline daily; scoring happens online.
Avoid data leakage from post-transaction or chargeback-resolution fields.

Deliverables

Propose a modeling approach for highly imbalanced binary classification.
Define preprocessing and feature engineering for mixed data types and missing values.
Choose evaluation metrics and threshold selection strategy appropriate for fraud.
Explain how you would validate the model without leakage.
Provide production-ready Python code for training, evaluation, and threshold tuning.

Interview Guides

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Detect Card Fraud with Imbalanced Labels

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Your Answer

Detect Card Fraud with Imbalanced Labels

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Detect Card Fraud with Imbalanced Labels

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Your Answer