Monitor Deployed Loan Risk Model

Context

Microsoft uses a gradient-boosted classifier deployed on Azure Machine Learning to predict default risk for small business credit applications in Dynamics 365 Finance. The model performed well at launch, but six months later the risk team reports more unexpected defaults, while the model’s approval rate has stayed nearly unchanged.

Current Performance

Metric	At Launch	Current	Change
Precision	0.78	0.74	-0.04
Recall	0.81	0.63	-0.18
F1 Score	0.79	0.68	-0.11
AUC-ROC	0.87	0.82	-0.05
Log Loss	0.41	0.53	+0.12
Brier Score	0.16	0.21	+0.05
Approval Rate	61%	60%	-1 pt
Monthly default rate on approved loans	2.9%	4.7%	+1.8 pts
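As a quick sanity check on the table, the F1 values can be recomputed from the precision and recall columns. The sketch below is only an arithmetic check; the function name `f1_score` is an illustrative helper, not part of any Azure Machine Learning API.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the table above.
launch_f1 = f1_score(0.78, 0.81)
current_f1 = f1_score(0.74, 0.63)

# Relative changes make the asymmetry visible: recall fell far more than precision.
recall_drop = (0.81 - 0.63) / 0.81
precision_drop = (0.78 - 0.74) / 0.78

print(round(launch_f1, 2), round(current_f1, 2))  # 0.79 0.68
print(round(recall_drop, 3), round(precision_drop, 3))  # 0.222 0.051
```

The reported F1 scores match the other columns, and the recall loss (about 22% relative) is roughly four times the precision loss, which is the pattern the diagnosis has to explain.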

The Problem

You need to design a post-deployment monitoring approach and diagnose whether the issue is threshold drift, score miscalibration, feature drift, or a broader change in borrower behavior. Assume labels arrive with a 60-day delay, so label-dependent metrics such as precision, recall, and log loss are only available after that lag.
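Because labels lag by 60 days, one label-free signal a candidate might propose is the population stability index (PSI) on input features or on the model's output scores. The following is a minimal sketch, assuming fixed bin edges taken from the launch-time distribution; the function name `psi`, the `eps` floor, and the sample values are illustrative assumptions, not something specified in the problem.

```python
import math
from bisect import bisect_right
from typing import List, Sequence


def psi(expected: Sequence[float], actual: Sequence[float],
        edges: Sequence[float], eps: float = 1e-6) -> float:
    """Population stability index between two samples over shared bin edges.

    `edges` must be sorted ascending; they define len(edges) + 1 bins.
    `eps` floors empty bins so the logarithm stays finite (an assumption).
    """
    if not expected or not actual:
        raise ValueError("both samples must be non-empty")

    def proportions(values: Sequence[float]) -> List[float]:
        counts = [0] * (len(edges) + 1)
        for v in values:
            # bisect_right gives the number of edges <= v, i.e. the bin index.
            counts[bisect_right(edges, v)] += 1
        return [max(c / len(values), eps) for c in counts]

    p = proportions(expected)
    q = proportions(actual)
    return sum((qi - pi) * math.log(qi / pi) for pi, qi in zip(p, q))


# Hypothetical score samples: launch-time baseline vs. a recent window.
baseline = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8]
recent = [0.5, 0.6, 0.7, 0.8, 0.9, 0.9, 0.95, 0.99]
bin_edges = [0.25, 0.5, 0.75]

print(psi(baseline, baseline, bin_edges))  # 0.0 for identical samples
print(psi(baseline, recent, bin_edges) > 0)  # True when the distribution shifts
```

PSI can be computed daily because it needs no outcomes; whatever alert level is chosen for it would have to be tuned against historical windows rather than taken from this sketch.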

Requirements

Explain which metrics you would monitor daily, weekly, and monthly after deployment.
Diagnose what the current metric pattern suggests about model health.
Identify likely root causes and how you would validate each one.
Recommend concrete actions to improve monitoring and restore performance.
State what alert thresholds and rollback criteria you would implement in Azure Machine Learning.

Constraints

False negatives are costly because missed high-risk applicants drive loan losses.
False positives reduce approvals and hurt revenue.
Full retraining takes 10 days and requires model risk review.
The business cannot reduce approval volume by more than 3 percentage points without executive sign-off.
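The approval-volume constraint bounds how far a threshold change alone can go without executive sign-off. The sketch below assumes the model outputs a default probability and that applications scoring below a cutoff are approved; the function name `strictest_cutoff`, the tolerance, and the synthetic scores are hypothetical and only illustrate the arithmetic of the 3-point limit.

```python
from bisect import bisect_left
from typing import Sequence, Tuple


def strictest_cutoff(scores: Sequence[float], baseline_approval: float,
                     max_drop: float = 0.03) -> Tuple[float, float]:
    """Lowest cutoff whose approval rate stays within `max_drop` of baseline.

    An application is approved when its score is strictly below the cutoff.
    Returns (cutoff, approval_rate at that cutoff).
    """
    if not scores:
        raise ValueError("scores must be non-empty")
    ordered = sorted(scores)
    floor = baseline_approval - max_drop
    for cutoff in sorted(set(ordered)):
        # bisect_left counts the scores strictly below the cutoff.
        approved = bisect_left(ordered, cutoff) / len(ordered)
        if approved + 1e-9 >= floor:  # small tolerance for float rounding
            return cutoff, approved
    # No observed score works as a cutoff: approve everything.
    return float("inf"), 1.0


# Synthetic, evenly spread scores standing in for a recent scoring window.
window_scores = [i / 100 for i in range(100)]
cutoff, approval = strictest_cutoff(window_scores, baseline_approval=0.60)
print(cutoff, approval)  # 0.57 0.57
```

This only answers how strict the cutoff can get within the volume limit. Whether that tightening recovers enough recall still has to be checked against matured labels.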
