VoltGrid Manufacturing operates 1,200 industrial machines across 18 plants and wants to predict equipment failures 24 hours in advance so maintenance can be scheduled before unplanned downtime occurs. False negatives are costly because each missed failure can stop a production line, but too many false positives overload the maintenance team.
The training data is built from machine-hour snapshots over the last 24 months.
| Feature Group | Count | Examples |
|---|---|---|
| Sensor aggregates | 18 | temperature_mean_1h, vibration_std_6h, pressure_max_24h |
| Operating context | 9 | load_pct, shift_id, ambient_temp, line_speed |
| Maintenance history | 7 | days_since_last_service, component_replaced_30d, failure_count_90d |
| Machine metadata | 6 | machine_type, plant_id, install_age_days, vendor |
| Derived trend features | 12 | temp_slope_6h, vibration_delta_24h, rolling_fault_rate |
A good solution should achieve strong ranking quality and support an operating point that catches most failures without overwhelming maintenance capacity. Target at least 0.45 PR-AUC, 0.88 ROC-AUC, and 70% recall with precision above 25% on a held-out time-based test set.