VoltSense operates 18,000 industrial IoT sensors across manufacturing plants and uses hourly forecasts to trigger maintenance alerts. The current forecasting pipeline is unstable because raw sensor streams contain spikes, dropouts, and short-term measurement noise.
You are given a historical univariate-to-multivariate time series dataset for power_load_forecasting. Each row is one sensor-hour observation, and the goal is to predict the next 24 hours of power_load_kw while making the training data robust to noise.
| Feature Group | Count | Examples |
|---|---|---|
| Sensor readings | 8 | power_load_kw, voltage, current, temperature, vibration |
| Calendar features | 6 | hour_of_day, day_of_week, weekend_flag, holiday_flag |
| Lag features | 12 | load_t-1, load_t-6, load_t-24, temp_t-1 |
| Rolling statistics | 10 | rolling_mean_6h, rolling_std_24h, rolling_median_12h |
| Quality indicators | 4 | missing_flag, sensor_reset_flag, outlier_score, packet_loss_rate |
power_load_kw for the next 24 hoursA good solution should reduce forecast error relative to a raw-data baseline and remain stable during noisy periods. Target MAE < 2.8 kW, RMSE < 4.1 kW, and at least 15% lower MAE on noisy windows versus a model trained on unsmoothed inputs only.