Explain Lab Yield Neural Network

Business Context

NovaThera runs a high-throughput biology platform that executes ~40,000 wet-lab experiments per month. The R&D leadership team wants a model that predicts whether an assay run will succeed, but the senior experimentalist approving deployment is skeptical of black-box AI and requires a clear, evidence-based explanation of the model.

Dataset

You are given historical assay-run data collected over 24 months.

Feature Group	Count	Examples
Assay setup	12	assay_type, reagent_lot, plate_format, incubation_minutes
Instrument telemetry	18	temperature_mean, pressure_std, dispense_error_rate, calibration_score
Sample metadata	10	sample_source, concentration_ng_ml, storage_days, operator_experience
Quality controls	8	control_signal_mean, control_cv, contamination_flag, baseline_drift
Derived features	6	temp_range, signal_to_noise, reagent_age_days, run_order_within_batch

Size: 96K assay runs, 54 features
Target: Binary — assay success (1) vs failure/repeat required (0)
Class balance: 68% success, 32% failure
Missing data: 9% missing in telemetry due to sensor dropouts, 6% missing in sample metadata for external partners

Success Criteria

A good solution should achieve strong predictive performance while producing explanations a senior experimentalist can validate against domain knowledge. The model should reach ROC-AUC >= 0.86, PR-AUC >= 0.74, and provide both global and per-run explanations.

Constraints

Explanations must be understandable to non-ML stakeholders
Batch inference must score 40K runs in under 10 minutes
Retraining is allowed monthly, not daily
The final recommendation must compare the neural network to a simpler baseline

Deliverables

Train a baseline interpretable model and a neural network classifier
Evaluate both models on a held-out test set with appropriate metrics
Produce feature-level and example-level explanations for the neural network
Describe how you would explain the architecture, predictions, and limitations to a skeptical senior experimentalist
Recommend whether the neural network is suitable for production given performance and interpretability tradeoffs

Business Context

Dataset

You are given historical assay-run data collected over 24 months.

Feature Group	Count	Examples
Assay setup	12	assay_type, reagent_lot, plate_format, incubation_minutes
Instrument telemetry	18	temperature_mean, pressure_std, dispense_error_rate, calibration_score
Sample metadata	10	sample_source, concentration_ng_ml, storage_days, operator_experience
Quality controls	8	control_signal_mean, control_cv, contamination_flag, baseline_drift
Derived features	6	temp_range, signal_to_noise, reagent_age_days, run_order_within_batch

Size: 96K assay runs, 54 features
Target: Binary — assay success (1) vs failure/repeat required (0)
Class balance: 68% success, 32% failure
Missing data: 9% missing in telemetry due to sensor dropouts, 6% missing in sample metadata for external partners

Success Criteria

Constraints

Explanations must be understandable to non-ML stakeholders
Batch inference must score 40K runs in under 10 minutes
Retraining is allowed monthly, not daily
The final recommendation must compare the neural network to a simpler baseline

Deliverables

Train a baseline interpretable model and a neural network classifier
Evaluate both models on a held-out test set with appropriate metrics
Produce feature-level and example-level explanations for the neural network
Describe how you would explain the architecture, predictions, and limitations to a skeptical senior experimentalist
Recommend whether the neural network is suitable for production given performance and interpretability tradeoffs

Business Context

Dataset

You are given historical assay-run data collected over 24 months.

Feature Group	Count	Examples
Assay setup	12	assay_type, reagent_lot, plate_format, incubation_minutes
Instrument telemetry	18	temperature_mean, pressure_std, dispense_error_rate, calibration_score
Sample metadata	10	sample_source, concentration_ng_ml, storage_days, operator_experience
Quality controls	8	control_signal_mean, control_cv, contamination_flag, baseline_drift
Derived features	6	temp_range, signal_to_noise, reagent_age_days, run_order_within_batch

Size: 96K assay runs, 54 features
Target: Binary — assay success (1) vs failure/repeat required (0)
Class balance: 68% success, 32% failure
Missing data: 9% missing in telemetry due to sensor dropouts, 6% missing in sample metadata for external partners

Success Criteria

Constraints

Explanations must be understandable to non-ML stakeholders
Batch inference must score 40K runs in under 10 minutes
Retraining is allowed monthly, not daily
The final recommendation must compare the neural network to a simpler baseline

Deliverables

Train a baseline interpretable model and a neural network classifier
Evaluate both models on a held-out test set with appropriate metrics
Produce feature-level and example-level explanations for the neural network
Describe how you would explain the architecture, predictions, and limitations to a skeptical senior experimentalist
Recommend whether the neural network is suitable for production given performance and interpretability tradeoffs

Business Context

Dataset

You are given historical assay-run data collected over 24 months.

Feature Group	Count	Examples
Assay setup	12	assay_type, reagent_lot, plate_format, incubation_minutes
Instrument telemetry	18	temperature_mean, pressure_std, dispense_error_rate, calibration_score
Sample metadata	10	sample_source, concentration_ng_ml, storage_days, operator_experience
Quality controls	8	control_signal_mean, control_cv, contamination_flag, baseline_drift
Derived features	6	temp_range, signal_to_noise, reagent_age_days, run_order_within_batch

Size: 96K assay runs, 54 features
Target: Binary — assay success (1) vs failure/repeat required (0)
Class balance: 68% success, 32% failure
Missing data: 9% missing in telemetry due to sensor dropouts, 6% missing in sample metadata for external partners

Success Criteria

Constraints

Explanations must be understandable to non-ML stakeholders
Batch inference must score 40K runs in under 10 minutes
Retraining is allowed monthly, not daily
The final recommendation must compare the neural network to a simpler baseline

Deliverables

Train a baseline interpretable model and a neural network classifier
Evaluate both models on a held-out test set with appropriate metrics
Produce feature-level and example-level explanations for the neural network
Describe how you would explain the architecture, predictions, and limitations to a skeptical senior experimentalist
Recommend whether the neural network is suitable for production given performance and interpretability tradeoffs

Interview Guides

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Explain Lab Yield Neural Network

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Your Answer

Explain Lab Yield Neural Network

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Explain Lab Yield Neural Network

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Your Answer