Quantera Bio runs high-sensitivity Simoa immunoassays to detect low-abundance protein biomarkers in blood and cerebrospinal fluid. The assay team wants a model that classifies whether a sample is biomarker-positive and estimates concentration quality from instrument and assay-derived features, reducing manual review while preserving scientific trust.
You are given historical assay runs collected from a Simoa platform. Simoa (Single Molecule Array) works by isolating immunocomplexes on beads into femtoliter-sized wells so fluorescence can be measured at near single-molecule resolution. Each row represents one processed sample from a completed run.
| Feature Group | Count | Examples |
|---|---|---|
| Instrument signal | 12 | avg_enzyme_per_bead, fluorescence_intensity_mean, on_bead_fraction, dark_count_rate |
| Assay chemistry | 9 | capture_antibody_lot, detector_lot, incubation_minutes, wash_cycles |
| Sample metadata | 8 | specimen_type, dilution_factor, storage_days, freeze_thaw_count |
| Run QC | 11 | calibrator_r2, blank_signal, cv_replicates, plate_position, operator_id |
| Derived features | 6 | signal_to_blank, replicate_delta, normalized_bead_count |
A good solution should achieve strong recall on positive samples without overwhelming the lab with false positives. Target ROC-AUC > 0.88, PR-AUC > 0.60, and recall >= 0.85 at precision >= 0.55.