Dataford
Interview Guides
Upgrade
All questions/Machine Learning/Harmful Video Upload Detection Pipeline

Harmful Video Upload Detection Pipeline

Hard
Machine Learning
Asked at 1 company1Supervised LearningDeep LearningFeature Engineering
Also asked at
Snap

Problem

Business Context

You’re on the Trust & Safety ML team at StreamShare, a short-form video platform with 85M DAU and ~12M video uploads/day globally. Regulators and app stores require fast removal of CSAM, graphic violence, self-harm encouragement, extremist propaganda, and hate symbols. Human moderators can review only ~0.5–1% of uploads, so the company needs an ML system that can auto-block the worst content in near real time and route borderline cases to moderation queues.

The product requirement is: within 30 seconds of upload, produce a risk score and an action: ALLOW, QUEUE, or BLOCK. False negatives are high-severity safety incidents; false positives harm creators and retention. You are asked to design the first production version using “traditional ML” framing (classification, training, evaluation), but you may use pretrained embeddings.

Dataset

You have a labeled dataset built from moderator decisions and policy team audits.

ComponentScaleWhat you getNotes
Video metadata180M uploadsduration, country, device, account_age_days, prior_strikes, upload_hourSome features are missing for new users (~8%)
Visual frames180M videos1 FPS sampled frames (JPEG) + timestampsAvg 18 frames/video; long tail up to 10 minutes
Audio140M videos16kHz mono waveform + VAD segments22% missing (no audio track)
ASR transcript110M videostext + word-level timestamps + confidenceASR quality varies by language; 35% non-English
Labels180M videosmulti-label policy tags + final actionLabels are noisy; inter-rater agreement ~0.78
  • Target: For this interview, model a binary label: harmful=1 if policy requires BLOCK (CSAM/graphic violence/self-harm/extremism), else 0.
  • Class balance: Highly imbalanced — about 0.35% harmful overall; spikes to 1–2% in some regions.
  • Leakage risk: Some creators re-upload similar content; duplicates exist.

Success Criteria

  1. Safety operating point: Achieve ≥ 90% recall on harmful content at ≤ 2.0% false positive rate (FPR) on a held-out, recent-time test set.
  2. Queue efficiency: Among the top 1% highest-risk uploads, achieve ≥ 25% precision (so moderators see meaningful hits).
  3. Latency: End-to-end scoring within 30s of upload; model inference budget < 200ms per video on CPU for metadata + embeddings (GPU optional only for offline feature extraction).
  4. Robustness: Performance should not collapse on new uploads (cold-start users) and should be monitored for policy drift.

Constraints

  • Privacy/compliance: Raw video cannot be retained beyond 30 days in some regions; you should prefer storing derived features/embeddings.
  • Interpretability: Trust & Safety ops need reason codes (e.g., “violent imagery signal high”, “hate symbol detected”) for appeals.
  • Training: You have a weekly training window on a Spark + GPU feature-extraction cluster; online serving is Kubernetes CPU.

Deliverables (what you must produce in the interview)

  1. Propose an end-to-end ML approach: feature extraction, model choice, and how you combine modalities.
  2. Define a leakage-safe split strategy and cross-validation plan.
  3. Describe how you handle extreme class imbalance and label noise.
  4. Specify evaluation metrics and how you pick thresholds for ALLOW/QUEUE/BLOCK.
  5. Provide a training + evaluation Python skeleton (production-quality) and explain how you’d deploy and retrain.

Problem

Business Context

You’re on the Trust & Safety ML team at StreamShare, a short-form video platform with 85M DAU and ~12M video uploads/day globally. Regulators and app stores require fast removal of CSAM, graphic violence, self-harm encouragement, extremist propaganda, and hate symbols. Human moderators can review only ~0.5–1% of uploads, so the company needs an ML system that can auto-block the worst content in near real time and route borderline cases to moderation queues.

The product requirement is: within 30 seconds of upload, produce a risk score and an action: ALLOW, QUEUE, or BLOCK. False negatives are high-severity safety incidents; false positives harm creators and retention. You are asked to design the first production version using “traditional ML” framing (classification, training, evaluation), but you may use pretrained embeddings.

Dataset

You have a labeled dataset built from moderator decisions and policy team audits.

ComponentScaleWhat you getNotes
Video metadata180M uploadsduration, country, device, account_age_days, prior_strikes, upload_hourSome features are missing for new users (~8%)
Visual frames180M videos1 FPS sampled frames (JPEG) + timestampsAvg 18 frames/video; long tail up to 10 minutes
Audio140M videos16kHz mono waveform + VAD segments22% missing (no audio track)
ASR transcript110M videostext + word-level timestamps + confidenceASR quality varies by language; 35% non-English
Labels180M videosmulti-label policy tags + final actionLabels are noisy; inter-rater agreement ~0.78
  • Target: For this interview, model a binary label: harmful=1 if policy requires BLOCK (CSAM/graphic violence/self-harm/extremism), else 0.
  • Class balance: Highly imbalanced — about 0.35% harmful overall; spikes to 1–2% in some regions.
  • Leakage risk: Some creators re-upload similar content; duplicates exist.

Success Criteria

  1. Safety operating point: Achieve ≥ 90% recall on harmful content at ≤ 2.0% false positive rate (FPR) on a held-out, recent-time test set.
  2. Queue efficiency: Among the top 1% highest-risk uploads, achieve ≥ 25% precision (so moderators see meaningful hits).
  3. Latency: End-to-end scoring within 30s of upload; model inference budget < 200ms per video on CPU for metadata + embeddings (GPU optional only for offline feature extraction).
  4. Robustness: Performance should not collapse on new uploads (cold-start users) and should be monitored for policy drift.

Constraints

  • Privacy/compliance: Raw video cannot be retained beyond 30 days in some regions; you should prefer storing derived features/embeddings.
  • Interpretability: Trust & Safety ops need reason codes (e.g., “violent imagery signal high”, “hate symbol detected”) for appeals.
  • Training: You have a weekly training window on a Spark + GPU feature-extraction cluster; online serving is Kubernetes CPU.

Deliverables (what you must produce in the interview)

  1. Propose an end-to-end ML approach: feature extraction, model choice, and how you combine modalities.
  2. Define a leakage-safe split strategy and cross-validation plan.
  3. Describe how you handle extreme class imbalance and label noise.
  4. Specify evaluation metrics and how you pick thresholds for ALLOW/QUEUE/BLOCK.
  5. Provide a training + evaluation Python skeleton (production-quality) and explain how you’d deploy and retrain.
Your answer
Try one AI text evaluation on us
Get structured feedback, scored against a 4-axis rubric. Premium unlocks unlimited.
0 wordstarget ~200
Up next
Evaluate Harmful Content ClassifierMedium
Next question