Screen YouTube Spam with Baselines

Business Context

A Google Research Scientist candidate is asked to reason through a realistic first-round ML problem rather than recite isolated theory. Build a lightweight classifier for YouTube comment spam detection and compare a classical ML baseline with a simple neural baseline, explaining the statistical and modeling tradeoffs clearly.

Dataset

Use a historical moderation dataset from YouTube comments collected across popular channels. Each row is one comment with metadata available at prediction time.

Feature Group	Count	Examples
Text	1 raw field	comment_text
Numeric metadata	6	comment_length, url_count, emoji_count, uppercase_ratio, account_age_days, prior_flags
Categorical metadata	3	language, device_type, channel_topic
Temporal	2	hour_of_day, day_of_week

Size: 120K comments, 12 engineered non-text features plus raw text
Target: Binary label — spam (1) vs non-spam (0)
Class balance: 18% spam, 82% non-spam
Missing data: ~8% missing in account_age_days and language; some comments are empty after text cleaning

Success Criteria

A good solution should outperform a majority-class baseline and deliver strong ranking quality for moderation triage. Aim for AUC-ROC > 0.92, F1 > 0.78, and recall > 0.85 at precision >= 0.75 on a held-out test set.

Constraints

Inference should support near-real-time moderation in <20 ms per comment in a Google production setting
The solution should be interpretable enough to explain obvious spam signals to Trust & Safety reviewers
Retraining can run daily; serving cost should remain modest

Deliverables

Train a classical baseline model and one simple deep learning baseline
Justify feature preprocessing, regularization, and validation strategy
Evaluate with threshold-free and threshold-based metrics
Explain how you would choose the operating threshold for moderation
Describe failure modes, including language drift and adversarial spam patterns

Business Context

Dataset

Use a historical moderation dataset from YouTube comments collected across popular channels. Each row is one comment with metadata available at prediction time.

Feature Group	Count	Examples
Text	1 raw field	comment_text
Numeric metadata	6	comment_length, url_count, emoji_count, uppercase_ratio, account_age_days, prior_flags
Categorical metadata	3	language, device_type, channel_topic
Temporal	2	hour_of_day, day_of_week

Size: 120K comments, 12 engineered non-text features plus raw text
Target: Binary label — spam (1) vs non-spam (0)
Class balance: 18% spam, 82% non-spam
Missing data: ~8% missing in account_age_days and language; some comments are empty after text cleaning

Success Criteria

Constraints

Inference should support near-real-time moderation in <20 ms per comment in a Google production setting
The solution should be interpretable enough to explain obvious spam signals to Trust & Safety reviewers
Retraining can run daily; serving cost should remain modest

Deliverables

Train a classical baseline model and one simple deep learning baseline
Justify feature preprocessing, regularization, and validation strategy
Evaluate with threshold-free and threshold-based metrics
Explain how you would choose the operating threshold for moderation
Describe failure modes, including language drift and adversarial spam patterns

Business Context

Dataset

Use a historical moderation dataset from YouTube comments collected across popular channels. Each row is one comment with metadata available at prediction time.

Feature Group	Count	Examples
Text	1 raw field	comment_text
Numeric metadata	6	comment_length, url_count, emoji_count, uppercase_ratio, account_age_days, prior_flags
Categorical metadata	3	language, device_type, channel_topic
Temporal	2	hour_of_day, day_of_week

Size: 120K comments, 12 engineered non-text features plus raw text
Target: Binary label — spam (1) vs non-spam (0)
Class balance: 18% spam, 82% non-spam
Missing data: ~8% missing in account_age_days and language; some comments are empty after text cleaning

Success Criteria

Constraints

Inference should support near-real-time moderation in <20 ms per comment in a Google production setting
The solution should be interpretable enough to explain obvious spam signals to Trust & Safety reviewers
Retraining can run daily; serving cost should remain modest

Deliverables

Train a classical baseline model and one simple deep learning baseline
Justify feature preprocessing, regularization, and validation strategy
Evaluate with threshold-free and threshold-based metrics
Explain how you would choose the operating threshold for moderation
Describe failure modes, including language drift and adversarial spam patterns

Business Context

Dataset

Use a historical moderation dataset from YouTube comments collected across popular channels. Each row is one comment with metadata available at prediction time.

Feature Group	Count	Examples
Text	1 raw field	comment_text
Numeric metadata	6	comment_length, url_count, emoji_count, uppercase_ratio, account_age_days, prior_flags
Categorical metadata	3	language, device_type, channel_topic
Temporal	2	hour_of_day, day_of_week

Size: 120K comments, 12 engineered non-text features plus raw text
Target: Binary label — spam (1) vs non-spam (0)
Class balance: 18% spam, 82% non-spam
Missing data: ~8% missing in account_age_days and language; some comments are empty after text cleaning

Success Criteria

Constraints

Inference should support near-real-time moderation in <20 ms per comment in a Google production setting
The solution should be interpretable enough to explain obvious spam signals to Trust & Safety reviewers
Retraining can run daily; serving cost should remain modest

Deliverables

Train a classical baseline model and one simple deep learning baseline
Justify feature preprocessing, regularization, and validation strategy
Evaluate with threshold-free and threshold-based metrics
Explain how you would choose the operating threshold for moderation
Describe failure modes, including language drift and adversarial spam patterns

Interview Guides

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Screen YouTube Spam with Baselines

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Your Answer

Screen YouTube Spam with Baselines

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Screen YouTube Spam with Baselines

Business Context

Dataset

Success Criteria

Constraints

Deliverables

Your Answer