Classify AI Framework Questions

Business Context

LexiPrep, an interview preparation platform, wants to automatically route user-submitted AI questions to the correct framework family so editors can review content faster. A common failure case is confusion between framework ecosystems such as NGA and ADK, especially when users ask comparative questions in short, ambiguous text.

Data

You are given a corpus of 180,000 historical question titles and descriptions from AI learning forums and internal content libraries.

Task: classify each question into one of 4 labels: NGA, ADK, Both/Comparison, Other AI Framework
Text length: 8-220 words, median 34 words
Language: English only
Label distribution: 28% NGA, 24% ADK, 18% Both/Comparison, 30% Other AI Framework
Noise: abbreviations, inconsistent capitalization, product aliases, and partial framework names are common

Success Criteria

A production-ready solution should achieve macro-F1 >= 0.86, with F1 >= 0.90 on the Both/Comparison class because those questions are most valuable for editorial review.

Constraints

Inference latency must stay below 80 ms per query
The model must run on a single CPU-backed API pod or one small GPU
Editors need probability scores for fallback manual review on low-confidence predictions

Requirements

Build a multi-class text classification pipeline for framework-related questions.
Design preprocessing for abbreviations, aliases, punctuation noise, and short-text ambiguity.
Implement a modern Python solution using a transformer fine-tuning workflow.
Evaluate class-wise performance and propose a confidence-thresholding strategy.
Explain how your system distinguishes direct comparison questions from single-framework questions.
Describe how you would monitor drift as new framework names enter the taxonomy.

Business Context

Data

You are given a corpus of 180,000 historical question titles and descriptions from AI learning forums and internal content libraries.

Task: classify each question into one of 4 labels: NGA, ADK, Both/Comparison, Other AI Framework
Text length: 8-220 words, median 34 words
Language: English only
Label distribution: 28% NGA, 24% ADK, 18% Both/Comparison, 30% Other AI Framework
Noise: abbreviations, inconsistent capitalization, product aliases, and partial framework names are common

Success Criteria

A production-ready solution should achieve macro-F1 >= 0.86, with F1 >= 0.90 on the Both/Comparison class because those questions are most valuable for editorial review.

Constraints

Inference latency must stay below 80 ms per query
The model must run on a single CPU-backed API pod or one small GPU
Editors need probability scores for fallback manual review on low-confidence predictions

Requirements

Build a multi-class text classification pipeline for framework-related questions.
Design preprocessing for abbreviations, aliases, punctuation noise, and short-text ambiguity.
Implement a modern Python solution using a transformer fine-tuning workflow.
Evaluate class-wise performance and propose a confidence-thresholding strategy.
Explain how your system distinguishes direct comparison questions from single-framework questions.
Describe how you would monitor drift as new framework names enter the taxonomy.

Business Context

Data

You are given a corpus of 180,000 historical question titles and descriptions from AI learning forums and internal content libraries.

Task: classify each question into one of 4 labels: NGA, ADK, Both/Comparison, Other AI Framework
Text length: 8-220 words, median 34 words
Language: English only
Label distribution: 28% NGA, 24% ADK, 18% Both/Comparison, 30% Other AI Framework
Noise: abbreviations, inconsistent capitalization, product aliases, and partial framework names are common

Success Criteria

A production-ready solution should achieve macro-F1 >= 0.86, with F1 >= 0.90 on the Both/Comparison class because those questions are most valuable for editorial review.

Constraints

Inference latency must stay below 80 ms per query
The model must run on a single CPU-backed API pod or one small GPU
Editors need probability scores for fallback manual review on low-confidence predictions

Requirements

Build a multi-class text classification pipeline for framework-related questions.
Design preprocessing for abbreviations, aliases, punctuation noise, and short-text ambiguity.
Implement a modern Python solution using a transformer fine-tuning workflow.
Evaluate class-wise performance and propose a confidence-thresholding strategy.
Explain how your system distinguishes direct comparison questions from single-framework questions.
Describe how you would monitor drift as new framework names enter the taxonomy.

Business Context

Data

You are given a corpus of 180,000 historical question titles and descriptions from AI learning forums and internal content libraries.

Task: classify each question into one of 4 labels: NGA, ADK, Both/Comparison, Other AI Framework
Text length: 8-220 words, median 34 words
Language: English only
Label distribution: 28% NGA, 24% ADK, 18% Both/Comparison, 30% Other AI Framework
Noise: abbreviations, inconsistent capitalization, product aliases, and partial framework names are common

Success Criteria

A production-ready solution should achieve macro-F1 >= 0.86, with F1 >= 0.90 on the Both/Comparison class because those questions are most valuable for editorial review.

Constraints

Inference latency must stay below 80 ms per query
The model must run on a single CPU-backed API pod or one small GPU
Editors need probability scores for fallback manual review on low-confidence predictions

Requirements

Build a multi-class text classification pipeline for framework-related questions.
Design preprocessing for abbreviations, aliases, punctuation noise, and short-text ambiguity.
Implement a modern Python solution using a transformer fine-tuning workflow.
Evaluate class-wise performance and propose a confidence-thresholding strategy.
Explain how your system distinguishes direct comparison questions from single-framework questions.
Describe how you would monitor drift as new framework names enter the taxonomy.

Interview Guides

Business Context

Data

Success Criteria

Constraints

Requirements

Classify AI Framework Questions

Business Context

Data

Success Criteria

Constraints

Requirements

Your Answer

Classify AI Framework Questions

Business Context

Data

Success Criteria

Constraints

Requirements

Classify AI Framework Questions

Business Context

Data

Success Criteria

Constraints

Requirements

Your Answer