Machine Learning & Statistical Foundations
This area is the bedrock of your technical evaluation. Tiger Analytics interviewers will probe your understanding of core algorithms to ensure you can select the right tool for a client's specific problem. Strong performance means you can confidently derive basic algorithms, explain their underlying assumptions, and diagnose model performance issues.
Be ready to go over:
- Supervised vs. Unsupervised Learning – Understanding when to apply regression, classification, or clustering, and the trade-offs of each.
- Model Evaluation Metrics – Deep knowledge of Precision, Recall, F1-score, ROC-AUC, RMSE, and MAPE, and knowing which to optimize based on the business context.
- Bias-Variance Tradeoff – Explaining overfitting and underfitting, and how to address them using regularization (L1/L2) or ensemble methods.
- Advanced concepts (less common) –
- Mathematics behind gradient descent and backpropagation.
- Handling highly imbalanced datasets using SMOTE or custom loss functions.
- Deep dive into tree-based models (XGBoost, LightGBM) hyperparameter tuning.
Example questions or scenarios:
- "Walk me through the mathematical difference between Ridge and Lasso regression, and when you would use each."
- "If your Random Forest model is overfitting, what specific hyperparameters would you adjust and why?"
- "How would you explain an ROC curve to a non-technical marketing executive?"
Domain Expertise: GenAI & Market Mix Modeling
Depending on the specific Data Scientist role you are targeting, you will face a deep dive into either advanced modern AI frameworks or econometric modeling. Tiger Analytics has distinct tracks, and you are expected to be an expert in your chosen domain.
Be ready to go over:
- Generative AI Architectures – Understanding Transformers, attention mechanisms, and the differences between various foundational models.
- RAG & Prompt Engineering – Designing Retrieval-Augmented Generation systems, vector embeddings, and optimizing prompts for enterprise use cases.
- Market Mix Modeling (MMM) – Time-series analysis, adstock transformations, diminishing returns, and multi-touch attribution methods.
- Advanced concepts (less common) –
- Fine-tuning open-source LLMs (LoRA, QLoRA) for proprietary client data.
- Bayesian approaches to MMM and handling multicollinearity in marketing data.
Example questions or scenarios:
- "Design a RAG architecture for a client who wants to query highly sensitive internal financial documents."
- "In an MMM project, how do you handle the carryover effect of a TV advertising campaign?"
- "Explain the trade-offs between using a zero-shot prompt on GPT-4 versus fine-tuning a smaller model like LLaMA-3."
Coding and Data Manipulation
Your ability to write efficient, bug-free code is critical. Interviewers will test your practical skills in extracting, cleaning, and manipulating data, as this is a significant part of your day-to-day work. Strong candidates write clean code, handle edge cases proactively, and understand the computational complexity of their operations.
Be ready to go over:
- SQL Mastery – Complex joins, window functions, CTEs (Common Table Expressions), and performance optimization.
- Python for Data Science – Extensive use of Pandas for data wrangling, aggregations, merges, and handling missing values.
- Data Pipeline Logic – Structuring code that can scale from a local Jupyter notebook to a production environment.
- Advanced concepts (less common) –
- PySpark for distributed data processing.
- Algorithmic complexity (Big O notation) for custom data transformation functions.
Example questions or scenarios:
- "Write a SQL query to find the top 3 selling products in each category, along with their running total revenue."
- "Given a messy Pandas DataFrame with missing dates and duplicate customer IDs, write a function to clean and aggregate the weekly sales."
- "How would you optimize a Python script that is running out of memory when processing a 10GB CSV file?"
Business Case & Consulting Mindset
Because you will be working directly with clients, Tiger Analytics places a heavy emphasis on your ability to translate business ambiguity into structured analytical frameworks. Strong performance here involves asking the right questions, designing a logical roadmap, and communicating with clarity and confidence.
Be ready to go over:
- Problem Scoping – Identifying the core business objective, constraints, and success criteria before touching any data.
- End-to-End Pipeline Design – Architecting a solution from data ingestion and feature engineering to model deployment and monitoring.
- Stakeholder Management – Explaining technical limitations, managing expectations, and presenting actionable insights.
- Advanced concepts (less common) –
- Designing A/B tests to validate model impact in a live production environment.
- Estimating the ROI or cost-savings of a proposed machine learning solution.
Example questions or scenarios:
- "A retail client wants to reduce customer churn. Walk me through your entire approach from the first kickoff meeting to the final model delivery."
- "Your model shows a significant drop in accuracy after being deployed in production for three months. How do you investigate and communicate this to the client?"
- "How would you convince a skeptical business leader to trust your machine learning model over their intuition?"