5 exercises — practice structuring strong English answers to data science and ML engineering interview questions: model drift, precision vs recall, model explainability, overfitting, and feature engineering.
How to structure ML interview answers
Drift questions: always distinguish data drift (P(X) changes) from concept drift (P(Y|X) changes) — most candidates miss this
Metrics questions: give the formula → explain the threshold mechanism → give contrasting real-world scenarios with reasoning
Explainability questions: three layers — frame as a decision, translate metrics to business language, use SHAP for specific predictions
Overfitting questions: give the loss curve signature → remedies with mechanism → connect to bias-variance trade-off
Feature engineering questions: open with domain knowledge → structured categories (temporal, encoding, interaction) → feature selection to prune noise
1 / 5
The interviewer asks: "How do you detect and handle model drift in a production ML system?" Which answer best demonstrates ML engineering maturity?
Option B is the strongest: it makes the critical distinction between data drift and concept drift (most candidates conflate them), names specific statistical tests (PSI, KS test) rather than just saying "monitor statistics", explains what drift looks like in practice (output distribution shift), and gives a complete set of response options including roll-back — showing that retraining is not always the only answer.
Data drift vs concept drift — the key distinction:
Data drift (covariate shift) — the distribution of X (input features) changes: P(X) changes, but P(Y|X) stays the same. Example: a new device type appears in traffic; the model was never trained on it.
Concept drift — the relationship between features and the target changes: P(Y|X) changes. Example: the definition of a "fraudulent transaction" shifts as fraud patterns evolve.
Detection tools:
PSI (Population Stability Index) — the industry standard for feature drift; PSI > 0.25 indicates major drift.
KS test (Kolmogorov-Smirnov) — a statistical test for distributional differences between two samples.
ADWIN, Page-Hinkley — drift detection algorithms for streaming data.
Response options hierarchy:
1. Retrain on fresh data (most common).
2. Retrain with time-decayed weights (emphasise recent data).
3. Roll back while investigating.
4. Feature engineering to capture the drifted dimension.
Option D is also strong (it mentions shadow deployments, a standard production ML deployment pattern) but misses the data/concept drift distinction.
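To make the detection tools concrete, here is a minimal sketch of a drift check combining PSI and the KS test. The `baseline` and `current` arrays, the bin count, and the synthetic data are illustrative assumptions, not part of the exercise.

```python
# Minimal sketch: feature-drift check with PSI and a two-sample KS test.
import numpy as np
from scipy.stats import ks_2samp

def psi(baseline, current, n_bins=10, eps=1e-6):
    """Population Stability Index of one feature between two samples."""
    # Bin edges come from quantiles of the baseline (training-time) data.
    edges = np.quantile(baseline, np.linspace(0, 1, n_bins + 1))
    expected = np.histogram(baseline, bins=edges)[0] / len(baseline)
    # Clip production values into the baseline range so every value lands in a bin.
    actual = np.histogram(np.clip(current, edges[0], edges[-1]), bins=edges)[0] / len(current)
    expected, actual = expected + eps, actual + eps  # avoid log(0)
    return np.sum((actual - expected) * np.log(actual / expected))

baseline = np.random.normal(0.0, 1.0, 10_000)   # stand-in for training data
current = np.random.normal(0.5, 1.0, 10_000)    # shifted production data

score = psi(baseline, current)
stat, p_value = ks_2samp(baseline, current)
print(f"PSI = {score:.3f}  (> 0.25 suggests major drift)")
print(f"KS statistic = {stat:.3f}, p = {p_value:.3g}")
```

In practice this check would run per feature on a schedule, with alerts feeding the response hierarchy above.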
2 / 5
The interviewer asks: "Explain the difference between precision and recall, and describe a real-world scenario where you would optimise for one over the other." Which answer demonstrates the deepest understanding?
Option B is the strongest: it gives the precise formulas with TP/FP/FN notation, explains the mechanism of the trade-off (the decision threshold), gives two contrasting real-world scenarios with the reasoning behind each choice, and addresses the imbalanced dataset problem — a key practical issue that exposes ML depth.
The formulas — always know these for interviews:
$\text{Precision} = \frac{TP}{TP + FP}$ (of what I predicted positive, how many were right?)
$\text{Recall} = \frac{TP}{TP + FN}$ (of all actual positives, how many did I find?)
$\text{F1} = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}}$ (harmonic mean)
The threshold mechanism: every classifier outputs a probability. At threshold 0.5, above = positive. Lowering to 0.3 catches more positives (recall up) but with more false positives (precision down). Raising to 0.8 is very conservative (precision up, recall down).
Imbalanced datasets: with 99% negative class, a model that always predicts negative has 99% accuracy but 0% recall — useless. Use precision-recall AUC, not accuracy.
Scenario decision framework:
False negative is more costly → optimise recall (medical, fraud, security).
False positive is more costly → optimise precision (spam, content moderation, legal).
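A small sketch of the threshold mechanism on a toy imbalanced dataset with a logistic regression (both are stand-ins chosen for illustration): sweeping the threshold shows precision and recall moving in opposite directions.

```python
# Minimal sketch: precision/recall trade-off as the decision threshold moves.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import precision_score, recall_score
from sklearn.model_selection import train_test_split

# Imbalanced toy data: roughly 90% negative, 10% positive.
X, y = make_classification(n_samples=5_000, weights=[0.9, 0.1], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

proba = LogisticRegression(max_iter=1_000).fit(X_tr, y_tr).predict_proba(X_te)[:, 1]

for threshold in (0.3, 0.5, 0.8):
    preds = (proba >= threshold).astype(int)
    p = precision_score(y_te, preds, zero_division=0)
    r = recall_score(y_te, preds)
    print(f"threshold={threshold:.1f}  precision={p:.2f}  recall={r:.2f}")
```

The printout makes the interview point directly: the lower threshold lifts recall at the cost of precision, the higher one does the reverse.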
3 / 5
The interviewer asks: "How would you explain your machine learning model and its predictions to a non-technical stakeholder?" Which answer demonstrates the best communication strategy?
Option B is the strongest: it presents a systematic three-layer approach (framing, performance translation, explainability), gives concrete before/after examples for each translation, names SHAP specifically (the industry-standard tool for ML explainability), and — critically — includes transparency about model limitations, which builds genuine stakeholder trust.
ML communication framework for interviews:
Layer 1: Frame in terms of decisions — connect the model output to the action the stakeholder will take. "The model gives you a ranked list of customers to call" is more useful than "the model outputs a probability vector."
Layer 2: Translate metrics — precision/recall → business hit-rate language; AUC → lift over baseline. "If we use the model we reach 75% of churners by contacting 10% of customers; without the model we'd need to contact 50% to reach the same 75%." This is called the lift.
Layer 3: Explainability tools —
SHAP (SHapley Additive exPlanations): assigns a contribution value to each feature for each individual prediction. Answers "why was this specific customer predicted to churn?"
LIME: local approximation — builds a simple interpretable model around a single prediction.
Feature importance (global): which features the model uses most across all predictions.
Trust through limitations — telling stakeholders where the model performs less well builds more confidence than hiding limitations.
Options C and D are competent but less structured and miss the three-layer framework.
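For the SHAP layer, a minimal sketch of explaining one individual prediction with `shap.TreeExplainer`. The churn features, toy labels, and model choice are assumptions; the sketch needs the `shap` package, and the exact shape returned by `shap_values` can vary between shap versions and model types.

```python
# Minimal sketch: per-feature SHAP contributions for one customer's prediction.
import numpy as np
import pandas as pd
import shap
from sklearn.ensemble import GradientBoostingClassifier

rng = np.random.default_rng(0)
X = pd.DataFrame({
    "days_since_last_login": rng.integers(0, 90, 1_000),
    "monthly_spend": rng.normal(50, 20, 1_000),
    "support_tickets": rng.poisson(1, 1_000),
})
y = (X["days_since_last_login"] > 45).astype(int)  # toy churn label

model = GradientBoostingClassifier().fit(X, y)

# TreeExplainer attributes each prediction to per-feature contributions.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)  # (n_samples, n_features) for this model

customer = 0  # "why was this specific customer predicted to churn?"
for name, value in sorted(zip(X.columns, shap_values[customer]),
                          key=lambda kv: -abs(kv[1])):
    print(f"{name:>25s}: {value:+.3f}")
```

The sorted printout is the kind of artefact that translates directly into stakeholder language: "this customer was flagged mainly because of 60 days without a login."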
4 / 5
The interviewer asks: "What is the difference between overfitting and underfitting, and how do you address each?" Which answer demonstrates the clearest mental model?
Option B is the strongest: it gives both the error pattern signatures (training vs. validation), prescribes the learning curve as the diagnostic tool, provides a comprehensive remedy list for each with the mechanism of each fix, and precisely maps the problem to the bias-variance trade-off framework — which is the theoretical foundation that interviewers are often probing for.
Bias-variance trade-off — the theoretical framework:
Bias — error from wrong model assumptions. A linear model fitting a quadratic relationship has high bias.
Variance — error from sensitivity to small fluctuations in training data. A deep tree that perfectly fits 100 training points has high variance.
Total error = Bias² + Variance + Irreducible noise.
Underfitting = high bias. Overfitting = high variance.
Regularisation remedies explained:
L2 (Ridge) — adds λ·Σw² to the loss; penalises large weights, drives them towards zero but not exactly zero.
L1 (Lasso) — adds λ·Σ|w| to the loss; drives some weights exactly to zero (sparse solutions, feature selection).
Dropout — randomly sets neurons to zero during training, forcing the network to learn redundant representations.
Diagnosis sequence:
1. Plot training loss vs. validation loss.
2. If the gap is large → overfitting.
3. If both are high → underfitting.
4. The learning curve also shows whether more data would help (overfitting curves converge with more data; underfitting curves don't).
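A small sketch of the diagnosis plus the L1/L2 mechanism on a toy dataset with many irrelevant features (all names, values, and regularisation strengths are illustrative): the unregularised model shows the train/validation gap, Ridge shrinks weights, and Lasso drives some exactly to zero.

```python
# Minimal sketch: train/validation gap as an overfitting signal,
# and L2 (Ridge) vs L1 (Lasso) regularisation effects on the weights.
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge, Lasso
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 50))                    # many features, few samples
y = 3 * X[:, 0] - 2 * X[:, 1] + rng.normal(scale=0.5, size=200)  # only 2 matter
X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

for name, model in [("unregularised", LinearRegression()),
                    ("ridge (L2)", Ridge(alpha=10.0)),
                    ("lasso (L1)", Lasso(alpha=0.1))]:
    model.fit(X_tr, y_tr)
    tr = mean_squared_error(y_tr, model.predict(X_tr))
    val = mean_squared_error(y_val, model.predict(X_val))
    zeros = int(np.sum(np.isclose(model.coef_, 0.0)))
    print(f"{name:>14s}  train MSE={tr:.3f}  val MSE={val:.3f}  zero weights={zeros}")
```

A large gap between train and validation MSE for the unregularised model is the overfitting signature described above; the Lasso row shows the sparsity mechanism (most of the 50 weights end up exactly zero).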
5 / 5
The interviewer asks: "How do you approach feature engineering, and what techniques do you commonly use?" Which answer best demonstrates practical ML experience?
Option B is the strongest: it opens with an important framing statement ("feature engineering is often more impactful than model selection" — a widely cited piece of practitioner wisdom), structures the answer by technique categories with concrete examples tied to a specific problem domain (a churn model), gives decision criteria for encoding choices (why target encoding for high cardinality), and includes the important counter-point that more features is not always better.
Feature engineering vocabulary:
Temporal features — time-since events, rolling window aggregations, trend/velocity. Critical for user behaviour models.
Encoding strategies:
Label encoding — ordinal categories only (e.g., small/medium/large → 0/1/2).
One-hot encoding — nominal categories with low cardinality (< ~20 values). Creates binary columns.
Target encoding — replace the category with the mean target value; handles high cardinality but risks target leakage — use cross-validation or add smoothing.
Entity embeddings — learned dense representations for very high cardinality (neural networks).
Interaction features — explicit multiplication/division when domain knowledge suggests a ratio or product is meaningful. Tree models find these automatically; linear models need them made explicit.
Feature selection — SHAP feature importance (model-agnostic), recursive feature elimination (RFE), Pearson/Spearman correlation for linear feature-target relationships.
Curse of dimensionality: many irrelevant features add noise, hurt generalisation, and slow training — especially in distance-based models (KNN, SVM).
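To illustrate the target-encoding-with-smoothing point, a minimal sketch follows. The column names and the smoothing constant are assumptions; in practice the encoding should be fitted inside cross-validation folds so the target of a row never leaks into its own encoding.

```python
# Minimal sketch: smoothed target encoding for a high-cardinality categorical.
import pandas as pd

def target_encode(train, column, target, smoothing=10.0):
    """Replace each category with a smoothed mean of the target."""
    global_mean = train[target].mean()
    stats = train.groupby(column)[target].agg(["mean", "count"])
    # Shrink category means towards the global mean when counts are small.
    smooth = (stats["count"] * stats["mean"] + smoothing * global_mean) / (
        stats["count"] + smoothing
    )
    return train[column].map(smooth)

df = pd.DataFrame({
    "city": ["berlin", "berlin", "paris", "lyon", "paris", "lyon", "berlin"],
    "churned": [1, 0, 1, 1, 0, 1, 0],
})
df["city_encoded"] = target_encode(df, "city", "churned")
print(df)
```

The smoothing term is what keeps rare categories from getting extreme encoded values, which is the practical answer to the leakage and overfitting risk mentioned above.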