Data Scientist Interview Questions

During an interview for a data scientist position, employers will ask questions that let them assess your skills in data modeling, problem solving, and programming. Be prepared to answer general questions that test your knowledge of statistics and data science. You should also be ready for open-ended questions that test your creativity, your communication skills, and your formal training in data modeling and programming.

Common data scientist interview questions and how to answer them

Question 1: Which data modeling techniques do you prefer, and why?

How to answer: Turning data into understandable, actionable information is a critical part of a data scientist's job. This question helps employers understand your data modeling skills and your background. List and describe the data modeling techniques you prefer, including their advantages, such as ease of use and flexibility.

Question 2: How would you detect fake Instagram accounts used to scam customers?

How to answer: This type of question lets an employer test your problem-solving skills. When answering open-ended questions like this one, don't hesitate to ask for clarification and to use a whiteboard to demonstrate your programming and charting skills. Share your thought process as you work through the problem.

Question 3: Describe situations that call for a list, a tuple, or a set in Python.

How to answer: Interviewers ask this type of question to test your Python programming skills. Review Python fundamentals such as lists, tuples, and sets before your interview. You should be able to explain when and how data scientists use each one.
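
As a quick refresher for Question 3, the sketch below shows one typical use for each container. The variable names and sample values are made up for illustration; they are not part of the original question.

```python
# list: ordered, mutable, duplicates allowed. A natural fit for a growing
# time series where order and repeated values both matter.
daily_clicks = [120, 135, 120, 160]
daily_clicks.append(142)  # new observation arrives

# tuple: fixed-size and immutable, so it is hashable and can serve as a
# dictionary key (here, a hypothetical latitude/longitude pair).
location = (48.8566, 2.3522)
visits_by_location = {location: 3}

# set: unordered collection of unique items. Handy for de-duplicating IDs
# and for fast membership tests.
user_ids = [7, 3, 7, 9, 3]
unique_users = set(user_ids)
is_known_user = 9 in unique_users
```

In an interview answer, the point to stress is the trade-off: lists when order and mutation matter, tuples for fixed records and hashable keys, sets for uniqueness and membership checks.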

54,205 data scientist interview questions shared by candidates

Business sense: There are two types of cars, A and B. The same number of people in the US drive each type, and they drive the same distance each month. There are two new technologies, X and Y, of equal cost. Applying X would raise A's fuel economy from 50 mpg to 75 mpg; applying Y would raise B's from 10 mpg to 11 mpg. The goal is to reduce dependence on foreign oil, that is, to reduce gasoline consumption. Question: which technology would you apply? Follow-up question: after applying the technology of your choice, assume money is available for research on new technology. Which car would you choose to research?

Data Scientist

Interviewed at Meta

3.6
Jan 7, 2015
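
One way to reason about the car question is to compare gallons burned rather than mpg, since fuel use is miles divided by mpg. The sketch below is only an illustration: the 1,000 miles per month figure and the helper name `gallons_used` are assumptions, not part of the question.

```python
from fractions import Fraction


def gallons_used(miles, mpg):
    """Fuel burned for a given distance: gallons = miles / mpg."""
    return Fraction(miles) / Fraction(mpg)


MILES = 1000  # assumed monthly distance; identical for both cars per the question

# Technology X: car A goes from 50 mpg to 75 mpg.
saved_x = gallons_used(MILES, 50) - gallons_used(MILES, 75)

# Technology Y: car B goes from 10 mpg to 11 mpg.
saved_y = gallons_used(MILES, 10) - gallons_used(MILES, 11)

print(f"X saves {float(saved_x):.2f} gal, Y saves {float(saved_y):.2f} gal")
```

Under these assumptions Y saves more fuel per car (100/11 gallons versus 20/3 gallons per 1,000 miles), even though its mpg gain looks smaller. The same arithmetic suggests one possible answer to the follow-up: after Y is applied, car B still burns far more fuel than car A, so further gains on B would likely matter more.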

SQL, part 1
Question 1: You have a table with columns user_id, action (post, edit, cancel), date. What is the post rate?
Question 2: You also have a table of active users with columns user_id, country, active, date. What is the average post rate yesterday for all active users, by country?

Data Scientist

Interviewed at Meta

3.6
Aug 7, 2019
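
The question leaves "post rate" undefined, so the sketch below assumes it means the share of actions that are posts. It runs the SQL through Python's built-in `sqlite3` module; the table names `actions` and `active_users`, the sample rows, and the fixed "yesterday" date are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE actions (user_id INTEGER, action TEXT, date TEXT);
CREATE TABLE active_users (user_id INTEGER, country TEXT, active INTEGER, date TEXT);
INSERT INTO actions VALUES
  (1, 'post', '2019-08-06'), (1, 'edit', '2019-08-06'),
  (2, 'post', '2019-08-06'), (2, 'post', '2019-08-06'),
  (3, 'cancel', '2019-08-06'), (3, 'post', '2019-08-05');
INSERT INTO active_users VALUES
  (1, 'US', 1, '2019-08-06'), (2, 'US', 1, '2019-08-06'),
  (3, 'FR', 1, '2019-08-06');
""")

YESTERDAY = "2019-08-06"  # fixed so the example is reproducible

# Question 1 (assumed definition): posts divided by all actions.
overall_post_rate = conn.execute("""
    SELECT AVG(CASE WHEN action = 'post' THEN 1.0 ELSE 0.0 END)
    FROM actions
""").fetchone()[0]

# Question 2: per-user post rate for yesterday, then averaged by country.
# The inner join drops active users with no actions that day; a LEFT JOIN
# would be needed if they should count as a rate of zero.
by_country = conn.execute("""
    SELECT country, AVG(user_rate) AS avg_post_rate
    FROM (
        SELECT u.user_id, u.country,
               AVG(CASE WHEN a.action = 'post' THEN 1.0 ELSE 0.0 END) AS user_rate
        FROM active_users AS u
        JOIN actions AS a
          ON a.user_id = u.user_id AND a.date = u.date
        WHERE u.date = ? AND u.active = 1
        GROUP BY u.user_id, u.country
    ) AS per_user
    GROUP BY country
    ORDER BY country
""", (YESTERDAY,)).fetchall()

print(overall_post_rate, by_country)
```

Stating the chosen definition out loud before writing the query is usually worth more than the query itself, because averaging per user and pooling all actions give different numbers.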

Let's say the population on Facebook clicks ads with a click-through rate of P. We select a sample of size N and examine the sample's conversion rate, denoted by hat{P}. What is the minimum sample size N such that Probability( ABS(hat{P} - P) < DELTA ) = 95%? In other words (this is my translation), find the minimum sample size N such that our sample estimate hat{P} is within DELTA of the true click-through rate P, with 95% confidence.

Data Scientist, Analytics

Interviewed at Meta

3.6
Mar 6, 2015
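
A common way to approach this question is the normal approximation to the binomial, which gives N >= z^2 * P * (1 - P) / DELTA^2 with z close to 1.96 for 95% confidence. The sketch below assumes that approximation is acceptable and, when P is unknown, falls back on the worst case P = 0.5. The function name `min_sample_size` is made up for illustration.

```python
import math
from statistics import NormalDist


def min_sample_size(delta, p=0.5, confidence=0.95):
    """Smallest N with P(|p_hat - p| < delta) >= confidence, under the
    normal approximation p_hat ~ Normal(p, p * (1 - p) / N).

    p defaults to 0.5, which maximizes p * (1 - p) and so gives a
    conservative N when the true rate is unknown.
    """
    # Two-sided critical value, about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    return math.ceil(z * z * p * (1 - p) / (delta * delta))


print(min_sample_size(0.05))  # 385
print(min_sample_size(0.01))  # 9604
```

Because N scales with 1 / DELTA^2, halving the tolerated error roughly quadruples the required sample. For very small click-through rates the normal approximation can be poor, so an exact binomial method may be worth mentioning in an interview.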

Given two binary strings, write a function that adds them. You are not allowed to use any built-in string-to-int conversions or parsing tools. E.g., given "100" and "111", you should return "1011". What is the time and space complexity of your algorithm?

Data Scientist, Analytics

Interviewed at Meta

3.6
Mar 6, 2015
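
One possible solution is schoolbook addition from the rightmost digit with a carry, comparing characters directly so no string-to-int conversion is involved. This is a sketch rather than the expected answer; the function name `add_binary` is an assumption.

```python
def add_binary(a: str, b: str) -> str:
    """Add two binary strings digit by digit, right to left, with a carry."""
    i, j = len(a) - 1, len(b) - 1
    carry = 0
    digits = []  # result digits, least significant first

    while i >= 0 or j >= 0 or carry:
        total = carry
        if i >= 0:
            total += 1 if a[i] == "1" else 0  # character test, no int() call
            i -= 1
        if j >= 0:
            total += 1 if b[j] == "1" else 0
            j -= 1
        digits.append("1" if total % 2 else "0")
        carry = total // 2

    # Empty inputs produce no digits; treat that case as zero.
    return "".join(reversed(digits)) or "0"


print(add_binary("100", "111"))  # 1011
```

With n and m the input lengths, this runs in O(max(n, m)) time and uses O(max(n, m)) extra space for the output digits.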

We have a table called ad_accounts(account_id, date, status). Status can be active/closed/fraud. A) what percent of active accounts are fraud? B) How many accounts became fraud today for the first time? C) What would be the financial impact of letting fraud accounts become active (how would you approach this question)?

Data Scientist, Analytics

Interviewed at Meta

3.6
Mar 6, 2019
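
Parts A and B can be sketched in SQL, run here through Python's built-in `sqlite3` module. The question is ambiguous, so this sketch makes two loud assumptions: the table holds one row per account per day, and part A means "of accounts that were ever active, what share was ever marked fraud". The sample rows and the fixed "today" date are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE ad_accounts (account_id INTEGER, date TEXT, status TEXT);
INSERT INTO ad_accounts VALUES
  (1, '2019-03-05', 'active'), (1, '2019-03-06', 'fraud'),
  (2, '2019-03-05', 'fraud'),  (2, '2019-03-06', 'fraud'),
  (3, '2019-03-05', 'active'), (3, '2019-03-06', 'active'),
  (4, '2019-03-05', 'active'), (4, '2019-03-06', 'closed');
""")

TODAY = "2019-03-06"  # fixed so the example is reproducible

# A) Among accounts that were ever active, the percentage ever marked fraud.
pct_active_to_fraud = conn.execute("""
    SELECT 100.0 * SUM(ever_fraud) / COUNT(*)
    FROM (
        SELECT account_id,
               MAX(CASE WHEN status = 'fraud' THEN 1 ELSE 0 END) AS ever_fraud
        FROM ad_accounts
        GROUP BY account_id
        HAVING MAX(CASE WHEN status = 'active' THEN 1 ELSE 0 END) = 1
    ) AS ever_active
""").fetchone()[0]

# B) Accounts whose earliest fraud date is today.
new_fraud_today = conn.execute("""
    SELECT COUNT(*)
    FROM (
        SELECT account_id
        FROM ad_accounts
        WHERE status = 'fraud'
        GROUP BY account_id
        HAVING MIN(date) = ?
    ) AS first_fraud
""", (TODAY,)).fetchone()[0]

print(pct_active_to_fraud, new_fraud_today)
```

Part C is an approach question rather than a query. It would need data that this table does not contain, such as ad spend, chargebacks, or refunds per account, so it is left out of the sketch.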
