Data Engineer Interview Questions

The data engineer is an IT professional found in almost every industry. They track data developments and trends to guide the company's future strategies. An essential part of the job is turning raw data into usable data by building data pipelines and data systems.

Common job interview questions for a data engineer and how to answer them

Question 1: Describe in detail your level of expertise in programming languages.

How to answer: Before the interview, review your résumé and list the programs you are proficient in. If you realize that you do not know a piece of software the company relies on heavily, emphasize your motivation and your willingness to learn it.

Question 2: Explain what data engineering means to you.

How to answer: Highlight your role within the company and in relation to other roles such as data scientist, so that your contribution is clearly defined. Explain the difference between an engineer focused on databases and an engineer focused on data pipelines.

Question 3: What is your experience with cloud data management and with Apache Hadoop?

How to answer: Research the cloud data management software the company uses, Apache Hadoop in particular. A data engineer should be proficient in the programming languages and data management systems commonly used in the industry, Apache Hadoop among them.

20,302 data engineer interview questions shared by candidates

Data Center Project Engineer

Interviewed at Amazon

3.5
Aug 31, 2017

Like I said above, come up with as many situations and examples from your career as you can: times you failed at something but learned from it, times you improved something and the impact it had, and times you had to overcome a challenging situation and reached a positive outcome.

Data Engineer Intern

Interviewed at Amazon

3.5
May 5, 2022

1) Data warehousing basics
2) Python coding - intermediate level
3) 2 SQL questions - GROUP BY, RANK() types
4) Resume drill down
5) Tell me about a time you helped a teammate?
6) Hardest data pipeline you have worked on?
7) Difference between STAR, SNOWFLAKE and GALAXY schema
8) SCD and types
9) A recent ETL process you have worked on and challenges you faced in it
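
The report does not give the actual SQL questions, so the following is only a sketch of the kind of GROUP BY and RANK() exercise item 3 might refer to. The `orders` table, its columns, and the sample rows are made up for illustration. It uses Python's built-in `sqlite3` module and assumes the bundled SQLite is version 3.25 or later, which is when window functions were added.

```python
import sqlite3


def rank_customers():
    """Total the orders per customer, then rank the totals three ways."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical table and data, not taken from the interview report.
    conn.execute("CREATE TABLE orders (customer TEXT, amount INTEGER)")
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?)",
        [("ann", 50), ("bob", 30), ("ann", 20), ("cat", 30), ("bob", 40)],
    )
    query = """
        WITH totals AS (
            -- GROUP BY collapses the rows to one total per customer
            SELECT customer, SUM(amount) AS total
            FROM orders
            GROUP BY customer
        )
        SELECT customer,
               total,
               RANK()       OVER (ORDER BY total DESC) AS rnk,        -- ties share a rank, next rank is skipped
               DENSE_RANK() OVER (ORDER BY total DESC) AS dense_rnk,  -- ties share a rank, no gap afterwards
               ROW_NUMBER() OVER (ORDER BY total DESC, customer) AS row_num  -- always unique
        FROM totals
        ORDER BY row_num
    """
    rows = conn.execute(query).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    for row in rank_customers():
        print(row)
```

With this sample data, ann and bob tie on a total of 70, so RANK() gives them both 1 and gives cat 3, while DENSE_RANK() gives cat 2. That gap is usually the point of a "RANK() types" question.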

May 19, 2016

Problem:
- A traveler flies to many cities (airports) in an unbroken chain of flights with no loops, i.e., never revisiting an airport.
- For every flight, she has a boarding pass with only a From (City) and a To (City) printed on it, but no date or time.
- At the end of her journey, she hands you all her boarding passes, but they are shuffled, so you don't know the starting or the ending city.

Can you:
- Write logic or pseudocode to print her whole journey in sequence. It should print e.g. (Starting) City1 -> City2 -> ... -> (Ending) CityX
- State the time complexity of your solution.

Assume:
- You are given a Set of BoardingPass objects as input.
- There could be as many as hundreds of thousands of unique cities/airports.
- Memory is no concern (i.e., you have infinite memory). Optimize for execution time (time complexity).
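
One possible approach, sketched in Python, is to index the passes by origin city in a hash map, find the single origin that never appears as a destination, and follow the map from there. The report does not include a reference solution, so this is not presented as the expected answer. The `BoardingPass` field names `origin` and `destination` are assumptions, since the question only says each pass carries a From and a To city.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class BoardingPass:
    # Assumed shape: the question only specifies a From and a To city.
    origin: str
    destination: str


def reconstruct_journey(passes: Iterable[BoardingPass]) -> List[str]:
    """Return the cities in travel order in O(n) expected time and O(n) space."""
    # Map each origin to its destination: one hash insert per pass.
    next_city = {p.origin: p.destination for p in passes}
    if not next_city:
        return []

    # The starting city is the only origin that is never a destination.
    destinations = set(next_city.values())
    starts = [city for city in next_city if city not in destinations]
    if len(starts) != 1:
        raise ValueError("passes do not form a single unbroken chain")

    # Follow the chain: exactly one hop per boarding pass.
    city = starts[0]
    journey = [city]
    for _ in range(len(next_city)):
        if city not in next_city:
            raise ValueError("passes do not form a single unbroken chain")
        city = next_city[city]
        journey.append(city)
    return journey


if __name__ == "__main__":
    shuffled = {
        BoardingPass("C", "D"),
        BoardingPass("A", "B"),
        BoardingPass("B", "C"),
    }
    print(" -> ".join(reconstruct_journey(shuffled)))
```

Building the map, finding the start, and walking the chain are each a single pass over n boarding passes, so the total is O(n) expected time with hash-based lookups. Sorting-based approaches would cost O(n log n), which is why this sketch trades extra memory for speed, as the problem statement allows.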

Glassdoor has 20,302 interview questions and reports from data engineer interviews.