- Difference between data frame and dataset? - What is partitioning in DB? - What are CTEs and when to use them? - What do you know about lead/lag functions? - How do you measure the quality of data? - What can you do if find a very slow SQL query? - Did you use parquet files, and why? - How does spark work, when did you need to use it, and how is it compared to other tools you used before? - You mentioned A/B testing, what is it? And why do you use it to achieve your goal? - Did you work with streaming data and frameworks like Kafka? - Are you using DBT to build your transformations? - Did you hear about Adobe Campaign and Audience Manager? - Did you work with looker? - How can you deal with team members who are eager to do and try new things?