What is combinebykey SCD1 logic Different between edge node and data node Where the code will be deployed? (edge node or in cluster) YARN architecture What are all the versions of spark you have worked? Diff btw SchemaRDD and df Different ways to create dataframe what is bundle in oozie? fork action in oozie? distcp command how do you decide number of mappers in sqoop job? what is the optimal number of mappers provided there is no restriction in establishing connection to DB? how to do you pull clob,blob datatype in oracle to HDFS? semi join,anti-join in scala diff between logical plan and physical plan where can we see logical plan?
Consultant Big Data Interview Questions
1,784 consultant big data interview questions shared by candidates
interview questions were mostly from experience and easy.
-Présentez-vous -Pourquoi Airbus -Quelle est le projet dont vous êtes le plus fier -Parmis les 5 qualité d'airbus, laquelle vous correspond le plus (répondre en anglais) -quelque chose à ajouter ?
Q: When did you analyse data?
Why Kubrick?
Design an app that uses more than one data type
Garbage collection & JVM internals. Unique vs primary key. Clustered vs non-clustered indexes.
Reasoning questions include scenario based given 2 statements which of following is true(difficult), picture based( weight problem), one 'for' loop program question,jumbling characters problem, one % based problem. techical include one MR program to print files output based on month and find no of sundays in each month. couple of spark and kafka questions. mostly on rdd . nit sure what it is.
Can you describe some projects that you did at the university?
Two parts - working with API and an advanced SQL question
Viewing 1721 - 1730 interview questions