primary key and unique key joins in SQL spark architecture project explanation
Desenvolvedor Hadoop Interview Questions
329 desenvolvedor hadoop interview questions shared by candidates
The most difficult question was about Nagios and exactly where you need to change the contact email.
Hive Serde to log files like log, double delimiter Tuning Partitioning n bucketing Internal external tables File formats Map redice Joins Tuning Etc
Hadoop,Hive and ecosystem , about ptojects and ur role related
Backup and restore of HDF/HDP environments.
How to troubleshoot slow running job in the cluster ?
What is RDD (spark) What r the problem with In u place How do u use global sort in hive and partitioning logics Diff between bucketing and partitioning When will u use this.. Syntax for bucking and partitioning
Interviewer started to ask questions based on what i have kept in resume. His questions started from basic to advanced. What is bucketing and partitioning and use cases? Difference between group and co group in Pig ? Day to day activities in my job? Project flow ? Accumulators and broad cast variables in spark? How spark is fault tolerant? Distributed cache in hadoop? What types of fileformats used in hive? Default mappers in sqoop? How much of data you handle daily or monthly? Why hive and pig used? On Map reduce basic programming? like how you will write program just to test your knowledge on MapReduce. If you store file from HDFS to pig using PIGSTORAGE in grunt shell but if that file is not actually available in that HDFS path will it work?
what is combiner?
HBASE architecture concepts, Spark architecture Concepts, difference between Hbase and Hive, why Hive/Sqoop, concepts of HDFS architecture, current Project, Versions of component, cluster specifications. If you are good at python that would be great.
Viewing 191 - 200 interview questions