Kafka out of sync replicas
Big Data Engineer Interview Questions
898 big data engineer interview questions shared by candidates
Questions on joins and optimizations in PySpark
What are UDFs in Hive and Pig?
Self join
Question 1: Using PySpark, take the data from table1 and load it into another table in the format of table2, as given below. Also do the reverse.

table1:
class  male  female
1      2     1
2      0     2
3      2     0

table2:
class  gender
1      m
1      f
1      m
2      f
2      f
3      m
3      m

Question 2: Using PySpark, sort the dataset by salary and print the row with the third-highest salary for each dept.

dept  salary
1     10
1     20
2     40
2     50
2     90
2     80
1     100
1     70

Question 3: How do offsets work in Kafka? Also write the syntax for it.
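
For the table-reshaping question and the third-highest-salary question above, the underlying logic can be sketched in plain Python, which makes the expected results easy to check by hand. This is only an illustration of the logic, not a PySpark answer: the function names (`wide_to_long`, `long_to_wide`, `nth_highest_per_dept`) are hypothetical, and ties in salary are assumed not to matter. In PySpark the same ideas would typically be expressed with `groupBy(...).pivot(...)` for the long-to-wide direction and a `Window` partitioned by dept with `row_number()` for the ranking.

```python
from collections import Counter, defaultdict

# Data copied from the questions above.
wide = [(1, 2, 1), (2, 0, 2), (3, 2, 0)]  # (class, male, female)
salaries = [(1, 10), (1, 20), (2, 40), (2, 50),
            (2, 90), (2, 80), (1, 100), (1, 70)]  # (dept, salary)


def wide_to_long(rows):
    """Expand each (class, male, female) row into one (class, gender) row per person."""
    out = []
    for cls, male, female in rows:
        out += [(cls, "m")] * male + [(cls, "f")] * female
    return out


def long_to_wide(rows):
    """Count genders per class and return (class, male, female) rows sorted by class."""
    counts = defaultdict(Counter)
    for cls, gender in rows:
        counts[cls][gender] += 1
    # A Counter returns 0 for a gender that never appears in a class.
    return [(cls, c["m"], c["f"]) for cls, c in sorted(counts.items())]


def nth_highest_per_dept(rows, n=3):
    """Return {dept: n-th highest salary}, skipping depts with fewer than n rows."""
    by_dept = defaultdict(list)
    for dept, salary in rows:
        by_dept[dept].append(salary)
    result = {}
    for dept, values in by_dept.items():
        ranked = sorted(values, reverse=True)
        if len(ranked) >= n:
            result[dept] = ranked[n - 1]
    return result


long_rows = wide_to_long(wide)
round_trip = long_to_wide(long_rows)
third = nth_highest_per_dept(salaries)
```

The round trip through `wide_to_long` and `long_to_wide` gives back the original counts, although the row order within a class may differ from the order shown in table2. If tied salaries should share a rank, a dense ranking would be needed instead of a plain sort position.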
Questions were on Big Data technologies for me focused on Spark, SQL, Hive, Python.
All big data concepts, Python programming, and a PySpark program
What are Spark optimisation techniques?
Basic questions on Hadoop and PySpark.
Most of the questions are related to the Technology I have worked on