About project , architectures ,some basics like partition,bucketing,RDD, Data frames, DAG execution engine, why from Hive to Spark SQL, difference between RDD, DataFrames, Datasets, how to make joins between data frames, what to do in spark job if our infrastructure is limited.
Data Engineer Interview Questions
18,773 data engineer interview questions shared by candidates
Azure Batch Storage and .Net applications
Can you walk us through the process of designing an end-to-end data pipeline, including data ingestion, transformation, and loading (ETL), and how you would ensure its scalability and reliability?
Slowly Changing Dimension Types (DataWarehouse - SCDs)
Considerations to define table structure with billion of records
Architecture and ETL process of my previous employment with an example of End to end ETL flow. Couple of questions on AWS services. Learning spirit Differences between batch processing and stream processing. Questions on Kappa architecture Many more relavent to my previous experience mentioned in CV
What was my previous experience.
Get rows where price decreased compared to the previous date petrol_details (pump, products, price, date)? [ (PUMP-1, Petrol, 100.05, 2025-01-01), (PUMP-1, Petrol, 98.00, 2025-01-02 ), (PUMP-1, Diesel, 90.00. 2025-01-01 ), (PUMP-1, Diesel, 95.00, 2025-01-02 ), (PUMP-2, Petrol, 101.05, 2025-01-01), (PUMP-2, Petrol, 97.99, 2025-01-02 ), (PUMP-2, Diesel, 90.05. 2025-01-01 ), (PUMP-2, Diesel, 86.00, 2025-01-02 )]
Encuentra la relación en el siguiente ejercicio 222 l 11-----11 l 101
Completely into non-technical, but the round was supposed to be technical round 1
Viewing 1611 - 1620 interview questions