Questions on system design, Python, and SQL
Data Engineer Interview Questions
18,761 data engineer interview questions shared by candidates
Multiple-choice pandas questions
What is the biggest challenge in your previous work?
Tell me about yourself.
Basic concepts about Data Engineering
Spark optimizations: what optimizations can be made to the code snippet below?

shoppers_df (customer description DF): 250 MB, 15M records.
    schema: StructType = StructType(Array(
        StructField("shopper_id", LongType, nullable = True),
        StructField("retailer_id", StringType, nullable = True),
        StructField("shopper_group_id", StringType, nullable = True),
        StructField("join_date", DateType, nullable = True),
        StructField("shopper_type", StringType, nullable = True),
        StructField("gender", StringType, nullable = True)))

sku_df (dimension DF): 15 MB, 90K records.

purchase_df (transactions DF): 50 GB of compressed Parquet files, 5,000,000,000 records.
    schema: StructType = StructType(Array(
        StructField("shopper_id", LongType, nullable = True),
        StructField("product_id", LongType, nullable = True),
        StructField("pos_id", IntegerType, nullable = True),
        StructField("purchase_date", DateType, nullable = True),
        StructField("units", DoubleType, nullable = True),
        StructField("total_spent", DoubleType, nullable = True)))

Current code:
    products_purchased_df = (
        purchase_df.alias("purchase")
        .join(shoppers_df, on="shopper_id", how="left_outer")
        .join(sku_df.alias("sku"), on="product_id")
        .select(col("purchase.*"), col("sku.*"))
    )

Usage:
    status_df = products_purchased_df.groupBy(["shopper_id", "product_id"]).agg(...)

Task: optimize the join statement.
We will give you a take-home project; you will have to research it and come up with an architecture around it.
Two rounds: (1) an online technical test in multiple-choice question-and-answer format (skip questions that are not relevant); (2) technical questions on current problems the company faced and how you would solve them.
Talk about a project that involved databases.
What are your career goals for the next 5 years?