SQL, Data Architecture and Behavioral
Sr Data Engineer Interview Questions
2,437 sr data engineer interview questions shared by candidates
Kafka , airflow architecture, spark
Python, SQL, systems architecture, idempotency, team fit/goals, dealing with mistakes or unideal systems/processes and how you interact with team(s) and the company.
State and describe the different types of SQL joins. What is referential integrity? With regard to statistics and machine learning, state your knowledge of: p-value, hypothesis testing, overfitting, ensembling. With regard to working on the command-line, demonstrate your knowledge of these tools: awk, cat, cut, grep, sed.
How to store tiled images in an S3 bucket
On cloud services Data lake Streaming data
Describe past projects, What is your background What is your motivation to join the company Why did you use the technology
Explain how you use PySpark for big data processing.
Wie gehst du damit um, wenn du bereits einige Task auf dem Tisch liegen hast und dann der Projektleiter deines Projektes auf dich mit einer "extrem" wichtigen Aufgabe zukommt.
1. Can you provide an overview of your experience working with PySpark and big data processing? 2. What motivated you to specialize in PySpark, and how have you applied it in your previous roles? 3. Explain the basic architecture of PySpark. 4. How does PySpark relate to Apache Spark, and what advantages does it offer in distributed data processing? 5. Describe the difference between a DataFrame and an RDD in PySpark.
Viewing 331 - 340 interview questions