Data Engineer Interview Questions

18,724 data engineer interview questions shared by candidates

Bellow is a table called fact_daily_users, which contains the users who were active on a specific date with the specific action they have made (aggregate total actions per row). user_id | date | action| total_actions| day_in_row 332 |17/06| view | 1 | 1 332 |20/06| view | 6 | 1 332 |20/06| click | 2 | 1 221 |24/06| view | 4 | 1 221 |24/06| click | 2 | 1 221 |21/06| view | 1 | 2 221 |20/06| view | 1 | 1 332 |21/06| view | 4 | 2 332 |21/06| click | 3 | 2 Q1: write a query that calculates the logic of day_in_row field with SQL, no joins are allowed. "day_in_row" shows consecutive days for user - users that return day after day to the website per row. Q2: write a function (Python / Java) that gets sql query and returns the output in json object
avatar

Data Engineer

Interviewed at Tipalti

3.3
Feb 18, 2023

Bellow is a table called fact_daily_users, which contains the users who were active on a specific date with the specific action they have made (aggregate total actions per row). user_id | date | action| total_actions| day_in_row 332 |17/06| view | 1 | 1 332 |20/06| view | 6 | 1 332 |20/06| click | 2 | 1 221 |24/06| view | 4 | 1 221 |24/06| click | 2 | 1 221 |21/06| view | 1 | 2 221 |20/06| view | 1 | 1 332 |21/06| view | 4 | 2 332 |21/06| click | 3 | 2 Q1: write a query that calculates the logic of day_in_row field with SQL, no joins are allowed. "day_in_row" shows consecutive days for user - users that return day after day to the website per row. Q2: write a function (Python / Java) that gets sql query and returns the output in json object

1) Imagine you are data modeling Netflix and create the entities and relationships involved using a modeling tool. 2) follow a link to a sql tool which shows 4 tables. Run some provided sql to populate the tables, Create the sql to aggregate the data and provide a result set showing the top row.
avatar

Senior Data Engineer

Interviewed at Pluralsight

2.9
Mar 9, 2022

1) Imagine you are data modeling Netflix and create the entities and relationships involved using a modeling tool. 2) follow a link to a sql tool which shows 4 tables. Run some provided sql to populate the tables, Create the sql to aggregate the data and provide a result set showing the top row.

Viewing 461 - 470 interview questions

Glassdoor has 18,724 interview questions and reports from Data engineer interviews. Prepare for your interview. Get hired. Love your job.