Describe a scalable and fault-tolerant architecture for ingesting, processing, and storing real-time streaming data from diverse sources, ensuring data integrity and efficient querying for analytics purposes.
Data Engineer Interview Questions
18,773 data engineer interview questions shared by candidates
Implement a CSV parser. Implement a web-scraper.
Multiple SQL Questions
Manager want to analyze the how the promotions on certain products are performing.find how the the percent of promoted sales?
python :average length of words.
Design a dashboard to highlight a certain aspect of the user behaviour
Why do you want to return to Stori?
They asked me to explain how I would design and optimize a cloud-based data pipeline on GCP, including data ingestion, transformation, storage, and performance considerations.
sql: already present in glassdoor. 1)ratio of is_fat and is_someother among total products 2)top 5 single channel companies that spent max amount on promotion. take into account of the ones that have only single entry company , channel; 1 food, water 2 food 3) percentage of sales on first day and last day python: occurrences of a letter in a string, replace none with previous value in a list, from a dictionary get the keys corresponding to nth highest value.
simple sql queries, but hard schema
Viewing 1461 - 1470 interview questions