Find customer sales timeseries whose moving averages have positive slope, except the average should only be over two days, and the lead day changes, and actually it should be (customer, day) pairs, or triples with total sales, we're not sure actually what we want but it's definitely a very complicated SQL query. No. More complicated. What do you mean "use pandas?" Pandas and SQL are the same thing. SQL is column-oriented. I've never heard of KDB. That's something you made up.
Data Scientist Junior Interview Questions
33,936 data scientist junior interview questions shared by candidates
Explain a probability distribution that is not normal and how to apply that.
Review the resume, nothing special
What are the assumptions of linear regression?
Read 4 questions in a Coderpad and solve them with SQL in under 20 minutes
Las DS project
Motivation, Statistik, Maschinelles Lernen, Fallstudie
The interviewer asked about random forest and how it works. When I said that each decision tree in the forest considers a random subset of features, he disrespectfully interrupted me and told me that I am wrong. Then he scolded me for giving the "wrong" answer.
How to reduce number of variables in Logistic regression and random forest?
They asked a lot of questions about my take-home project; in particular wanted to know about the reasons that I took the approach that I did. I could tell they were coming more from a statistical and economics background for the most part; while I'm more of an engineering and machine-learning hacker type standpoint. They also had a lot of "ambiguous" questions; by that, I mean questions about ambiguous business situations I might encounter in this position. Wanted to know how well I would do with ambiguous questions I might get from business leaders.
Viewing 361 - 370 interview questions