Statistics - performance metrics, Why GBM, why not xgboost , Differences between GBM and xgboost Then bias, overfitting ,underfitting, Regularisation , Lasso regression( explain). ... Followed by extended questions
Data Scientist Internship Interview Questions
33,930 data scientist internship interview questions shared by candidates
write a code in R/SQL: Given a table with three column, (id, category, value) and each id has 3 or less category (price, size, color). Now, how can I find those id's for which the value of two or more category matches to one another? For eg: ID1 (price 10, size M, color Red), ID2 (price 10, Size L, Color Red) , ID3 (price 15, size L, color Red) Then the output should be two rows: ID1 ID2 and ID2 ID3
- What is over-fitting? How do you avoid it? - What types of regularization do we have? Which one is simpler to use? L1 or L2? - Explain decision trees? What are different metrics to classify dataset? - What is bagging? - We have two models, one with 85% accuracy, one 82%. Which one do you pick? - What is p-value and how can we use it?
Assumption of Linear Regression, etc.
Interviewer: What is the technique to evaluate Logistic regression. I answered Cross entropy. Interviewer: no
Given measurements of acceleration taken from a wristband, with second by second acceleration in the X, Y, Z axis, how would you predict if the person wearing the wrist is sitting, walking or just standing?
Mettez vous à jour des procédures?
Alice ha 9 monete, Bob ha 8 monete. Quale è la probabilità che Alice ottenga più teste di Bob?
How to measure distance between data point?
Let's say you finished developing a machine learning model and you are getting 95% accuracy. Should you be happy?
Viewing 321 - 330 interview questions