Here is a list of 10 of the most common data science interview questions for junior positions. Interviewers usually focus on statistics fundamentals and machine learning concepts.
Answer: Its mean and its standard deviation.
Answer: The process of encoding data as a sparse vector in which one element is set to 1 and all other elements are set to 0.
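To make the encoding concrete, here is a minimal sketch in plain Python. The helper name `one_hot` and the color categories are made up for illustration; in practice a library encoder would usually do this job.

```python
def one_hot(value, categories):
    """Return a list with 1 at the position of `value` and 0 everywhere else."""
    if value not in categories:
        raise ValueError(f"unknown category: {value!r}")
    return [1 if category == value else 0 for category in categories]


# Hypothetical categories, chosen only for illustration.
colors = ["red", "green", "blue"]
encoded = [one_hot(color, colors) for color in ["green", "blue", "red"]]
print(encoded)  # [[0, 1, 0], [0, 0, 1], [1, 0, 0]]
```

Each encoded vector has exactly one element set to 1, and its length equals the number of categories.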
Answer: It's the difference between the observed value and the predicted value of the quantity of interest.
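This difference (commonly called a residual) can be computed value by value. The numbers below are invented for illustration and do not come from a real model.

```python
# Hypothetical observed values and model predictions.
observed = [3.0, 5.0, 7.5]
predicted = [2.5, 5.0, 8.0]

# Residual = observed value minus predicted value.
residuals = [y - y_hat for y, y_hat in zip(observed, predicted)]
print(residuals)  # [0.5, 0.0, -0.5]
```

A positive value means the model under-predicted, a negative value means it over-predicted.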
Answer: An ensemble approach that builds many decision trees on the training data and then combines ("averages") their predictions instead of relying on a single tree.
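As a rough sketch of how many trees can be combined into one answer, each tree can cast a vote and the most frequent label wins. The tree outputs below are made-up values, and majority voting is an assumption that fits classification; for regression the predictions would typically be averaged instead.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label predicted by the largest number of trees."""
    return Counter(predictions).most_common(1)[0][0]


# Hypothetical predictions from three separately trained trees.
tree_predictions = ["spam", "ham", "spam"]
print(majority_vote(tree_predictions))  # spam
```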
Answer: It's the process of reducing the number of variables under consideration by obtaining a set of principal components.
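For intuition, here is a small sketch that reduces two variables to one by projecting the data onto its first principal component. It only handles the two-dimensional case, where the direction of largest variance has a closed form; the function name and the sample points are assumptions made for this example, and real work would normally rely on a linear algebra library.

```python
import math


def first_principal_component(points):
    """Return the unit direction of largest variance and the 1-D scores."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    centered = [(x - mean_x, y - mean_y) for x, y in points]

    # Entries of the 2x2 sample covariance matrix.
    var_x = sum(x * x for x, _ in centered) / (n - 1)
    var_y = sum(y * y for _, y in centered) / (n - 1)
    cov_xy = sum(x * y for x, y in centered) / (n - 1)

    # Angle of the major axis of a symmetric 2x2 matrix.
    angle = 0.5 * math.atan2(2 * cov_xy, var_x - var_y)
    direction = (math.cos(angle), math.sin(angle))

    # Project every centered point onto that direction: two variables become one.
    scores = [x * direction[0] + y * direction[1] for x, y in centered]
    return direction, scores


# Hypothetical points lying exactly on the line y = 2x.
points = [(0.0, 0.0), (1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
direction, scores = first_principal_component(points)
print(direction, scores)
```

Because these points lie on a single line, one component captures all of their variance, so nothing is lost by dropping the second one.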
Answer: The "random" part of the term refers to building each of the decision trees from a random selection of features.
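A minimal sketch of that random selection, using only the standard library. The helper name and the feature names are hypothetical, and the sketch stops before any tree is trained; many implementations also resample the training rows and redraw the features at every split.

```python
import random


def random_feature_subsets(feature_names, n_trees, features_per_tree, seed=0):
    """Pick a random subset of features for each tree (hypothetical helper)."""
    rng = random.Random(seed)  # seeded so the sketch is repeatable
    return [rng.sample(feature_names, features_per_tree) for _ in range(n_trees)]


# Hypothetical feature names, for illustration only.
features = ["age", "income", "tenure", "region", "plan"]
subsets = random_feature_subsets(features, n_trees=3, features_per_tree=2)
for tree_index, subset in enumerate(subsets):
    print(tree_index, subset)
```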
Answer: Impute the missing values, predict them from the other variables, or, if only a few values are missing, delete the rows that contain them.
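Two of these options, deleting rows and mean imputation, can be sketched in a few lines. The records and column names are invented for the example, and using the mean as the fill value is just one common choice.

```python
from statistics import mean

# Hypothetical records; None marks a missing value.
rows = [
    {"age": 25, "income": 40000},
    {"age": None, "income": 52000},
    {"age": 31, "income": None},
    {"age": 40, "income": 61000},
]


def drop_incomplete(rows):
    """Keep only the rows that have no missing values."""
    return [row for row in rows if all(v is not None for v in row.values())]


def impute_mean(rows, column):
    """Return new rows where missing entries in `column` get the column mean."""
    fill_value = mean(row[column] for row in rows if row[column] is not None)
    return [
        {**row, column: fill_value if row[column] is None else row[column]}
        for row in rows
    ]


print(len(drop_incomplete(rows)))       # 2
print(impute_mean(rows, "age")[1])      # {'age': 32, 'income': 52000}
```

Dropping rows here throws away half of the data, which is why deletion is usually reserved for cases where only a few values are missing.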
Answer: Unsupervised learning aims to detect patterns in data where no labels are given.
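Clustering is one example of this. The sketch below runs a tiny one-dimensional k-means, an algorithm chosen here for illustration rather than one named above, on made-up numbers with made-up starting centers. No labels are passed in; the two groups emerge from the values alone.

```python
def kmeans_1d(values, centers, iterations=10):
    """Group 1-D values around the given starting centers (minimal sketch)."""
    centers = list(centers)
    for _ in range(iterations):
        # Assignment step: attach each value to its nearest center.
        clusters = [[] for _ in centers]
        for value in values:
            nearest = min(range(len(centers)), key=lambda i: abs(value - centers[i]))
            clusters[nearest].append(value)
        # Update step: move each center to the mean of its cluster.
        centers = [
            sum(cluster) / len(cluster) if cluster else centers[i]
            for i, cluster in enumerate(clusters)
        ]
    return centers, clusters


# Hypothetical unlabeled measurements and starting centers.
values = [1.0, 1.2, 0.8, 9.0, 9.5, 10.0]
centers, clusters = kmeans_1d(values, centers=[0.0, 5.0])
print(clusters)  # [[1.0, 1.2, 0.8], [9.0, 9.5, 10.0]]
```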
Answer: R-Squared is a statistical measure of how close the data are to the fitted regression line. It is also known as the coefficient of determination.
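A short sketch of the usual formula, one minus the residual sum of squares divided by the total sum of squares. The function name and the sample values are assumptions for the example.

```python
def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_observed = sum(observed) / len(observed)
    ss_res = sum((y - y_hat) ** 2 for y, y_hat in zip(observed, predicted))
    ss_tot = sum((y - mean_observed) ** 2 for y in observed)
    return 1 - ss_res / ss_tot


# Hypothetical observations.
observed = [1.0, 2.0, 3.0]
print(r_squared(observed, [1.0, 2.0, 3.0]))  # 1.0, a perfect fit
print(r_squared(observed, [2.0, 2.0, 2.0]))  # 0.0, no better than predicting the mean
```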
Answer: Because it assumes that all of the features in a data set are equally important and independent.
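The independence assumption shows up directly in the arithmetic: the score for a class is its prior multiplied by one likelihood per feature, as if each feature were unrelated to the others. The probabilities, labels, and function name below are all hypothetical, and the sketch assumes Python 3.8 or later for `math.prod`.

```python
import math


def naive_bayes_posteriors(priors, likelihoods, features):
    """Multiply the prior by one likelihood per feature, then normalize."""
    scores = {
        label: prior * math.prod(likelihoods[label][f] for f in features)
        for label, prior in priors.items()
    }
    total = sum(scores.values())
    return {label: score / total for label, score in scores.items()}


# Hypothetical class priors and per-word likelihoods P(word | class).
priors = {"spam": 0.5, "ham": 0.5}
likelihoods = {
    "spam": {"free": 0.8, "meeting": 0.1},
    "ham": {"free": 0.2, "meeting": 0.6},
}
print(naive_bayes_posteriors(priors, likelihoods, ["free", "meeting"]))
```

If two features are strongly correlated, their evidence is effectively counted twice, which is the practical cost of the assumption.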
If you liked these 10 questions, you'll probably love the collection of 197 data science questions that I have put together at datasciencetrivia.com