
- Multiple linear regression
- Simple linear regression
- Interaction regression
- Coefficient regression
Q: What technique turns one categorical variable into several binary
variables?
- Multiple linear regression
- Overfitting
- One hot encoding
- Adjusted R squared
Q: Which of the following is true regarding variance inflation factors?
Select all that apply.
- The larger the variable inflation factor, the less multicollinearity in the model.
- The minimum value is 0.
- The larger the variable inflation factor, the more multicollinearity in the model.
- The minimum value is 1.
Q: What term represents how the relationship between two independent
variables is associated with changes in the mean of the dependent variable?
- Normality term
- Selection term
- Interaction term
- Coefficient term
Q: Which of the following statements accurately describe adjusted R
squared? Select all that apply.
- It is greater than 1.
- It is a regression evaluation metric.
- It can vary from 0 to 1.
- It penalizes unnecessary explanatory variables.
Q: Which of the following statements accurately describe forward
selection and backward elimination? Select all that apply.
- Forward selection begins with the full model with all possible independent variables.
- Forward selection begins with the full model with all possible dependent variables.
- Forward selection begins with the null model and zero independent variables.
- Backward elimination begins with the full model with all possible independent variables.
Q: A data professional reviews model predictions for a human resources
project. They discover that the model performs poorly on both the training data
and the test holdout data, consistently predicting figures that are too low.
This leads to inaccurate estimates about employee retention. What quality does
this model have too much of?
- Bias
- Entropy
- Variance
- Leakage
Q: What regularization technique completely removes variables that are
less important to predicting the y variable of interest?
- Elastic net regression
- Independent regression
- Lasso regression
- Ridge regression
Q: A data team with a restaurant group uses a regression technique to
learn about customer loyalty and ratings. They estimate the linear relationship
between one continuous dependent variable and two independent variables. What
technique are they using?
- Coefficient regression
- Simple linear regression
- Interaction regression
- Multiple linear regression
Q: A data professional confirms that no two independent variables are
highly correlated with each other. Which assumption are they testing for?
- No multicollinearity assumption
- No linearity assumption
- No normality assumption
- No homoscedasticity assumption
Q: What term represents the relationship for how two variables’ values
affect each other?
- Underfitting term
- Linearity term
- Interaction term
- Feature selection term
Q: Which regression evaluation metric penalizes unnecessary explanatory
variables?
- Holdout sampling
- Adjusted R squared
- Overfitting
- Regression sampling
Q: A data professional tells you that their model fails to adequately
capture the relationship between the target variable and independent variables
because it has too much bias. What is the most likely cause of the bias?
- Underfitting
- Overfitting
- Leakage
- Entropy
Q: What regularization technique minimizes the impact of less relevant
variables, but drops none of the variables from the equation?
- Lasso regression
- Forward regression
- Elastic net regression
- Ridge regression
Q: Fill in the blank: The no multicollinearity assumption states that
no two _____ variables can be highly correlated with each other.
- Dependent
- categorical
- independent
- continuous
Q: Fill in the blank: An interaction term represents how the
relationship between two independent variables is associated with changes in
the _____ of the dependent variable.
- category
- multicollinearity
- assumption
- mean
Q: A data professional uses an evaluation metric that penalizes
unnecessary explanatory variables. Which metric are they using?
- Link function
- Adjusted R squared
- Ordinary least squares
- Holdout sampling
Q: What stepwise variable selection process begins with the full model
with all possible independent variables?
- Forward selection
- Backward elimination
- Extra-sum-of-squares F-test
- Overfit selection
Q: A data analytics team creates a model for a project supporting their
company’s sales department. The model performs very well on the training data,
but it scores much worse when used to predict on new, unseen data. What does
this model have too much of?
- Entropy
- Bias
- Leakage
- Variance
Q: A data professional at a car rental agency uses a regression
technique to learn about how customers engage with various sections of the
company website. They estimate the linear relationship between one continuous
dependent variable and three independent variables. What technique are they
using?
- One hot encoding
- Multiple linear regression
- Simple linear regression
- Interaction terms