Quiz on non-linear regression

Check how much you remember from previous sections by answering the questions below.

Which of the following best describes the difference between a linear and non-linear model in machine learning??

A linear model assumes a direct proportionality between the input and output variables, while a non-linear model does not.

Linear models are more accurate than non-linear models in all cases.

Non-linear models always involve more complex mathematical calculations than linear models.

A non-linear model cannot be used for regression problems.

What is the purpose of splitting a dataset into training and testing sets?

To avoid overfitting by using different data for model training and evaluation.

To ensure that the model sees all the data at least once.

To reduce the size of the dataset to make the model run faster.

To increase the accuracy of the predictions by using smaller datasets.

Which of the following steps is performed immediately after training a machine learning model?

Splitting the dataset.

Evaluating the model using the test set.

Fine-tuning the model.

Predicting on unseen data.

If a regression model has an R² score of 0.85, what does this indicate about the model’s performance?

The model has an error rate of 85%.

The model is overfitting the training data.

The model’s predictions are accurate by 85% on average.

The model explains 85% of the variance in the target variable.

Which of the following evaluation metrics penalizes large prediction errors more heavily?

Mean Absolute Error (MAE)

R-squared (R²)

Root Mean Squared Error (RMSE)

None of the above

What is the main benefit of using cross-validation over a simple train-test split?

Cross-validation ensures that the model will be 100% accurate.

Cross-validation helps to better estimate model performance by using multiple training-test splits.

Cross-validation makes the training process faster.

Cross-validation can only be used for linear models.

What is the main reason for computing spatially stratified metrics in model evaluation?

To get global evaluation scores for the entire dataset.

To account for regional differences and compute performance metrics for each geographical subset.

To increase the accuracy of the model’s predictions.

To ensure that the model is overfitting.

What does the term “spatial leakage” refer to in spatial cross-validation?

The leakage of information between temporal datasets.

When spatial proximity between training and test sets violates the assumption of independence.

The leakage of features from test data into the training data.

The violation of model performance evaluation due to overfitting.

In a Local Indicators of Spatial Association (LISA) analysis, what does a “High-High” cluster of residuals indicate?

A region with consistently high error rates.

A region with over-predicted values.

A region where the model consistently under-predicts.

A region with low error rates.

What does feature importance represent in tree-based models such as Gradient Boosting or Random Forest?

The statistical significance of each feature.

The accuracy of each feature’s predictions.

The raw coefficients assigned to each feature.

The relative contribution of each feature in reducing the model’s loss function.