Fundamentals of Statistics contains material of various lectures and courses of H. Lohninger on statistics, data analysis and chemometrics......click here for more. |
![]() |
Home ![]() ![]() ![]() |
|
See also: variable selection, Confounded Variables, Distribution of the Correlation Coefficient, Validation of Models | |
Spurious Correlation
In summary, chance correlations have a considerable effect in multivariate models. Thus it is important that the number of variables is low compared to the number of observations. In the literature a rule of thumb is often presented which requires the number of observations to be at least 3 times the number of variables. However, it can easily be shown that this rule of thumb is quite useless, especially when extensive feature selection is taking place. Go to the DataLab to
carry out some trial calculations on your own. Use the mathematical formula
editor to fill a data matrix with random numbers and then try to establish
an MLR model between any number of independent variables and one selected
target variable. Change the number of observations and repeat the experiment
(start with 10 observations, then repeat the experiment with 20 and with
100 observations).
|
|
Home ![]() ![]() ![]() |