Task 1: culture

Select only respondents from Estonia.

  1. Specify a model to predict concert attendance by age, gender, community type, difficulties paying bills and the age of education completion.
  2. Interpret the results.
  3. Check if age has a quadratic relationship with the probability of concert attendance.
  4. Visualize the effect of age, interpret the results.
  5. Look at model fit, interpret the results.

Results interpretation:

All coefficients are significant!

Overall

Note on quadratic relationship of age & concert attendance

Looks like there is no any significant and/or strong relationships between those two variables - neither confidence intervals are reliable (-0 is not goot CI at all) or the coefficient value almost at 0 too.

Plot interpretation

From those two observations can be derived simple assumption (which is already confirmed by the model) that with more years of age probability of attending on concerts decreasing very slowly.

Model fit:

Task 2: values

Select only respondents from Finland.

  1. Specify a model to predict the value of achievement by other variables.
  2. Interpret the results
  3. Check if there is an interaction effect between respondent’s age and respondent’s gender.
  4. Visualize the interaction effect, interpret the results.
  5. Look at model fit, interpret the results.
  6. Check if the model has outliers, multicollinearity, non-linear effects, and non-constant error variance.

Results interpretation:

All coefficients are significant except:

Listed countries will be not interpreted in the further model exploration!

Interaction effect truly exists and gender significantly moderates effect of age - as can be seen overall trend with more years of age to values achievements lower but! for males that downtrend is elaborated with much lower slope than for females

R-sqaured and Adjusted R-squared are equal (which seems strange to me because I await for lower adjusted R-squared with such amount of regressors)

They both equals to 29.5-4% of explained variance which is not really cool with that amount of regressors.

Assumptions

Almost any observation looks to be influential - but that highly affected by number of observations.

As can be seen by VIF value for constant variable (intercept) there is multicollinearity for the outcome (but not for numeric regressors).

stim 0.2381 0.005 46.561 0.000 0.228 0.248 hedon 0.1876 0.005 34.642 0.000 0.177 0.198 benev 0.1892 0.007 27.918 0.000 0.176 0.202 intage1 0.0014 0.000 3.198 0.001 0.001 0.002 age -0.0060 0.000 -20.502 0.000 -0.007 -0.005 eduy

No any non-linear relationships for numeric regressors are found.

Both tests indicate violation of homoscedasticity.