SO5032: Lab Materials

Table of Contents

1. Week 12 Lab: Multinomial and Ordinal Regression

1.1. Multinomial logistic regression

Load this BHPS excerpt:

use http://teaching.sociology.ul.ie/so5032/bhpsqual.dta

vote has four categories. Examine bivariate relationships between vote and some of the other variables, and then search for a multinomial logistic regession that makes sense: mlogit vote indvars…. It is better to have a base category that is easily interpretable, and the default here will use the biggest category ("Other/nationalist"). To avoid this, use the baseoutcome(1) option, which will force category one as the base. Use the likelihood-ratio test for each variable, since there are three times as many parameters as usual.

Use predict to generate predicted values (note you have to supply one variable name to hold the prediction for each category). How often is the most probable predicted category the same as the observed one?

Repeat the exercise using qual as the dependent variable. Select the variables carefully: some do not make sense for predicting highest qualification.

1.2. Ordinal logistic regression

qual is an ordinal variable. In your previous analysis, did you observe patterns in the parameter estimates? Search for a good proportional odds ordinal logistic model with syntax such as the following:

ologit qual i.sex age

Compare the ordinal logistic results with the multinomial results you have already produced. Do they tell the same story?

1.3. Additional: Exam performance as ordinal

Using data on exam grades, consider the variable G4.

use http://teaching.sociology.ul.ie/so5032/labs/marksdata, clear

Fit a set of binary logistic regressions, comparing Fail, C and B respectively with A. To do this, create a new variable which is 1 for Fail etc., 0 for A and missing otherwise. Use CAO1 (CAO points divided by 100) and MODSIZE as explanatory variables.

Then fit a mutltinomial logistic regression with G4 as the dependent variable, and the same explanatory variables. Compare your results with the binary regressions.

Finally, noting the ordinal pattern in the parameter estimates in both the binary and multinomial, fit an ordinal logistic regression. Compare your results with the preceeding.

With the results of the ordinal regression, calculate the odds ratio of being higher rather than lower, for a 100-point difference in raw CAO points (a 1-unit difference in CAO1). Do the same for a 100 difference in MODSIZE.

Author: Brendan Halpin

Created: 2023-04-18 Tue 15:35

Validate