Home Directory of Statistical Analyses Ordinal Regression

Ordinal Regression

Ordinal regression predicts the behavior of ordinal level dependent variables using a set of independent variables. The dependent variable is the order response category variable and the independent variable may be categorical or continuous. In SPSS, this test is available on the regression option analysis menu.

Do gender and race influence happiness as categorized by the XYZ survey?

Does age relate to the level of shopping likelihood (not at all likely, somewhat likely, moderately likely, extremely likely)?

Assumptions:

One dependent variable, we cannot use multiple dependent variables.
Parallel lines assumption: There is one regression equation for each category except the last category. You can predict the probability of the last category as 1 minus the probability of the second last category.
Adequate cell count: As per the rule of thumb, 80% of cells must have more than 5 counts. No cell should have Zero count. The greater the cell with less count, the less reliable the chi-square test will be.

Need help conducting your Ordinal Regression? Leverage our 30+ years of experience and low-cost same-day service to complete your results today!

Schedule now using the calendar below.

Key terms and concepts:

Dependent variable: The dependent variable is ordinal. Researchers usually consider the first category as the lowest and the last category as the highest, typically coding them numerically from 0 upwards. SPSS usually uses the logit function to predict the dependent variable category. They use the probit function to predict the dependent variable category when the categories are relatively equal. There is a K-1 predication where K is the number of a category in a dependent variable.

Factor: Researchers must code a factor, a categorical independent variable, numerically in SPSS (e.g., gender coded as 0 = male and 1 = female).

Covariate: Researchers use covariates, continuous independent variables, to predict the dependent variable category (e.g., IQ score).

Link function:

The link function transforms the cumulative probabilities of the dependent ordered variable, enabling model estimation. However, in SPSS, five link functions are available, these link functions are as follows:

Logit function: Logit function is the default function in SPSS for ordinal regression. Researchers usually use this function when the dependent ordinal variable has equal categories. Mathematically, logit function equals to f(x) = log(x / (1 – x)).
Probit model: This is the inverse standard normal cumulative distribution function. This function is more suitable when the dependent variable follows a normal distribution.
Negative log-log f(x) = -log (- log(x)): This link function is recommended when the probability of the lower category is high.
Complementary log-log f(x) = log (- log (1 – x)): This function is inverse of the negative log-log function, it is recommended when the probability of higher category is high.
Cauchit. f(x) = tan (p(x – 0.5)): This link function is used when the extreme values are present in the data.

Statistics and saved variables: The output button in SPSS gives the flexibility to save the output. We can save predicted category, or predicted category probability by selecting this option from the output button.

Parameter estimates, standard errors, significance levels, and confidence intervals:

In the output table of SPSS, a table called ‘parameter estimates’ appears. There is a variable named threshold, which is used for the Intercept term, and the location variable gives the coefficient for the independent variable for the specified link function. The first threshold will be used to predict the probability of the first order. Wald statistics is used to test the significance of the independent variable with degrees of freedom and standard error.

Goodness of fit information: Pearson chi-square test gives the information about how many predicted cell frequencies differ from observed frequencies.

R-square estimate: As in simple linear regression, we cannot use simple r-square in ordinal regression. R-square gives the information about how much variance is explained by the independent variable. However, variance is split into categories. Hence Cox and Snell’s, Nagelkerke’s, and McFadden’s pseudo-R2 statistics will be used in ordinal regression to estimate the variance explained by the independent variable.

*For assistance with conducting an ordinal regression or other quantitative analysis click here.

Resources

Armstrong, B. G., & Sloan, M. (1989). Ordinal regression models for epidemiological data. American Journal of Epidemiology, 129(1), 191-204.

Bender, R., & Benner, A. (2000). Calculating ordinal regression models in SAS and S-Plus. Biometrical Journal, 42(6), 677-699.

Chu, W., & Ghahramani, Z. (2005). Gaussian processes for ordinal regression. Journal of Machine Learning Research, 6, 1019-1041.

Gerhard, T., & Wolfgang, H. (1996). Random effects in ordinal regression models. Computational Statistics and Data Analysis, 22(5), 537-557.

Guisan, A., & Harrell, F. E. (2000). Ordinal response regression models in ecology. Journal of Vegetation Science, 11(5), 617-626.

Hedeker, D., & Gibbons, R. D. (1994). A random-effects ordinal regression model for multilevel analysis. Biometrics, 50(4), 933-944.

Johnson, T. R. (2003). On the use of heterogeneous thresholds ordinal regression models to account for individual differences in response style. Psychometrika, 68(4), 563-583.

Lall, R., Campbell, M. J., Walters, S. J., & Morgan, K. (2002). A review of ordinal regression models applied on health-related quality of life assessments. Statistical Methods in Medical Research, 11(1), 49-67.

McCullagh, P. (1980). Regression models for ordinal data. Journal of the Royal Statistical Society, 42(2), 109-142.

Reynolds, T. J., & Sutrick, K. H. (1986). Assessing the correspondence of one or more vectors to a symmetric matrix using ordinal regression. Psychometrika, 51(1), 101-112.

Toledano, A. Y., & Gatsonis, C. (1998). Ordinal regression methodology for ROC curves derived from correlated data. Statistics in Medicine, 15(16), 1807-1826.

Conduct and Interpret an Ordinal Regression

Regression