Sangam: A Confluence of Knowledge Streams

Regression Methods for Categorical Dependent Variables: Effects on a Model of Student College Choice

Show simple item record

dc.contributor Plucker, Jonathan A.
dc.contributor Delandshere, Ginette
dc.creator Rapp, Kelly E.
dc.date 2013-05-15T23:54:07Z
dc.date 2013-05-15T23:54:07Z
dc.date 2013-05-15
dc.date 2012
dc.date.accessioned 2023-02-21T11:18:34Z
dc.date.available 2023-02-21T11:18:34Z
dc.identifier http://hdl.handle.net/2022/15879
dc.identifier.uri http://localhost:8080/xmlui/handle/CUHPOERS/252937
dc.description Thesis (Ph.D.) - Indiana University, School of Education, 2012
dc.description The use of categorical dependent variables with the classical linear regression model (CLRM) violates many of the model's assumptions and may result in biased estimates (Long, 1997; O'Connell, Goldstein, Rogers, & Peng, 2008). Many dependent variables of interest to educational researchers (e.g., professorial rank, educational attainment) are categorical in nature but are analyzed using the CLRM (Harwell & Gatti, 2001) even though alternate regression techniques for categorical dependent variables are recommended (Agresti, 1996; Long, 1997). Data obtained from ACT<super>®</super>, Inc., on 5,200 high school seniors in Illinois and Colorado were used to analyze effects of regression method on a model of ascriptive and academic influences on selectivity of postsecondary institution attended. The dependent variable was measured in rank-ordered categories based on self-reported institutional admissions policies and analyzed with classical linear, multinomial logistic, and ordered logistic regressions. Choice of regression method did not affect overall model performance as evidenced by significant <italic>F</italic> and Likelihood Ratio <italic>&chi;</italic><super>2</super> tests. The full CLRM was fit moderately-well to the data (<italic>R</italic><super>2</super> = .391), surpassing some previous findings (Hearn, 1988, 1991; Davies & Guppy, 1997). McFadden's <italic>R</italic><super>2</super>L measure of strength of association was larger in the multinomial regression than in the ordered regression (<italic>R</italic><super>2</super>L = .191 vs. <italic>R</italic><super>2</super>L = .158). The multinomial logistic method also correctly predicted dependent variable category with the greatest accuracy (46.3% correct), but Somers' <italic>D</italic>yx measure of association was smallest for the multinomial model. Direction and significance of relationship between predictors and the dependent variable was substantively consistent across the CLRM and logistic methods. In all regressions, ACT<super>®</super> score had the most impact on selectivity of institution attended. Threshold values were significant, supporting the assumption of an ordered dependent variable. Due to the CLRM's theoretical and predictive shortcomings and the multinomial model's complexity in interpretation, ordered logistic regression was determined to be the most appropriate for explaining influences on selectivity of postsecondary institution attended.
dc.language en
dc.publisher [Bloomington, Ind.] : Indiana University
dc.subject college selectivity
dc.subject comparative study
dc.subject logistic regression
dc.subject ordinal data
dc.subject student college choice
dc.subject Educational psychology
dc.subject Statistics
dc.subject Higher education
dc.title Regression Methods for Categorical Dependent Variables: Effects on a Model of Student College Choice
dc.type Doctoral Dissertation


Files in this item

Files Size Format View
Rapp_indiana_0093A_11545.pdf 2.462Mb application/pdf View/Open

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse