ERIC Number: ED548628
Record Type: Non-Journal
Publication Date: 2012
Pages: 254
Abstractor: As Provided
Reference Count: N/A
ISBN: 978-1-2673-1742-1
Regression Methods for Categorical Dependent Variables: Effects on a Model of Student College Choice
Rapp, Kelly E.
ProQuest LLC, Ph.D. Dissertation, Indiana University
The use of categorical dependent variables with the classical linear regression model (CLRM) violates many of the model's assumptions and may result in biased estimates (Long, 1997; O'Connell, Goldstein, Rogers, & Peng, 2008). Many dependent variables of interest to educational researchers (e.g., professorial rank, educational attainment) are categorical in nature but are analyzed using the CLRM (Harwell & Gatti, 2001) even though alternate regression techniques for categorical dependent variables are recommended (Agresti, 1996; Long, 1997). Data obtained from ACT®, Inc., on 5,200 high school seniors in Illinois and Colorado were used to analyze effects of regression method on a model of ascriptive and academic influences on selectivity of postsecondary institution attended. The dependent variable was measured in rank-ordered categories based on self-reported institutional admissions policies and analyzed with classical linear, multinomial logistic, and ordered logistic regressions. Choice of regression method did not affect overall model performance, as evidenced by significant F and likelihood ratio chi-square tests. The full CLRM fit the data moderately well (R[superscript 2] = 0.391), surpassing some previous findings (Hearn, 1988, 1991; Davies & Guppy, 1997). McFadden's R[superscript 2][subscript L] measure of strength of association was larger in the multinomial regression than in the ordered regression (R[superscript 2][subscript L] = 0.191 vs. R[superscript 2][subscript L] = 0.158). The multinomial logistic method also predicted the dependent variable category with the greatest accuracy (46.3% correct), but Somers' D[subscript yx] measure of association was smallest for the multinomial model. The direction and significance of the relationships between predictors and the dependent variable were substantively consistent across the CLRM and logistic methods. In all regressions, ACT® score had the greatest impact on selectivity of institution attended.
Threshold values were significant, supporting the assumption of an ordered dependent variable. Due to the CLRM's theoretical and predictive shortcomings and the multinomial model's complexity in interpretation, ordered logistic regression was determined to be the most appropriate for explaining influences on selectivity of postsecondary institution attended. [The dissertation citations contained here are published with the permission of ProQuest LLC. Further reproduction is prohibited without permission. Copies of dissertations may be obtained by telephone at 1-800-521-0600. Web page:]
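For readers unfamiliar with the fit measures named in the abstract, the following is a minimal sketch of how McFadden's R[superscript 2][subscript L] and the likelihood ratio chi-square statistic are conventionally computed from a fitted model's log-likelihood and that of an intercept-only (null) model. The function names and the log-likelihood values are hypothetical and chosen only for illustration; they are not taken from the dissertation.

```python
def mcfadden_r2(ll_model: float, ll_null: float) -> float:
    """McFadden's pseudo-R^2: 1 - lnL(fitted model) / lnL(null model)."""
    if ll_null >= 0:
        # For discrete outcomes the null log-likelihood is normally negative.
        raise ValueError("null log-likelihood must be negative")
    return 1.0 - ll_model / ll_null


def likelihood_ratio_chi2(ll_model: float, ll_null: float) -> float:
    """Likelihood ratio statistic: 2 * (lnL(fitted model) - lnL(null model))."""
    return 2.0 * (ll_model - ll_null)


# Hypothetical log-likelihoods (illustrative only, not results from the study).
ll_null = -1000.0   # intercept-only model
ll_model = -800.0   # model with predictors

print(round(mcfadden_r2(ll_model, ll_null), 3))
print(likelihood_ratio_chi2(ll_model, ll_null))
```

Both quantities compare the same two log-likelihoods, so a model with a larger likelihood ratio statistic against the same null model also has a larger McFadden value.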
ProQuest LLC. 789 East Eisenhower Parkway, P.O. Box 1346, Ann Arbor, MI 48106. Tel: 800-521-0600; Web site:
Publication Type: Dissertations/Theses - Doctoral Dissertations
Education Level: Higher Education; Postsecondary Education; High Schools; Secondary Education
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Identifiers - Location: Colorado; Illinois
Identifiers - Assessments and Surveys: ACT Assessment