Mercer, Sterett H.; Dufrene, Brad A.; Zoder-Martell, Kimberly; Harpole, Lauren Lestremau; Mitchell, Rachel R.; Blaze, John T. – Assessment for Effective Intervention, 2012
Despite growing use of CBM Maze in universal screening and research, little information is available regarding the number of CBM Maze probes needed for reliable decisions. The current study extends existing research on the technical adequacy of CBM Maze by investigating the number of probes and assessment durations (1-3 min) needed for reliable…
Descriptors: Generalizability Theory, Curriculum Based Assessment, Reading Tests, Cloze Procedure
Zhou, Hong; Muellerleile, Paige; Ingram, Debra; Wong, Seok P. – Journal of Educational and Behavioral Statistics, 2011
Intraclass correlation coefficients (ICCs) are commonly used in behavioral measurement and psychometrics when a researcher is interested in the relationship among variables of a common class. The formulas for deriving ICCs, or generalizability coefficients, vary depending on which models are specified. This article gives the equations for…
Descriptors: Computation, Statistical Analysis, Generalizability Theory, Correlation
Williams, Judith C.; Alwis, W. A. M.; Rotgans, Jerome I. – Advances in Health Sciences Education, 2011
The purpose of this study was to investigate the stability of three distinct tutor behaviors, namely (1) use of subject-matter expertise, (2) social congruence, and (3) cognitive congruence, in a problem-based learning (PBL) environment. The data comprised the input from 16,047 different students to a survey of 762 tutors administered in three consecutive…
Descriptors: Expertise, Generalizability Theory, Tutor Training, Problem Based Learning
Sao Pedro, Michael A.; Baker, Ryan S. J. d.; Gobert, Janice D. – Grantee Submission, 2013
When validating assessment models built with data mining, generalization is typically tested at the student-level, where models are tested on new students. This approach, though, may fail to find cases where model performance suffers if other aspects of those cases relevant to prediction are not well represented. We explore this here by testing if…
Descriptors: Educational Research, Data Collection, Data Analysis, Generalizability Theory
Gugiu, Mihaiela R.; Gugiu, Paul C.; Baldus, Robert – Journal of MultiDisciplinary Evaluation, 2012
Background: Educational researchers have long espoused the virtues of writing with regard to student cognitive skills. However, research on the reliability of the grades assigned to written papers reveals a high degree of contradiction, with some researchers concluding that the grades assigned are very reliable whereas others suggest that they…
Descriptors: Grades (Scholastic), Grading, Scoring Rubrics, Research Design
Crits-Christoph, Paul; Gibbons, Mary Beth Connolly; Hamilton, Jessica; Ring-Kurtz, Sarah; Gallop, Robert – Journal of Consulting and Clinical Psychology, 2011
Objective: To examine the dependability of alliance scores at the patient and therapist level, to evaluate the potential causal direction of session-to-session changes in alliance and depressive symptoms, and to investigate the impact of aggregating the alliance over progressively more sessions on the size of the alliance-outcome relationship.…
Descriptors: Counselor Client Relationship, Generalizability Theory, Patients, Psychotherapy
Raymond, Mark R.; Harik, Polina; Clauser, Brian E. – Applied Psychological Measurement, 2011
Prior research indicates that the overall reliability of performance ratings can be improved by using ordinary least squares (OLS) regression to adjust for rater effects. The present investigation extends previous work by evaluating the impact of OLS adjustment on standard errors of measurement (SEM) at specific score levels. In…
Descriptors: Performance Based Assessment, Licensing Examinations (Professions), Least Squares Statistics, Item Response Theory
Rantanen, Pekka – Assessment & Evaluation in Higher Education, 2013
A multilevel analysis approach was used to analyse students' evaluation of teaching (SET). The low value of inter-rater reliability stresses that any solid conclusions on teaching cannot be made on the basis of single feedbacks. To assess a teacher's general teaching effectiveness, one needs to evaluate four randomly chosen course implementations.…
Descriptors: Test Reliability, Feedback (Response), Generalizability Theory, Student Evaluation of Teacher Performance
Chafouleas, Sandra M.; Briesch, Amy M.; Riley-Tillman, T. Chris; Christ, Theodore J.; Black, Anne C.; Kilgus, Stephen P. – Journal of School Psychology, 2010
A total of 4 raters, including 2 teachers and 2 research assistants, used Direct Behavior Rating Single Item Scales (DBR-SIS) to measure the academic engagement and disruptive behavior of 7 middle school students across multiple occasions. Generalizability study results for the full model revealed modest to large magnitudes of variance associated…
Descriptors: Middle School Students, Generalizability Theory, Research Assistants, Teachers
Volpe, Robert J.; Briesch, Amy M. – School Psychology Review, 2012
Direct behavior rating (DBR) has been described as a hybrid of systematic direct observation and behavior rating scales. Although single-item (DBR-SIS) and multi-item (DBR-MIS) methods have been advocated, the overwhelming majority of research attention has focused on DBR-SIS. This study employed generalizability theory to compare the…
Descriptors: Video Technology, Behavior Rating Scales, Student Behavior, Graduate Students
Shaughnessy, Michael F.; Valdez, Gilbert – Gifted and Talented International, 2012
In the lead article, Persson (2012a) focuses on salient issues that have not as yet been addressed by others, and which are relevant and germane. With the advent of the Internet, the web, and e-mail, conversation and discussion among scholars have increased tremendously. At the current time, researchers are able to share their data, their thoughts…
Descriptors: Multicultural Education, Cultural Context, Ethnocentrism, Research Methodology
Hill, Heather C.; Charalambous, Charalambos Y.; Kraft, Matthew A. – Educational Researcher, 2012
In recent years, interest has grown in using classroom observation as a means to several ends, including teacher development, teacher evaluation, and impact evaluation of classroom-based interventions. Although education practitioners and researchers have developed numerous observational instruments for these purposes, many developers fail to…
Descriptors: Generalizability Theory, Observation, Classroom Observation Techniques, Evaluation
Shin, Yongyun; Raudenbush, Stephen W. – Psychometrika, 2012
Social scientists are frequently interested in assessing the qualities of social settings such as classrooms, schools, neighborhoods, or day care centers. The most common procedure requires observers to rate social interactions within these settings on multiple items and then to combine the item responses to obtain a summary measure of setting…
Descriptors: Generalizability Theory, Neighborhoods, Intervals, Child Care Centers
Kachchaf, Rachel; Solano-Flores, Guillermo – Applied Measurement in Education, 2012
We examined how rater language background affects the scoring of short-answer, open-ended test items in the assessment of English language learners (ELLs). Four native English and four native Spanish-speaking certified bilingual teachers scored 107 responses of fourth- and fifth-grade Spanish-speaking ELLs to mathematics items administered in…
Descriptors: Error of Measurement, English Language Learners, Scoring, Bilingual Teachers
Lengh, Carolyn J. – ProQuest LLC, 2010
This study compares the dependability of four classroom assessment scoring methods. Generalizability theory (G) and alternative decision (D) are used to measure the results of students' classroom assessment scores and compare the results of the four scoring methods on variability of rater by person variance and the level of G and D coefficients…
Descriptors: Generalizability Theory, Scoring, Social Studies, Tests