Publication Date
In 2024 | 0 |
Since 2023 | 0 |
Since 2020 (last 5 years) | 2 |
Since 2015 (last 10 years) | 2 |
Since 2005 (last 20 years) | 3 |
Descriptor
Foreign Countries | 3 |
Test Items | 3 |
Generalizability Theory | 2 |
Scoring | 2 |
Achievement Tests | 1 |
Computation | 1 |
Credentials | 1 |
Cutting Scores | 1 |
Difficulty Level | 1 |
Goodness of Fit | 1 |
Group Discussion | 1 |
More ▼ |
Source
Applied Measurement in… | 3 |
Author
Andrich, David | 1 |
Bimpeh, Yaw | 1 |
Chis, Liliana | 1 |
Clauser, Brian E. | 1 |
El Masri, Yasmine H. | 1 |
Harik, Polina | 1 |
Harrison, Liz | 1 |
Margolis, Melissa J. | 1 |
McManus, I. C. | 1 |
Mollon, Jennifer | 1 |
Pointer, William | 1 |
More ▼ |
Publication Type
Journal Articles | 3 |
Reports - Evaluative | 2 |
Reports - Research | 1 |
Education Level
Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 1 |
What Works Clearinghouse Rating
Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020
Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…
Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries
El Masri, Yasmine H.; Andrich, David – Applied Measurement in Education, 2020
In large-scale educational assessments, it is generally required that tests are composed of items that function invariantly across the groups to be compared. Despite efforts to ensure invariance in the item construction phase, for a range of reasons (including the security of items) it is often necessary to account for differential item…
Descriptors: Models, Goodness of Fit, Test Validity, Achievement Tests
Clauser, Brian E.; Harik, Polina; Margolis, Melissa J.; McManus, I. C.; Mollon, Jennifer; Chis, Liliana; Williams, Simon – Applied Measurement in Education, 2009
Numerous studies have compared the Angoff standard-setting procedure to other standard-setting methods, but relatively few studies have evaluated the procedure based on internal criteria. This study uses a generalizability theory framework to evaluate the stability of the estimated cut score. To provide a measure of internal consistency, this…
Descriptors: Generalizability Theory, Group Discussion, Standard Setting (Scoring), Scoring