ERIC - Search Results

Publication Date

In 2024	6
Since 2023	33
Since 2020 (last 5 years)	103
Since 2015 (last 10 years)	239
Since 2005 (last 20 years)	453

Descriptor

Test Items	346
Item Response Theory	257
Scores	199
Test Construction	197
Models	177
Comparative Analysis	168
Test Reliability	164
Test Validity	161
Simulation	151
Higher Education	136
Statistical Analysis	133
Item Analysis	132
Test Bias	121
Multiple Choice Tests	119
Evaluation Methods	113
Error of Measurement	111
Mathematical Models	111
Achievement Tests	109
Equated Scores	107
Computer Assisted Testing	106
Scoring	105
Measurement Techniques	98
Testing Problems	97
Correlation	96
College Entrance Examinations	89
More ▼

Source

Journal of Educational…

1350

Publication Type

Journal Articles	1072
Reports - Research	641
Reports - Evaluative	284
Reports - Descriptive	83
Speeches/Meeting Papers	38
Opinion Papers	34
Information Analyses	26
Book/Product Reviews	23
Guides - Non-Classroom	6
Tests/Questionnaires	3
Numerical/Quantitative Data	1
Reports - General	1
More ▼

Education Level

Secondary Education	24
Higher Education	20
Postsecondary Education	17
Elementary Secondary Education	11
High Schools	9
Elementary Education	7
Middle Schools	7
Grade 4	3
Grade 8	3
Grade 7	2
Intermediate Grades	2
Junior High Schools	2
Early Childhood Education	1
Grade 1	1
Grade 10	1
Grade 2	1
Grade 3	1
Grade 5	1
Grade 6	1
Grade 9	1
Primary Education	1
More ▼

Audience

Researchers	21
Practitioners	4
Teachers	1

Location

Israel	7
Netherlands	6
United States	5
Canada	4
United Kingdom	3
United Kingdom (England)	3
Australia	2
Belgium	2
China	2
Georgia	2
Hong Kong	2
Ireland	2
Africa	1
California	1
Colombia	1
France	1
Germany	1
Jordan	1
New Jersey	1
New Zealand	1
Rhode Island	1
South Carolina	1
Spain	1
Turkey	1
United Kingdom (Scotland)	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	2
No Child Left Behind Act 2001	2
Defunis v Odegaard	1
Race to the Top	1

What Works Clearinghouse Rating

Journal of Educational Measurement X

Showing 106 to 120 of 1,350 results Save | Export

Pedagogical Considerations for Examining Rater Variability in Rater-Mediated Assessments: A Three-Model Framework

Peer reviewed

Direct link

Wesolowski, Brian C.; Wind, Stefanie A. – Journal of Educational Measurement, 2019

Rater-mediated assessments are a common methodology for measuring persons, investigating rater behavior, and/or defining latent constructs. The purpose of this article is to provide a pedagogical framework for examining rater variability in the context of rater-mediated assessments using three distinct models. The first model is the observation…

Descriptors: Interrater Reliability, Models, Observation, Measurement

Accounting for Rater Effects with the Hierarchical Rater Model Framework When Scoring Simple Structured Constructed Response Tests

Peer reviewed

Direct link

Nieto, Ricardo; Casabianca, Jodi M. – Journal of Educational Measurement, 2019

Many large-scale assessments are designed to yield two or more scores for an individual by administering multiple sections measuring different but related skills. Multidimensional tests, or more specifically, simple structured tests, such as these rely on multiple multiple-choice and/or constructed responses sections of items to generate multiple…

Descriptors: Tests, Scoring, Responses, Test Items

Scoring Stability in a Large-Scale Assessment Program: A Longitudinal Analysis of Leniency/Severity Effects

Peer reviewed

Direct link

Palermo, Corey; Bunch, Michael B.; Ridge, Kirk – Journal of Educational Measurement, 2019

Although much attention has been given to rater effects in rater-mediated assessment contexts, little research has examined the overall stability of leniency and severity effects over time. This study examined longitudinal scoring data collected during three consecutive administrations of a large-scale, multi-state summative assessment program.…

Descriptors: Scoring, Interrater Reliability, Measurement, Summative Evaluation

Modeling Rater Response Processes in Evaluating Score Meaning

Peer reviewed

Direct link

Lane, Suzanne – Journal of Educational Measurement, 2019

Rater-mediated assessments require the evaluation of the accuracy and consistency of the inferences made by the raters to ensure the validity of score interpretations and uses. Modeling rater response processes allows for a better understanding of how raters map their representations of the examinee performance to their representation of the…

Descriptors: Responses, Accuracy, Validity, Interrater Reliability

Conceptualizing Rater Judgments and Rating Processes for Rater-Mediated Assessments

Peer reviewed

Direct link

Wang, Jue; Engelhard, George, Jr. – Journal of Educational Measurement, 2019

Rater-mediated assessments exhibit scoring challenges due to the involvement of human raters. The quality of human ratings largely determines the reliability, validity, and fairness of the assessment process. Our research recommends that the evaluation of ratings should be based on two aspects: a theoretical model of human judgment and an…

Descriptors: Evaluative Thinking, Models, Measurement, Achievement

Predicting Operational Rater-Type Classifications Using Rasch Measurement Theory and Random Forests: A Music Performance Assessment Perspective

Peer reviewed

Direct link

Wesolowski, Brian C. – Journal of Educational Measurement, 2019

The purpose of this study was to build a Random Forest supervised machine learning model in order to predict musical rater-type classifications based upon a Rasch analysis of raters' differential severity/leniency related to item use. Raw scores (N = 1,704) from 142 raters across nine high school solo and ensemble festivals (grades 9-12) were…

Descriptors: Item Response Theory, Prediction, Classification, Artificial Intelligence

Examining the Dual Purpose Use of Student Learning Objectives for Classroom Assessment and Teacher Evaluation

Peer reviewed

Direct link

Briggs, Derek C.; Chattergoon, Rajendra; Burkhardt, Amy – Journal of Educational Measurement, 2019

The process of setting and evaluating student learning objectives (SLOs) has become increasingly popular as an example where classroom assessment is intended to fulfill the dual purpose use of informing instruction and holding teachers accountable. A concern is that the high-stakes purpose may lead to distortions in the inferences about students…

Descriptors: Student Educational Objectives, Student Evaluation, Teacher Evaluation, Scores

Can We Learn from Student Mistakes in a Formative, Reading Comprehension Assessment?

Peer reviewed

Direct link

Liu, Bowen; Kennedy, Patrick C.; Seipel, Ben; Carlson, Sarah E.; Biancarosa, Gina; Davison, Mark L. – Journal of Educational Measurement, 2019

This article describes an ongoing project to develop a formative, inferential reading comprehension assessment of causal story comprehension. It has three features to enhance classroom use: equated scale scores for progress monitoring within and across grades, a scale score to distinguish among low-scoring students based on patterns of mistakes,…

Descriptors: Formative Evaluation, Reading Comprehension, Story Reading, Test Construction

Exploring How to Model Formative Assessment Trajectories of Posing-Pausing-Probing Practices: Toward a Teacher Learning Progressions Framework for the Study of Novice Teachers

Peer reviewed

Direct link

Duckor, Brent; Holmberg, Carrie – Journal of Educational Measurement, 2019

A robust body of evidence supports the finding that particular teaching and assessment strategies in the K-12 classroom can improve student achievement. While experts have identified many effective teaching and learning practices in the assessment for learning literature, teachers' knowledge and use of "high leverage" formative…

Descriptors: Formative Evaluation, Beginning Teachers, Science Teachers, Preservice Teachers

Classroom Assessment and Large-Scale Psychometrics: Shall the Twain Meet? (A Conversation with Margaret Heritage and Neal Kingston)

Peer reviewed

Direct link

Heritage, Margaret; Kingston, Neal M. – Journal of Educational Measurement, 2019

Classroom assessment and large-scale assessment have, for the most part, existed in mutual isolation. Some experts have felt this is for the best and others have been concerned that the schism limits the potential contribution of both forms of assessment. Margaret Heritage has long been a champion of best practices in classroom assessment. Neal…

Descriptors: Measurement, Psychometrics, Context Effect, Classroom Environment

Examining Psychometric Properties and Level Classification of the van Hiele Geometry Test Using CTT and CDM Frameworks

Peer reviewed

Direct link

Chen, Yi-Hsin; Senk, Sharon L.; Thompson, Denisse R.; Voogt, Kevin – Journal of Educational Measurement, 2019

The van Hiele theory and van Hiele Geometry Test have been extensively used in mathematics assessments across countries. The purpose of this study is to use classical test theory (CTT) and cognitive diagnostic modeling (CDM) frameworks to examine psychometric properties of the van Hiele Geometry Test and to compare how various classification…

Descriptors: Geometry, Mathematics Tests, Test Theory, Psychometrics

A General Framework for the Validation of Embedded Formative Assessment

Peer reviewed

Direct link

Hopster-den Otter, Dorien; Wools, Saskia; Eggen, Theo J. H. M.; Veldkamp, Bernard P. – Journal of Educational Measurement, 2019

In educational practice, test results are used for several purposes. However, validity research is especially focused on the validity of summative assessment. This article aimed to provide a general framework for validating formative assessment. The authors applied the argument-based approach to validation to the context of formative assessment.…

Descriptors: Formative Evaluation, Test Validity, Scores, Inferences

Assessing and Validating Effects of a Data-Based Decision-Making Intervention on Student Growth for Mathematics and Spelling

Peer reviewed

Direct link

Keuning, Trynke; van Geel, Marieke; Visscher, Adrie; Fox, Jean-Paul – Journal of Educational Measurement, 2019

Data-based decision making (DBDM) is presumed to improve student performance in elementary schools in all subjects. The majority of studies in which DBDM effects have been evaluated have focused on mathematics. A hierarchical multiple single-subject design was used to measure effects of a 2-year training, in which entire school teams learned how…

Descriptors: Data, Decision Making, Elementary School Students, Mathematics Instruction

Students' Interpretation of Formative Assessment Feedback: Three Claims for Why We Know so Little about Something so Important

Peer reviewed

Direct link

Leighton, Jacqueline P. – Journal of Educational Measurement, 2019

If K-12 students are to be fully integrated as active participants in their own learning, understanding how they interpret formative assessment feedback is needed. The objective of this article is to advance three claims about why teachers and assessment scholars/specialists may have little understanding of students' interpretation of formative…

Descriptors: Elementary Secondary Education, Formative Evaluation, Feedback (Response), Student Attitudes

The Effects of Incomplete Rating Designs in Combination with Rater Effects

Peer reviewed

Direct link

Wind, Stefanie A.; Jones, Eli – Journal of Educational Measurement, 2019

Researchers have explored a variety of topics related to identifying and distinguishing among specific types of rater effects, as well as the implications of different types of incomplete data collection designs for rater-mediated assessments. In this study, we used simulated data to examine the sensitivity of latent trait model indicators of…

Descriptors: Rating Scales, Models, Evaluators, Data Collection

« Previous Page | Next Page »

Pages: 1 | ... | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | ... | 90

Privacy | Copyright | Contact Us | Selection Policy | API

Sinharay, Sandip	20
van der Linden, Wim J.	19
Clauser, Brian E.	16
Dorans, Neil J.	16
Kolen, Michael J.	16
Lee, Won-Chan	16
Linn, Robert L.	16
Wainer, Howard	16
Bridgeman, Brent	12
Livingston, Samuel A.	12
Hambleton, Ronald K.	11
Wang, Wen-Chung	11
Holland, Paul W.	10
von Davier, Alina A.	10
Bennett, Randy Elliot	9
Chang, Hua-Hua	9
Kane, Michael T.	9
Lewis, Charles	9
Moses, Tim	9
Plake, Barbara S.	9
Puhan, Gautam	9
Subkoviak, Michael J.	9
Zwick, Rebecca	9
Brennan, Robert L.	8
More ▼

SAT (College Admission Test)	49
National Assessment of…	27
Graduate Record Examinations	18
Program for International…	14
Iowa Tests of Basic Skills	13
Metropolitan Achievement Tests	9
Advanced Placement…	8
Law School Admission Test	7
California Achievement Tests	6
Comprehensive Tests of Basic…	6
Wechsler Intelligence Scale…	5
General Educational…	4
National Teacher Examinations	4
Stanford Achievement Tests	4
Differential Aptitude Test	3
Peabody Picture Vocabulary…	3
Stanford Binet Intelligence…	3
Test of English as a Foreign…	3
Trends in International…	3
ACT Assessment	2
Indiana Statewide Testing for…	2
Iowa Tests of Educational…	2
Kaufman Assessment Battery…	2
Metropolitan Readiness Tests	2
National Longitudinal Study…	2
More ▼