Showing 121 to 135 of 1,350 results
Peer reviewed
Peabody, Michael R.; Wind, Stefanie A. – Journal of Educational Measurement, 2019
Setting performance standards is a judgmental process involving human opinions and values as well as technical and empirical considerations. Although all cut score decisions are by nature somewhat arbitrary, they should not be capricious. Judges selected for standard-setting panels should have the proper qualifications to make the judgments asked…
Descriptors: Standard Setting, Decision Making, Performance Based Assessment, Evaluators
Peer reviewed
Berger, Stéphanie; Verschoor, Angela J.; Eggen, Theo J. H. M.; Moser, Urs – Journal of Educational Measurement, 2019
Calibration of an item bank for computer adaptive testing requires substantial resources. In this study, we investigated whether the efficiency of calibration under the Rasch model could be enhanced by improving the match between item difficulty and student ability. We introduced targeted multistage calibration designs, a design type that…
Descriptors: Simulation, Computer Assisted Testing, Test Items, Difficulty Level
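The efficiency gain that Berger et al. investigate follows directly from the Rasch model's standard form: an item contributes the most Fisher information when its difficulty matches the examinee's ability. A minimal sketch using the textbook Rasch formulas (not the authors' code or design):

```python
import math

def rasch_prob(theta, b):
    """Probability of a correct response under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def item_information(theta, b):
    """Fisher information of one Rasch item at ability theta: p * (1 - p)."""
    p = rasch_prob(theta, b)
    return p * (1.0 - p)

# Information peaks when item difficulty equals student ability,
# which is why targeted (multistage) calibration designs can make
# each response more informative for item parameter estimation.
matched = item_information(theta=0.0, b=0.0)     # 0.25, the maximum
mismatched = item_information(theta=0.0, b=2.0)  # roughly 0.105
```

This is why routing students toward items near their ability level, as in a targeted multistage design, can reduce the sample sizes needed for calibration.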
Peer reviewed
Ip, Edward H.; Strachan, Tyler; Fu, Yanyan; Lay, Alexandra; Willse, John T.; Chen, Shyh-Huei; Rutkowski, Leslie; Ackerman, Terry – Journal of Educational Measurement, 2019
Test items must often be broad in scope to be ecologically valid. It is therefore almost inevitable that secondary dimensions are introduced into a test during test development. A cognitive test may require one or more abilities besides the primary ability to correctly respond to an item, in which case a unidimensional test score overestimates the…
Descriptors: Test Items, Test Bias, Test Construction, Scores
Peer reviewed
Ju, Unhee; Falk, Carl F. – Journal of Educational Measurement, 2019
We examined the feasibility and results of a multilevel multidimensional nominal response model (ML-MNRM) for measuring both substantive constructs and extreme response style (ERS) across countries. The ML-MNRM considers within-country clustering while allowing overall item slopes to vary across items and examination of whether certain items were…
Descriptors: Cross Cultural Studies, Self Efficacy, Item Response Theory, Item Analysis
Peer reviewed
Zhang, Xue; Tao, Jian; Wang, Chun; Shi, Ning-Zhong – Journal of Educational Measurement, 2019
Model selection is important in any statistical analysis, and the primary goal is to find the preferred (or most parsimonious) model from a set of candidate models, based on certain criteria, given the data. Several recent publications have employed the deviance information criterion (DIC) for model selection among different forms of multilevel item…
Descriptors: Bayesian Statistics, Item Response Theory, Measurement, Models
Peer reviewed
Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2019
One common phenomenon in Angoff standard setting is that panelists regress their ratings in toward the middle of the probability scale. This study describes two indices based on taking ratios of standard deviations that can be utilized with a scatterplot of item ratings versus expected probabilities of success to identify whether ratings are…
Descriptors: Item Analysis, Standard Setting, Probability, Feedback (Response)
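The indices Wyse and Babcock describe are built from ratios of standard deviations of item ratings versus expected success probabilities. The sketch below illustrates the underlying idea with hypothetical data; it is not the authors' exact indices:

```python
import statistics

def sd_ratio(ratings, expected_probs):
    """Ratio of the spread of panelist ratings to the spread of
    model-expected probabilities of success. Values well below 1
    suggest ratings regressed toward the middle of the scale."""
    return statistics.stdev(ratings) / statistics.stdev(expected_probs)

expected = [0.15, 0.35, 0.55, 0.75, 0.95]     # expected probabilities
regressed = [0.45, 0.50, 0.55, 0.60, 0.65]    # compressed toward the middle
ratio = sd_ratio(regressed, expected)          # well below 1 here
```

Paired with a scatterplot of ratings against expected probabilities, a low ratio flags panelists whose judgments under-discriminate between easy and hard items.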
Peer reviewed
Wang, Wenyi; Song, Lihong; Chen, Ping; Ding, Shuliang – Journal of Educational Measurement, 2019
Most existing classification accuracy indices for attribute patterns lose effectiveness when response data are absent in diagnostic testing. To address this issue, this article proposes new indices for predicting the correct classification rate of a diagnostic test before administering the test under the deterministic noise input…
Descriptors: Cognitive Tests, Classification, Accuracy, Diagnostic Tests
Peer reviewed
Svetina, Dubravka; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2019
This study investigates the effect of several design and administration choices on item exposure and person/item parameter recovery under a multistage test (MST) design. In a simulation study, we examine whether number-correct (NC) or item response theory (IRT) methods are differentially effective at routing students to the correct next stage(s)…
Descriptors: Measurement, Item Analysis, Test Construction, Item Response Theory
Peer reviewed
Wu, Qian; De Laet, Tinne; Janssen, Rianne – Journal of Educational Measurement, 2019
Single-best answers to multiple-choice items are commonly dichotomized into correct and incorrect responses, and modeled using either a dichotomous item response theory (IRT) model or a polytomous one if differences among all response options are to be retained. The current study presents an alternative IRT-based modeling approach to…
Descriptors: Multiple Choice Tests, Item Response Theory, Test Items, Responses
Peer reviewed
Wind, Stefanie A.; Sebok-Syer, Stefanie S. – Journal of Educational Measurement, 2019
When practitioners use modern measurement models to evaluate rating quality, they commonly examine rater fit statistics that summarize how well each rater's ratings fit the expectations of the measurement model. Essentially, this approach involves examining the unexpected ratings that each misfitting rater assigned (i.e., carrying out analyses of…
Descriptors: Measurement, Models, Evaluators, Simulation
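Rater fit statistics of the kind Wind and Sebok-Syer examine are typically mean squares of standardized residuals between observed and model-expected ratings. A generic sketch of an unweighted (outfit-style) mean square, not necessarily the exact statistics studied:

```python
def outfit_mean_square(observed, expected, variance):
    """Unweighted mean-square fit statistic: the average squared
    standardized residual across a rater's ratings. Values near 1
    indicate ratings consistent with the measurement model."""
    z_sq = [(o - e) ** 2 / v for o, e, v in zip(observed, expected, variance)]
    return sum(z_sq) / len(z_sq)

# Ratings that exactly match expectations give 0 (muted/overfitting);
# large surprising deviations push the statistic well above 1 (misfit).
good = outfit_mean_square([3, 2, 4, 3], [3.0, 2.0, 4.0, 3.0], [0.5] * 4)
misfit = outfit_mean_square([4, 1, 4, 1], [2.5] * 4, [1.0] * 4)
```

Examining which specific ratings drive a large mean square is the "analysis of unexpected ratings" the abstract refers to.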
Peer reviewed
Man, Kaiwen; Harring, Jeffrey R.; Sinharay, Sandip – Journal of Educational Measurement, 2019
Data mining methods have drawn considerable attention across diverse scientific fields. However, few applications could be found in the areas of psychological and educational measurement, and particularly pertinent to this article, in test security research. In this study, various data mining methods for detecting cheating behaviors on large-scale…
Descriptors: Information Retrieval, Data Analysis, Identification, Tests
Peer reviewed
Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019
Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…
Descriptors: Item Response Theory, Models, Scores, Comparative Analysis
Peer reviewed
Zhang, Zhonghua; Zhao, Mingren – Journal of Educational Measurement, 2019
The present study evaluated the multiple imputation method, a procedure that is similar to the one suggested by Li and Lissitz (2004), and compared the performance of this method with that of the bootstrap method and the delta method in obtaining the standard errors for the estimates of the parameter scale transformation coefficients in item…
Descriptors: Item Response Theory, Error Patterns, Item Analysis, Simulation
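The bootstrap comparison in Zhang and Zhao's study can be illustrated for a simple scale transformation coefficient, the mean-sigma slope. The sketch below uses hypothetical item difficulty estimates on two scales; it shows the bootstrap baseline, not the multiple imputation procedure itself:

```python
import random
import statistics

def mean_sigma_slope(b_old, b_new):
    """Mean-sigma estimate of the scale transformation slope."""
    return statistics.stdev(b_new) / statistics.stdev(b_old)

def bootstrap_se(b_old, b_new, n_boot=2000, seed=1):
    """Bootstrap standard error of the slope: resample item pairs
    with replacement and take the SD of the replicated estimates."""
    rng = random.Random(seed)
    n = len(b_old)
    reps = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        bo = [b_old[i] for i in idx]
        if statistics.stdev(bo) == 0:
            continue  # degenerate resample (all the same item); skip
        reps.append(mean_sigma_slope(bo, [b_new[i] for i in idx]))
    return statistics.stdev(reps)

b_old = [-1.2, -0.5, 0.0, 0.4, 0.9, 1.5]  # hypothetical Form A difficulties
b_new = [-1.0, -0.3, 0.1, 0.7, 1.3, 1.9]  # same items on the Form B scale
slope = mean_sigma_slope(b_old, b_new)
se = bootstrap_se(b_old, b_new)
```

The delta method would instead derive the standard error analytically from the estimator's gradient, and multiple imputation from the variability across imputed data sets; all three target the same quantity.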
Peer reviewed
Wolkowitz, Amanda A.; Wright, Keith D. – Journal of Educational Measurement, 2019
This article explores the amount of equating error at a passing score when equating scores from exams with small sample sizes. It focuses on equating with the classical test theory methods of Tucker linear, Levine linear, frequency estimation, and chained equipercentile equating. Both simulation and real data studies were used in the…
Descriptors: Error Patterns, Sample Size, Test Theory, Test Bias
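The classical linear methods Wolkowitz and Wright compare all share the same backbone: mapping a Form X score onto the Form Y scale by matching means and standard deviations. Tucker and Levine methods adjust these moments for group differences via an anchor test; the sketch below shows only the common linear step, with hypothetical scores:

```python
import statistics

def linear_equate(x, scores_x, scores_y):
    """Linear equating: place a Form X score on the Form Y scale by
    matching the two forms' means and standard deviations."""
    mx, my = statistics.mean(scores_x), statistics.mean(scores_y)
    sx, sy = statistics.stdev(scores_x), statistics.stdev(scores_y)
    return my + (sy / sx) * (x - mx)

form_x = [10, 12, 14, 16, 18]  # hypothetical Form X scores
form_y = [12, 14, 16, 18, 20]  # hypothetical Form Y scores
equated = linear_equate(14, form_x, form_y)  # 16.0: mean maps to mean
```

With small samples, the estimated means and standard deviations are noisy, which is precisely the source of the equating error at the passing score that the article quantifies.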
Peer reviewed
Bergner, Yoav; Choi, Ikkyu; Castellano, Katherine E. – Journal of Educational Measurement, 2019
Allowance for multiple chances to answer constructed response questions is a prevalent feature in computer-based homework and exams. We consider the use of item response theory in the estimation of item characteristics and student ability when multiple attempts are allowed but no explicit penalty is deducted for extra tries. This is common…
Descriptors: Models, Item Response Theory, Homework, Computer Assisted Instruction