ERIC - Search Results

Publication Date

In 2024	6
Since 2023	33
Since 2020 (last 5 years)	78
Since 2015 (last 10 years)	78
Since 2005 (last 20 years)	78

Descriptor

Test Items	31
Item Response Theory	29
Scores	20
Models	19
Accuracy	14
Computation	14
Computer Assisted Testing	13
Reaction Time	13
Evaluation Methods	12
Monte Carlo Methods	10
Scoring	10
Simulation	10
Adaptive Testing	9
Achievement Tests	8
Foreign Countries	8
Comparative Analysis	7
Statistical Analysis	7
Test Construction	7
Automation	6
Difficulty Level	6
Error of Measurement	6
International Assessment	6
Test Format	6
Validity	6
Alternative Assessment	5

Source

Journal of Educational…

Publication Type

Journal Articles	78
Reports - Research	51
Reports - Evaluative	15
Reports - Descriptive	12

Education Level

Secondary Education	6
Elementary Education	1
Elementary Secondary Education	1
Grade 7	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1

Location

Hong Kong

Assessments and Surveys

Program for International…	5
Measures of Academic Progress	1
National Assessment of…	1
Trends in International…	1

Showing 1 to 15 of 78 results

Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches

Peer reviewed

Güler Yavuz Temel – Journal of Educational Measurement, 2024

The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…

Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models

MSAEM Estimation for Confirmatory Multidimensional Four-Parameter Normal Ogive Models

Peer reviewed

Jia Liu; Xiangbin Meng; Gongjun Xu; Wei Gao; Ningzhong Shi – Journal of Educational Measurement, 2024

In this paper, we develop a mixed stochastic approximation expectation-maximization (MSAEM) algorithm coupled with a Gibbs sampler to compute the marginalized maximum a posteriori estimate (MMAPE) of a confirmatory multidimensional four-parameter normal ogive (M4PNO) model. The proposed MSAEM algorithm not only has the computational advantages of…

Descriptors: Algorithms, Achievement Tests, Foreign Countries, International Assessment

Sociocognitive Processes and Item Response Models: A Didactic Example

Peer reviewed

Tao Gong; Lan Shuai; Robert J. Mislevy – Journal of Educational Measurement, 2024

The usual interpretation of the person and task variables in between-persons measurement models such as item response theory (IRT) is as attributes of persons and tasks, respectively. They can be viewed instead as ensemble descriptors of patterns of interactions among persons and situations that arise from sociocognitive complex adaptive system…

Descriptors: Cognitive Processes, Item Response Theory, Social Cognition, Individualized Instruction

Measuring the Impact of Peer Interaction in Group Oral Assessments with an Extended Many-Facet Rasch Model

Peer reviewed

Kuan-Yu Jin; Thomas Eckes – Journal of Educational Measurement, 2024

Many language proficiency tests include group oral assessments involving peer interaction. In such an assessment, examinees discuss a common topic with others. Human raters score each examinee's spoken performance on specially designed criteria. However, measurement models for analyzing group assessment data usually assume local person…

Descriptors: Peer Relationship, Interaction, Oral Language, Student Evaluation

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

Computation and Accuracy Evaluation of Comparable Scores on Culturally Responsive Assessments

Peer reviewed

Sandip Sinharay; Matthew S. Johnson – Journal of Educational Measurement, 2024

Culturally responsive assessments have been proposed as potential tools to ensure equity and fairness for examinees from all backgrounds including those from traditionally underserved or minoritized groups. However, these assessments are relatively new and, with few exceptions, are yet to be implemented in large scale. Consequently, there is a…

Descriptors: Culturally Relevant Education, Evaluation, Equal Education, Disadvantaged

Using Item Scores and Distractors in Person-Fit Assessment

Peer reviewed

Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023

In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l[subscript z] and l*[subscript z] person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…

Descriptors: Test Items, Scores, Goodness of Fit, Statistics

Cognitive Diagnostic Multistage Testing by Partitioning Hierarchically Structured Attributes

Peer reviewed

Kim, Rae Yeong; Yoo, Yun Joo – Journal of Educational Measurement, 2023

In cognitive diagnostic models (CDMs), a set of fine-grained attributes is required to characterize complex problem solving and provide detailed diagnostic information about an examinee. However, it is challenging to ensure reliable estimation and control computational complexity when the test aims to identify the examinee's attribute profile in a…

Descriptors: Models, Diagnostic Tests, Adaptive Testing, Accuracy

Estimating Classification Accuracy and Consistency Indices for Multiple Measures with the Simple Structure MIRT Model

Peer reviewed

Park, Seohee; Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2023

Multiple measures, such as multiple content domains or multiple types of performance, are used in various testing programs to classify examinees for screening or selection. Despite the popular usages of multiple measures, there is little research on classification consistency and accuracy of multiple measures. Accordingly, this study introduces an…

Descriptors: Testing, Computation, Classification, Accuracy

A Deterministic Gated Lognormal Response Time Model to Identify Examinees with Item Preknowledge

Peer reviewed

Kasli, Murat; Zopluoglu, Cengiz; Toton, Sarah L. – Journal of Educational Measurement, 2023

Response times (RTs) have recently attracted a significant amount of attention in the literature as they may provide meaningful information about item preknowledge. In this study, a new model, the Deterministic Gated Lognormal Response Time (DG-LNRT) model, is proposed to identify examinees with item preknowledge using RTs. The proposed model was…

Descriptors: Reaction Time, Test Items, Models, Familiarity

A Factor Mixture Model for Item Responses and Certainty of Response Indices to Identify Student Knowledge Profiles

Peer reviewed

Chen, Chia-Wen; Andersson, Björn; Zhu, Jinxin – Journal of Educational Measurement, 2023

The certainty of response index (CRI) measures respondents' confidence level when answering an item. In conjunction with the answers to the items, previous studies have used descriptive statistics and arbitrary thresholds to identify student knowledge profiles with the CRIs. Whereas this approach overlooked the measurement error of the observed…

Descriptors: Item Response Theory, Factor Analysis, Psychometrics, Test Items

A New Bayesian Person-Fit Analysis Method Using Pivotal Discrepancy Measures

Peer reviewed

Combs, Adam – Journal of Educational Measurement, 2023

A common method of checking person-fit in Bayesian item response theory (IRT) is the posterior-predictive (PP) method. In recent years, more powerful approaches have been proposed that are based on resampling methods using the popular L*[subscript z] statistic. There has also been proposed a new Bayesian model checking method based on pivotal…

Descriptors: Bayesian Statistics, Goodness of Fit, Evaluation Methods, Monte Carlo Methods

Several Variations of Simple-Structure MIRT Equating

Peer reviewed

Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2023

The current study proposed several variants of simple-structure multidimensional item response theory equating procedures. Four distinct sets of data were used to demonstrate feasibility of proposed equating methods for two different equating designs: a random groups design and a common-item nonequivalent groups design. Findings indicated some…

Descriptors: Item Response Theory, Equated Scores, Monte Carlo Methods, Research Methodology

Using Simulated Retests to Estimate the Reliability of Diagnostic Assessment Systems

Peer reviewed

Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023

As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…

Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy

Pretest Item Calibration in Computerized Multistage Adaptive Testing

Peer reviewed

Ersen, Rabia Karatoprak; Lee, Won-Chan – Journal of Educational Measurement, 2023

The purpose of this study was to compare calibration and linking methods for placing pretest item parameter estimates on the item pool scale in a 1-3 computerized multistage adaptive testing design in terms of item parameter recovery. Two models were used: embedded-section, in which pretest items were administered within a separate module, and…

Descriptors: Pretesting, Test Items, Computer Assisted Testing, Adaptive Testing

Author

Lee, Won-Chan	5
Choe, Edison M.	3
Clauser, Brian E.	3
DeCarlo, Lawrence T.	3
Lim, Hwanggyu	3
McCaffrey, Daniel F.	3
Wollack, James A.	3
Baldwin, Peter	2
Cheng, Ying	2
Goldhammer, Frank	2
Gorney, Kylie	2
Han, Kyung T.	2
He, Yinhong	2
Jiao, Hong	2
Kim, Kyung Yong	2
Kroehne, Ulf	2
Liu, Jinghua	2
Qiao, Xin	2
Yaneva, Victoria	2
A. Corinne Huggins-Manley	1
Ackerman, Terry A.	1
Almehrizi, Rashid S.	1
Andersson, Björn	1
Becker, Benjamin	1
Becker, Betsy Jane	1