ERIC - Search Results

Showing 1 to 15 of 156 results

The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability

Peer reviewed

Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024

Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…

Descriptors: Influences, Models, Measurement Techniques, Reliability

Correcting for Extreme Response Style: Model Choice Matters

Peer reviewed

Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024

Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…

Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales

Are the Steps on Likert Scales Equidistant? Responses on Visual Analog Scales Allow Estimating Their Distances

Peer reviewed

Miguel A. García-Pérez – Educational and Psychological Measurement, 2024

A recurring question regarding Likert items is whether the discrete steps that this response format allows represent constant increments along the underlying continuum. This question appears unsolvable because Likert responses carry no direct information to this effect. Yet, any item administered in Likert format can identically be administered…

Descriptors: Likert Scales, Test Construction, Test Items, Item Analysis

Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests

Peer reviewed

Huang, Hung-Yu – Educational and Psychological Measurement, 2023

The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…

Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making

Relative Robustness of CDMs and (M)IRT in Measuring Growth in Latent Skills

Peer reviewed

Huang, Qi; Bolt, Daniel M. – Educational and Psychological Measurement, 2023

Previous studies have demonstrated evidence of latent skill continuity even in tests intentionally designed for measurement of binary skills. In addition, the assumption of binary skills when continuity is present has been shown to potentially create a lack of invariance in item and latent ability parameters that may undermine applications. In…

Descriptors: Item Response Theory, Test Items, Skill Development, Robustness (Statistics)

A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning within a Test

Peer reviewed

Lozano, José H.; Revuelta, Javier – Educational and Psychological Measurement, 2023

The present paper introduces a general multidimensional model to measure individual differences in learning within a single administration of a test. Learning is assumed to result from practicing the operations involved in solving the items. The model accounts for the possibility that the ability to learn may manifest differently for correct and…

Descriptors: Bayesian Statistics, Learning Processes, Test Items, Item Analysis

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

The NEAT Equating via Chaining Random Forests in the Context of Small Sample Sizes: A Machine-Learning Method

Peer reviewed

Jiang, Zhehan; Han, Yuting; Xu, Lingling; Shi, Dexin; Liu, Ren; Ouyang, Jinying; Cai, Fen – Educational and Psychological Measurement, 2023

The part of responses that is absent in the nonequivalent groups with anchor test (NEAT) design can be managed to a planned missing scenario. In the context of small sample sizes, we present a machine learning (ML)-based imputation technique called chaining random forests (CRF) to perform equating tasks within the NEAT design. Specifically, seven…

Descriptors: Test Items, Equated Scores, Sample Size, Artificial Intelligence

The Impact and Detection of Uniform Differential Item Functioning for Continuous Item Response Models

Peer reviewed

Finch, W. Holmes – Educational and Psychological Measurement, 2023

Psychometricians have devoted much research and attention to categorical item responses, leading to the development and widespread use of item response theory for the estimation of model parameters and identification of items that do not perform in the same way for examinees from different population subgroups (e.g., differential item functioning…

Descriptors: Test Bias, Item Response Theory, Computation, Methods

Position of Correct Option and Distractors Impacts Responses to Multiple-Choice Items: Evidence from a National Test

Peer reviewed

Lions, Séverin; Dartnell, Pablo; Toledo, Gabriela; Godoy, María Inés; Córdova, Nora; Jiménez, Daniela; Lemarié, Julie – Educational and Psychological Measurement, 2023

Even though the impact of the position of response options on answers to multiple-choice items has been investigated for decades, it remains debated. Research on this topic is inconclusive, perhaps because too few studies have obtained experimental data from large-sized samples in a real-world context and have manipulated the position of both…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Responses

Detecting Preknowledge Cheating via Innovative Measures: A Mixture Hierarchical Model for Jointly Modeling Item Responses, Response Times, and Visual Fixation Counts

Peer reviewed

Man, Kaiwen; Harring, Jeffrey R. – Educational and Psychological Measurement, 2023

Preknowledge cheating jeopardizes the validity of inferences based on test results. Many methods have been developed to detect preknowledge cheating by jointly analyzing item responses and response times. Gaze fixations, an essential eye-tracker measure, can be utilized to help detect aberrant testing behavior with improved accuracy beyond using…

Descriptors: Cheating, Reaction Time, Test Items, Responses

A New Stopping Criterion for Rasch Trees Based on the Mantel-Haenszel Effect Size Measure for Differential Item Functioning

Peer reviewed

Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…

Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance

On Bank Assembly and Block Selection in Multidimensional Forced-Choice Adaptive Assessments

Peer reviewed

Kreitchmann, Rodrigo S.; Sorrel, Miguel A.; Abad, Francisco J. – Educational and Psychological Measurement, 2023

Multidimensional forced-choice (FC) questionnaires have been consistently found to reduce the effects of socially desirable responding and faking in noncognitive assessments. Although FC has been considered problematic for providing ipsative scores under the classical test theory, item response theory (IRT) models enable the estimation of…

Descriptors: Measurement Techniques, Questionnaires, Social Desirability, Adaptive Testing

Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models

Peer reviewed

Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023

To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…

Descriptors: Models, Item Response Theory, Test Items, Intervals

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

