ERIC - Search Results

Publication Date

In 2024	1
Since 2023	8
Since 2020 (last 5 years)	20
Since 2015 (last 10 years)	20
Since 2005 (last 20 years)	20

Descriptor

Foreign Countries	20
Achievement Tests	6
Comparative Analysis	6
International Assessment	6
Statistical Analysis	6
College Students	5
Item Response Theory	5
Test Items	5
Accuracy	4
Correlation	4
Item Analysis	4
Models	4
Secondary School Students	4
Artificial Intelligence	3
Bayesian Statistics	3
Classification	3
Gender Differences	3
Mathematics Tests	3
Multiple Choice Tests	3
Response Style (Tests)	3
Responses	3
Scores	3
Test Reliability	3
Test Validity	3
Difficulty Level	2
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	20
Reports - Research	19
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Higher Education	8
Postsecondary Education	8
Secondary Education	7
Elementary Education	2
Elementary Secondary Education	2
High Schools	2
Early Childhood Education	1
Grade 10	1
Grade 8	1
Junior High Schools	1
Kindergarten	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Location

Germany	6
China	3
Spain	2
Australia	1
Canada	1
Chile	1
Mexico (Mexico City)	1
Netherlands	1
New Zealand	1
United Arab Emirates	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	4
Trends in International…	2

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

Identifying Disengaged Responding in Multiple-Choice Items: Extending a Latent Class Item Response Model with Novel Process Data Indicators

Peer reviewed

Direct link

Jana Welling; Timo Gnambs; Claus H. Carstensen – Educational and Psychological Measurement, 2024

Disengaged responding poses a severe threat to the validity of educational large-scale assessments, because item responses from unmotivated test-takers do not reflect their actual ability. Existing identification approaches rely primarily on item response times, which bears the risk of misclassifying fast engaged or slow disengaged responses.…

Descriptors: Foreign Countries, College Students, Guessing (Tests), Multiple Choice Tests

Fixed Effects or Mixed Effects Classifiers? Evidence from Simulated and Archival Data

Peer reviewed

Direct link

Mangino, Anthony A.; Bolin, Jocelyn H.; Finch, W. Holmes – Educational and Psychological Measurement, 2023

This study seeks to compare fixed and mixed effects models for the purposes of predictive classification in the presence of multilevel data. The first part of the study utilizes a Monte Carlo simulation to compare fixed and mixed effects logistic regression and random forests. An applied examination of the prediction of student retention in the…

Descriptors: Prediction, Classification, Monte Carlo Methods, Foreign Countries

Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics

Peer reviewed

Direct link

Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023

When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…

Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation

Position of Correct Option and Distractors Impacts Responses to Multiple-Choice Items: Evidence from a National Test

Peer reviewed

Direct link

Lions, Séverin; Dartnell, Pablo; Toledo, Gabriela; Godoy, María Inés; Córdova, Nora; Jiménez, Daniela; Lemarié, Julie – Educational and Psychological Measurement, 2023

Even though the impact of the position of response options on answers to multiple-choice items has been investigated for decades, it remains debated. Research on this topic is inconclusive, perhaps because too few studies have obtained experimental data from large-sized samples in a real-world context and have manipulated the position of both…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Responses

Generalized Mantel-Haenszel Estimators for Simultaneous Differential Item Functioning Tests

Peer reviewed

Direct link

Liu, Ivy; Suesse, Thomas; Harvey, Samuel; Gu, Peter Yongqi; Fernández, Daniel; Randal, John – Educational and Psychological Measurement, 2023

The Mantel-Haenszel estimator is one of the most popular techniques for measuring differential item functioning (DIF). A generalization of this estimator is applied to the context of DIF to compare items by taking the covariance of odds ratio estimators between dependent items into account. Unlike the Item Response Theory, the method does not rely…

Descriptors: Test Bias, Computation, Statistical Analysis, Achievement Tests

Performance of Coefficient Alpha and Its Alternatives: Effects of Different Types of Non-Normality

Peer reviewed

Direct link

Xiao, Leifeng; Hau, Kit-Tai – Educational and Psychological Measurement, 2023

We examined the performance of coefficient alpha and its potential competitors (ordinal alpha, omega total, Revelle's omega total [omega RT], omega hierarchical [omega h], greatest lower bound [GLB], and coefficient "H") with continuous and discrete data having different types of non-normality. Results showed the estimation bias was…

Descriptors: Statistical Bias, Statistical Analysis, Likert Scales, Statistical Distributions

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Changes in the Speed-Ability Relation through Different Treatments of Rapid Guessing

Peer reviewed

Direct link

Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023

As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…

Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias

Poisson Diagnostic Classification Models: A Framework and an Exploratory Example

Peer reviewed

Direct link

Liu, Ren; Liu, Haiyan; Shi, Dexin; Jiang, Zhehan – Educational and Psychological Measurement, 2022

Assessments with a large amount of small, similar, or often repetitive tasks are being used in educational, neurocognitive, and psychological contexts. For example, respondents are asked to recognize numbers or letters from a large pool of those and the number of correct answers is a count variable. In 1960, George Rasch developed the Rasch…

Descriptors: Classification, Models, Statistical Distributions, Scores

Matched and Fully Private? A New Self-Generated Identification Code for School-Based Cohort Studies to Increase Perceived Anonymity

Peer reviewed

Direct link

Calatrava, Maria; de Irala, Jokin; Osorio, Alfonso; Benítez, Edgar; Lopez-del Burgo, Cristina – Educational and Psychological Measurement, 2022

Anonymous questionnaires are frequently used in research with adolescents in order to obtain sincere answers about sensitive topics. Most longitudinal studies include self-generated identification codes (SGICs) to match information. Typical elements include a combination of letters and digits from personal data. However, these data may make the…

Descriptors: Privacy, Questionnaires, Coding, Adolescents

Detecting Differential Rater Functioning in Severity and Centrality: The Dual DRF Facets Model

Peer reviewed

Direct link

Jin, Kuan-Yu; Eckes, Thomas – Educational and Psychological Measurement, 2022

Performance assessments heavily rely on human ratings. These ratings are typically subject to various forms of error and bias, threatening the assessment outcomes' validity and fairness. Differential rater functioning (DRF) is a special kind of threat to fairness manifesting itself in unwanted interactions between raters and performance- or…

Descriptors: Performance Based Assessment, Rating Scales, Test Bias, Student Evaluation

On the Relationship between Item Stem Formulation and Criterion Validity of Multiple-Component Measuring Instruments

Peer reviewed

Direct link

Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2022

The possible dependency of criterion validity on item formulation in a multicomponent measuring instrument is examined. The discussion is concerned with evaluation of the differences in criterion validity between two or more groups (populations/subpopulations) that have been administered instruments with items having differently formulated item…

Descriptors: Test Items, Measures (Individuals), Test Validity, Difficulty Level

Detecting Careless Responding in Survey Data Using Stochastic Gradient Boosting

Peer reviewed

Direct link

Schroeders, Ulrich; Schmidt, Christoph; Gnambs, Timo – Educational and Psychological Measurement, 2022

Careless responding is a bias in survey responses that disregards the actual item content, constituting a threat to the factor structure, reliability, and validity of psychological measurements. Different approaches have been proposed to detect aberrant responses such as probing questions that directly assess test-taking behavior (e.g., bogus…

Descriptors: Response Style (Tests), Surveys, Artificial Intelligence, Identification

Effects of Response Option Order on Likert-Type Psychometric Properties and Reactions

Peer reviewed

Direct link

Robie, Chet; Meade, Adam W.; Risavy, Stephen D.; Rasheed, Sabah – Educational and Psychological Measurement, 2022

The effects of different response option orders on survey responses have been studied extensively. The typical research design involves examining the differences in response characteristics between conditions with the same item stems and response option orders that differ in valence--either incrementally arranged (e.g., strongly disagree to…

Descriptors: Likert Scales, Psychometrics, Surveys, Responses

A Mixture IRTree Model for Extreme Response Style: Accounting for Response Process Uncertainty

Peer reviewed

Direct link

Kim, Nana; Bolt, Daniel M. – Educational and Psychological Measurement, 2021

This paper presents a mixture item response tree (IRTree) model for extreme response style. Unlike traditional applications of single IRTree models, a mixture approach provides a way of representing the mixture of respondents following different underlying response processes (between individuals), as well as the uncertainty present at the…

Descriptors: Item Response Theory, Response Style (Tests), Models, Test Items

Previous Page | Next Page »

Pages: 1 | 2

Privacy | Copyright | Contact Us | Selection Policy | API

Shi, Dexin	3
Jiang, Zhehan	2
Benítez, Edgar	1
Bolin, Jocelyn H.	1
Bolt, Daniel M.	1
Calatrava, Maria	1
Claus H. Carstensen	1
Córdova, Nora	1
Dartnell, Pablo	1
Deribo, Tobias	1
Distefano, Christine	1
Eckes, Thomas	1
Fernández, Daniel	1
Finch, W. Holmes	1
Flore, Paulette C.	1
Forstner, Thomas	1
Franco-Martínez, Alicia	1
Gnambs, Timo	1
Godoy, María Inés	1
Goldhammer, Frank	1
Gu, Peter Yongqi	1
Harvey, Samuel	1
Hau, Kit-Tai	1
Jana Welling	1
Jiménez, Daniela	1
More ▼