ERIC - Search Results

Publication Date

In 2024	0
Since 2023	1
Since 2020 (last 5 years)	1
Since 2015 (last 10 years)	8
Since 2005 (last 20 years)	13

Descriptor

Computation	13
Foreign Countries	10
Item Response Theory	7
Test Items	5
Achievement Tests	4
Grade 7	4
Longitudinal Studies	4
Models	4
Secondary School Students	4
Statistical Analysis	4
Test Bias	4
Comparative Analysis	3
Correlation	3
Error of Measurement	3
Grade 8	3
Grade 9	3
Hierarchical Linear Modeling	3
Ability	2
Equated Scores	2
Goodness of Fit	2
Grade 6	2
International Assessment	2
Mathematics Achievement	2
Measurement	2
Measurement Techniques	2
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	13
Reports - Research	13

Education Level

Secondary Education	13
Junior High Schools	9
Middle Schools	9
Elementary Education	6
Grade 7	4
Grade 8	3
Grade 9	3
High Schools	3
Elementary Secondary Education	2
Grade 6	2
Grade 10	1
Grade 11	1
Grade 3	1
Grade 4	1
Grade 5	1
More ▼

Audience

Location

Germany	2
Australia	1
China	1
Hong Kong	1
Ireland	1
Netherlands	1
New Zealand	1
South Korea	1
Taiwan	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	3
Trends in International…	2
Graduate Record Examinations	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Generalized Mantel-Haenszel Estimators for Simultaneous Differential Item Functioning Tests

Peer reviewed

Direct link

Liu, Ivy; Suesse, Thomas; Harvey, Samuel; Gu, Peter Yongqi; Fernández, Daniel; Randal, John – Educational and Psychological Measurement, 2023

The Mantel-Haenszel estimator is one of the most popular techniques for measuring differential item functioning (DIF). A generalization of this estimator is applied to the context of DIF to compare items by taking the covariance of odds ratio estimators between dependent items into account. Unlike the Item Response Theory, the method does not rely…

Descriptors: Test Bias, Computation, Statistical Analysis, Achievement Tests

When Nonresponse Mechanisms Change: Effects on Trends and Group Comparisons in International Large-Scale Assessments

Peer reviewed

Direct link

Sachse, Karoline A.; Mahler, Nicole; Pohl, Steffi – Educational and Psychological Measurement, 2019

Mechanisms causing item nonresponses in large-scale assessments are often said to be nonignorable. Parameter estimates can be biased if nonignorable missing data mechanisms are not adequately modeled. In trend analyses, it is plausible for the missing data mechanism and the percentage of missing values to change over time. In this article, we…

Descriptors: International Assessment, Response Style (Tests), Achievement Tests, Foreign Countries

Exploring the Test of Covariate Moderation Effects in Multilevel MIMIC Models

Peer reviewed

Direct link

Cao, Chunhua; Kim, Eun Sook; Chen, Yi-Hsin; Ferron, John; Stark, Stephen – Educational and Psychological Measurement, 2019

In multilevel multiple-indicator multiple-cause (MIMIC) models, covariates can interact at the within level, at the between level, or across levels. This study examines the performance of multilevel MIMIC models in estimating and detecting the interaction effect of two covariates through a simulation and provides an empirical demonstration of…

Descriptors: Hierarchical Linear Modeling, Structural Equation Models, Computation, Identification

The Total Score with Maximal Reliability and Maximal Criterion Validity: An Illustration Using a Career Satisfaction Measure

Peer reviewed

Direct link

Fu, Yuanshu; Wen, Zhonglin; Wang, Yang – Educational and Psychological Measurement, 2018

The maximal reliability of a congeneric measure is achieved by weighting item scores to form the optimal linear combination as the total score; it is never lower than the composite reliability of the measure when measurement errors are uncorrelated. The statistical method that renders maximal reliability would also lead to maximal criterion…

Descriptors: Test Reliability, Test Validity, Comparative Analysis, Attitude Measures

Effects of Design Properties on Parameter Estimation in Large-Scale Assessments

Peer reviewed

Direct link

Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas – Educational and Psychological Measurement, 2015

The selection of an appropriate booklet design is an important element of large-scale assessments of student achievement. Two design properties that are typically optimized are the "balance" with respect to the positions the items are presented and with respect to the mutual occurrence of pairs of items in the same booklet. The purpose…

Descriptors: Measurement, Computation, Test Format, Test Items

Item Response Theory Models for Wording Effects in Mixed-Format Scales

Peer reviewed

Direct link

Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu – Educational and Psychological Measurement, 2015

Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…

Descriptors: Item Response Theory, Test Format, Language Usage, Test Items

Taking the Missing Propensity into Account When Estimating Competence Scores: Evaluation of Item Response Theory Models for Nonignorable Omissions

Peer reviewed

Direct link

Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H. – Educational and Psychological Measurement, 2015

When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…

Descriptors: Competence, Tests, Evaluation Methods, Adults

An Alternative Way to Model Population Ability Distributions in Large-Scale Educational Surveys

Peer reviewed

Direct link

Wetzel, Eunike; Xu, Xueli; von Davier, Matthias – Educational and Psychological Measurement, 2015

In large-scale educational surveys, a latent regression model is used to compensate for the shortage of cognitive information. Conventionally, the covariates in the latent regression model are principal components extracted from background data. This operational method has several important disadvantages, such as the handling of missing data and…

Descriptors: Surveys, Regression (Statistics), Models, Research Methodology

The Effect of Observation Length and Presentation Order on the Reliability and Validity of an Observational Measure of Teaching Quality

Peer reviewed

Direct link

Mashburn, Andrew J.; Meyer, J. Patrick; Allen, Joseph P.; Pianta, Robert C. – Educational and Psychological Measurement, 2014

Observational methods are increasingly being used in classrooms to evaluate the quality of teaching. Operational procedures for observing teachers are somewhat arbitrary in existing measures and vary across different instruments. To study the effect of different observation procedures on score reliability and validity, we conducted an experimental…

Descriptors: Observation, Teacher Evaluation, Reliability, Validity

Effects of Item Parameter Drift on Vertical Scaling with the Nonequivalent Groups with Anchor Test (NEAT) Design

Peer reviewed

Direct link

Ye, Meng; Xin, Tao – Educational and Psychological Measurement, 2014

The authors explored the effects of drifting common items on vertical scaling within the higher order framework of item parameter drift (IPD). The results showed that if IPD occurred between a pair of test levels, the scaling performance started to deviate from the ideal state, as indicated by bias of scaling. When there were two items drifting…

Descriptors: Scaling, Test Items, Equated Scores, Achievement Gains

A Comparison of Three IRT Approaches to Examinee Ability Change Modeling in a Single-Group Anchor Test Design

Peer reviewed

Direct link

Paek, Insu; Park, Hyun-Jeong; Cai, Li; Chi, Eunlim – Educational and Psychological Measurement, 2014

Typically a longitudinal growth modeling based on item response theory (IRT) requires repeated measures data from a single group with the same test design. If operational or item exposure problems are present, the same test may not be employed to collect data for longitudinal analyses and tests at multiple time points are constructed with unique…

Descriptors: Item Response Theory, Comparative Analysis, Test Items, Equated Scores

In Search of Value Added in the Case of Complex School Effects

Peer reviewed

Direct link

Timmermans, Anneke C.; Snijders, Tom A. B.; Bosker, Roel J. – Educational and Psychological Measurement, 2013

In traditional studies on value-added indicators of educational effectiveness, students are usually treated as belonging to those schools where they took their final examination. However, in practice, students sometimes attend multiple schools and therefore it is questionable whether this assumption of belonging to the last school they attended…

Descriptors: School Effectiveness, Student Mobility, Elementary Schools, Secondary Schools

Higher Order Testlet Response Models for Hierarchical Latent Traits and Testlet-Based Items

Peer reviewed

Direct link

Huang, Hung-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2013

Both testlet design and hierarchical latent traits are fairly common in educational and psychological measurements. This study aimed to develop a new class of higher order testlet response models that consider both local item dependence within testlets and a hierarchy of latent traits. Due to high dimensionality, the authors adopted the Bayesian…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation

Privacy | Copyright | Contact Us | Selection Policy | API

Pohl, Steffi	2
Wang, Wen-Chung	2
Allen, Joseph P.	1
Bosker, Roel J.	1
Cai, Li	1
Cao, Chunhua	1
Carstensen, Claus H.	1
Chen, Hui-Fang	1
Chen, Yi-Hsin	1
Chi, Eunlim	1
Fernández, Daniel	1
Ferron, John	1
Frey, Andreas	1
Fu, Yuanshu	1
Gu, Peter Yongqi	1
Harvey, Samuel	1
Hecht, Martin	1
Huang, Hung-Yu	1
Jin, Kuan-Yu	1
Kim, Eun Sook	1
Köhler, Carmen	1
Liu, Ivy	1
Mahler, Nicole	1
Mashburn, Andrew J.	1
Meyer, J. Patrick	1
More ▼