ERIC - Search Results

Publication Date

In 2024	0
Since 2023	3
Since 2020 (last 5 years)	5
Since 2015 (last 10 years)	11
Since 2005 (last 20 years)	15

Descriptor

Robustness (Statistics)	15
Item Response Theory	6
Statistical Analysis	6
Computation	5
Error of Measurement	5
Simulation	5
Comparative Analysis	4
Correlation	4
Sample Size	4
Evaluation Methods	3
Maximum Likelihood Statistics	3
Test Bias	3
Classification	2
Data	2
Educational Assessment	2
Hypothesis Testing	2
Item Analysis	2
Multivariate Analysis	2
Prediction	2
Regression (Statistics)	2
Scores	2
Statistical Bias	2
Statistical Distributions	2
Test Items	2
Test Reliability	2
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	15
Reports - Research	14
Reports - Evaluative	1

Education Level

Higher Education	2
Postsecondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Relative Robustness of CDMs and (M)IRT in Measuring Growth in Latent Skills

Peer reviewed

Direct link

Huang, Qi; Bolt, Daniel M. – Educational and Psychological Measurement, 2023

Previous studies have demonstrated evidence of latent skill continuity even in tests intentionally designed for measurement of binary skills. In addition, the assumption of binary skills when continuity is present has been shown to potentially create a lack of invariance in item and latent ability parameters that may undermine applications. In…

Descriptors: Item Response Theory, Test Items, Skill Development, Robustness (Statistics)

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

Direct link

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

A Small Sample Correction for Factor Score Regression

Peer reviewed

Direct link

Bogaert, Jasper; Loh, Wen Wei; Rosseel, Yves – Educational and Psychological Measurement, 2023

Factor score regression (FSR) is widely used as a convenient alternative to traditional structural equation modeling (SEM) for assessing structural relations between latent variables. But when latent variables are simply replaced by factor scores, biases in the structural parameter estimates often have to be corrected, due to the measurement error…

Descriptors: Factor Analysis, Regression (Statistics), Structural Equation Models, Error of Measurement

Robustness of Latent Profile Analysis to Measurement Noninvariance between Profiles

Peer reviewed

Direct link

Wang, Yan; Kim, Eunsook; Yi, Zhiyao – Educational and Psychological Measurement, 2022

Latent profile analysis (LPA) identifies heterogeneous subgroups based on continuous indicators that represent different dimensions. It is a common practice to measure each dimension using items, create composite or factor scores for each dimension, and use these scores as indicators of profiles in LPA. In this case, measurement models for…

Descriptors: Robustness (Statistics), Profiles, Statistical Analysis, Classification

Examining the Robustness of the Graded Response and 2-Parameter Logistic Models to Violations of Construct Normality

Peer reviewed

Direct link

Manapat, Patrick D.; Edwards, Michael C. – Educational and Psychological Measurement, 2022

When fitting unidimensional item response theory (IRT) models, the population distribution of the latent trait ([theta]) is often assumed to be normally distributed. However, some psychological theories would suggest a nonnormal [theta]. For example, some clinical traits (e.g., alcoholism, depression) are believed to follow a positively skewed…

Descriptors: Robustness (Statistics), Computational Linguistics, Item Response Theory, Psychological Patterns

Comparing the Robustness of Stepwise Mixture Modeling with Continuous Nonnormal Distal Outcomes

Peer reviewed

Direct link

Shin, Myungho; No, Unkyung; Hong, Sehee – Educational and Psychological Measurement, 2019

The present study aims to compare the robustness under various conditions of latent class analysis mixture modeling approaches that deal with auxiliary distal outcomes. Monte Carlo simulations were employed to test the performance of four approaches recommended by previous simulation studies: maximum likelihood (ML) assuming homoskedasticity…

Descriptors: Robustness (Statistics), Multivariate Analysis, Maximum Likelihood Statistics, Statistical Distributions

Investigating Measurement Invariance by Means of Parameter Instability Tests for 2PL and 3PL Models

Peer reviewed

Direct link

Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2019

M-fluctuation tests are a recently proposed method for detecting differential item functioning in Rasch models. This article discusses a generalization of this method to two additional item response theory models: the two-parametric logistic model and the three-parametric logistic model with a common guessing parameter. The Type I error rate and…

Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Maximum Likelihood Statistics

Hypothesis Testing, "p" Values, Confidence Intervals, Measures of Effect Size, and Bayesian Methods in Light of Modern Robust Techniques

Peer reviewed

Direct link

Wilcox, Rand R.; Serang, Sarfaraz – Educational and Psychological Measurement, 2017

The article provides perspectives on p values, null hypothesis testing, and alternative techniques in light of modern robust statistical methods. Null hypothesis testing and "p" values can provide useful information provided they are interpreted in a sound manner, which includes taking into account insights and advances that have…

Descriptors: Hypothesis Testing, Bayesian Statistics, Computation, Effect Size

Comparing the Performance of Approaches for Testing the Homogeneity of Variance Assumption in One-Factor ANOVA Models

Peer reviewed

Direct link

Wang, Yan; Rodríguez de Gil, Patricia; Chen, Yi-Hsin; Kromrey, Jeffrey D.; Kim, Eun Sook; Pham, Thanh; Nguyen, Diep; Romano, Jeanine L. – Educational and Psychological Measurement, 2017

Various tests to check the homogeneity of variance assumption have been proposed in the literature, yet there is no consensus as to their robustness when the assumption of normality does not hold. This simulation study evaluated the performance of 14 tests for the homogeneity of variance assumption in one-way ANOVA models in terms of Type I error…

Descriptors: Comparative Analysis, Statistical Analysis, Robustness (Statistics), Observation

Robust Coefficients Alpha and Omega and Confidence Intervals with Outlying Observations and Missing Data: Methods and Software

Peer reviewed

Direct link

Zhang, Zhiyong; Yuan, Ke-Hai – Educational and Psychological Measurement, 2016

Cronbach's coefficient alpha is a widely used reliability measure in social, behavioral, and education sciences. It is reported in nearly every study that involves measuring a construct through multiple items. With non-tau-equivalent items, McDonald's omega has been used as a popular alternative to alpha in the literature. Traditional estimation…

Descriptors: Computation, Statistical Analysis, Robustness (Statistics), Error of Measurement

A Cautionary Note on the Use of the Vale and Maurelli Method to Generate Multivariate, Nonnormal Data for Simulation Purposes

Peer reviewed

Direct link

Olvera Astivia, Oscar L.; Zumbo, Bruno D. – Educational and Psychological Measurement, 2015

To further understand the properties of data-generation algorithms for multivariate, nonnormal data, two Monte Carlo simulation studies comparing the Vale and Maurelli method and the Headrick fifth-order polynomial method were implemented. Combinations of skewness and kurtosis found in four published articles were run and attention was…

Descriptors: Data, Simulation, Monte Carlo Methods, Comparative Analysis

Application of the Overclaiming Technique to Scholastic Assessment

Peer reviewed

Direct link

Paulhus, Delroy L.; Dubois, Patrick J. – Educational and Psychological Measurement, 2014

The overclaiming technique is a novel assessment procedure that uses signal detection analysis to generate indices of knowledge accuracy (OC-accuracy) and self-enhancement (OC-bias). The technique has previously shown robustness over varied knowledge domains as well as low reactivity across administration contexts. Here we compared the OC-accuracy…

Descriptors: Educational Assessment, Knowledge Level, Accuracy, Cognitive Ability

Investigating Halo and Ceiling Effects in Student Evaluations of Instruction

Peer reviewed

Direct link

Keeley, Jared W.; English, Taylor; Irons, Jessica; Henslee, Amber M. – Educational and Psychological Measurement, 2013

Many measurement biases affect student evaluations of instruction (SEIs). However, two have been relatively understudied: halo effects and ceiling/floor effects. This study examined these effects in two ways. To examine the halo effect, using a videotaped lecture, we manipulated specific teacher behaviors to be "good" or "bad"…

Descriptors: Robustness (Statistics), Test Bias, Course Evaluation, Student Evaluation of Teacher Performance

A Robust Outlier Approach to Prevent Type I Error Inflation in Differential Item Functioning

Peer reviewed

Direct link

Magis, David; De Boeck, Paul – Educational and Psychological Measurement, 2012

The identification of differential item functioning (DIF) is often performed by means of statistical approaches that consider the raw scores as proxies for the ability trait level. One of the most popular approaches, the Mantel-Haenszel (MH) method, belongs to this category. However, replacing the ability level by the simple raw score is a source…

Descriptors: Test Bias, Data, Error of Measurement, Raw Scores

The Assessment of Reliability Under Range Restriction: A Comparison of [Alpha], [Omega], and Test-Retest Reliability for Dichotomous Data

Peer reviewed

Direct link

Fife, Dustin A.; Mendoza, Jorge L.; Terry, Robert – Educational and Psychological Measurement, 2012

Though much research and attention has been directed at assessing the correlation coefficient under range restriction, the assessment of reliability under range restriction has been largely ignored. This article uses item response theory to simulate dichotomous item-level data to assess the robustness of KR-20 ([alpha]), [omega], and test-retest…

Descriptors: Reliability, Computation, Comparative Analysis, Item Response Theory

Privacy | Copyright | Contact Us | Selection Policy | API

Wang, Yan	2
Bezirhan, Ummugul	1
Bogaert, Jasper	1
Bolt, Daniel M.	1
Chen, Yi-Hsin	1
De Boeck, Paul	1
Debelak, Rudolf	1
Dubois, Patrick J.	1
Edwards, Michael C.	1
English, Taylor	1
Fife, Dustin A.	1
Henslee, Amber M.	1
Hong, Sehee	1
Huang, Qi	1
Irons, Jessica	1
Keeley, Jared W.	1
Kim, Eun Sook	1
Kim, Eunsook	1
Kromrey, Jeffrey D.	1
Loh, Wen Wei	1
Magis, David	1
Manapat, Patrick D.	1
Mendoza, Jorge L.	1
Nguyen, Diep	1
No, Unkyung	1
More ▼