Showing 1 to 15 of 25 results
Peer reviewed
Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023
The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…
Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques
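Note: as background for this entry (conventional notation, not taken from the article), coefficient alpha and the factor-model composite reliability of a unidimensional instrument with k components, loadings \lambda_i, error variances \theta_i, component variances \sigma_{ii}, and total-score variance \sigma_X^2 are usually written as
\alpha = \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k}\sigma_{ii}}{\sigma_X^2}\right), \qquad \rho_X = \frac{\left(\sum_{i=1}^{k}\lambda_i\right)^2}{\left(\sum_{i=1}^{k}\lambda_i\right)^2 + \sum_{i=1}^{k}\theta_i},
and the entry concerns the population discrepancy between \alpha and \rho_X when the loadings \lambda_i are unequal.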
Peer reviewed
Huang, Sijia; Luo, Jinwen; Cai, Li – Educational and Psychological Measurement, 2023
Random item effects item response theory (IRT) models, which treat both person and item effects as random, have received much attention for more than a decade. The random item effects approach has several advantages in many practical settings. The present study introduced an explanatory multidimensional random item effects rating scale model. The…
Descriptors: Rating Scales, Item Response Theory, Models, Test Items
Peer reviewed
Sinharay, Sandip – Educational and Psychological Measurement, 2022
Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores and hence to incomplete data on mastery tests such as the AP and U.S. Medical Licensing examinations. Investigators are often interested in estimating the probabilities of passing of the examinees with incomplete data on mastery tests.…
Descriptors: Mastery Tests, Computer Assisted Testing, Probability, Test Wiseness
Peer reviewed
Raykov, Tenko; Menold, Natalja; Leer, Jane – Educational and Psychological Measurement, 2022
Two- and three-level designs in educational and psychological research can involve entire populations of Level-3 and possibly Level-2 units, such as schools and educational districts nested within a given state, or neighborhoods and counties in a state. Such a design is of increasing relevance in empirical research owing to the growing popularity…
Descriptors: Hierarchical Linear Modeling, Computation, Statistical Analysis, Research Design
Peer reviewed
Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2022
Proposed is a new method of standard setting referred to as the response vector for mastery (RVM) method. Under the RVM method, the task of panelists who participate in the standard-setting process does not involve conceptualization of a borderline examinee and probability judgments, as is the case with the Angoff and bookmark methods. Also, the…
Descriptors: Standard Setting (Scoring), Cutting Scores, Computation, Mastery Learning
Peer reviewed
Raykov, Tenko; DiStefano, Christine; Calvocoressi, Lisa; Volker, Martin – Educational and Psychological Measurement, 2022
A class of effect size indices is discussed that evaluates the degree to which two nested confirmatory factor analysis models differ from each other in terms of fit to a set of observed variables. These descriptive effect measures can be used to quantify the impact of parameter restrictions imposed in an initially considered model and are free…
Descriptors: Effect Size, Models, Measurement Techniques, Factor Analysis
Peer reviewed
Raykov, Tenko; DiStefano, Christine – Educational and Psychological Measurement, 2022
A latent variable modeling-based procedure is discussed that permits readily obtaining point and interval estimates of the design effect index in multilevel settings using widely circulated software. The method provides useful information about the relationship of important parameter standard errors when accounting for clustering effects relative to…
Descriptors: Hierarchical Linear Modeling, Correlation, Evaluation, Research Design
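Note: the design effect referred to here is conventionally expressed (this is the textbook form, not necessarily the estimator developed in the article) as
\mathrm{DEFF} = 1 + (\bar{m} - 1)\,\rho,
where \bar{m} is the average cluster size and \rho the intraclass correlation; \sqrt{\mathrm{DEFF}} indicates how much a parameter's standard error is inflated by clustering relative to simple random sampling.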
Peer reviewed
Raykov, Tenko; Calvocoressi, Lisa – Educational and Psychological Measurement, 2021
A procedure for evaluating the average R-squared index for a given set of observed variables in an exploratory factor analysis model is discussed. The method can be used as an effective aid in the process of model choice with respect to the number of factors underlying the interrelationships among studied measures. The approach is developed within…
Descriptors: Factor Analysis, Structural Equation Models, Statistical Analysis, Selection
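Note: in a factor model, the R-squared for an observed variable is its communality, so (in conventional notation, not taken from the article) an average R-squared index for p standardized variables with loading vectors \lambda_j and factor correlation matrix \Phi can be written as
\bar{R}^2 = \frac{1}{p}\sum_{j=1}^{p} \lambda_j' \Phi \lambda_j,
with larger values indicating that the retained factors account for more of the variance in the studied measures.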
Peer reviewed
Gwet, Kilem L. – Educational and Psychological Measurement, 2021
Cohen's kappa coefficient was originally proposed for two raters only, and it was later extended to an arbitrarily large number of raters to become what is known as Fleiss' generalized kappa. Fleiss' generalized kappa and its large-sample variance are still widely used by researchers and have been implemented in several software packages, including, among…
Descriptors: Sample Size, Statistical Analysis, Interrater Reliability, Computation
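Note: for readers unfamiliar with the statistic, a minimal Python sketch of the standard Fleiss' generalized kappa point estimate (not the variance results discussed in the article; the function name and example data are illustrative) is:

import numpy as np

def fleiss_kappa(counts):
    # counts: N x K matrix; row i gives how many of the n raters
    # assigned subject i to each of the K categories (rows sum to n).
    counts = np.asarray(counts, dtype=float)
    n = counts.sum(axis=1)[0]                      # raters per subject (assumed constant)
    N = counts.shape[0]                            # number of subjects
    p_j = counts.sum(axis=0) / (N * n)             # marginal category proportions
    P_i = (np.square(counts).sum(axis=1) - n) / (n * (n - 1))  # per-subject agreement
    P_bar, P_e = P_i.mean(), np.square(p_j).sum()  # observed vs. chance agreement
    return (P_bar - P_e) / (1 - P_e)

# Example: 4 subjects rated by 3 raters into 2 categories
print(fleiss_kappa([[3, 0], [2, 1], [1, 2], [0, 3]]))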
Peer reviewed
Trafimow, David; Wang, Cong; Wang, Tonghui – Educational and Psychological Measurement, 2020
Previous researchers have proposed the a priori procedure, whereby the researcher specifies, prior to data collection, how closely she wishes the sample means to approach corresponding population means, and the degree of confidence of meeting the specification. However, an important limitation of previous research is that researchers sometimes are…
Descriptors: Sampling, Statistical Analysis, Equations (Mathematics), Differences
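Note: in its simplest one-mean, known-sigma form (a standard precision calculation, not the new results of the article), specifying that the sample mean should fall within f standard deviations of the population mean with confidence c leads to the sample-size requirement
n \ge \left(\frac{z_{(1+c)/2}}{f}\right)^{2},
where z_{(1+c)/2} is the corresponding standard normal quantile; the entry concerns an important limitation of this earlier work.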
Peer reviewed
Ulitzsch, Esther; von Davier, Matthias; Pohl, Steffi – Educational and Psychological Measurement, 2020
So far, modeling approaches for not-reached items have considered one single underlying process. However, missing values at the end of a test can occur for a variety of reasons. On the one hand, examinees may not reach the end of a test due to time limits and lack of working speed. On the other hand, examinees may not attempt all items and quit…
Descriptors: Item Response Theory, Test Items, Response Style (Tests), Computer Assisted Testing
Peer reviewed
Jordan, Pascal; Spiess, Martin – Educational and Psychological Measurement, 2019
Factor loadings and item discrimination parameters play a key role in scale construction. A multitude of heuristics regarding their interpretation are hardwired into practice--for example, neglecting low loadings and assigning items to exactly one scale. We challenge the common sense interpretation of these parameters by providing counterexamples…
Descriptors: Test Construction, Test Items, Item Response Theory, Factor Structure
Peer reviewed
Zumbo, Bruno D.; Kroc, Edward – Educational and Psychological Measurement, 2019
Chalmers recently published a critique of the use of ordinal alpha proposed in Zumbo et al. as a measure of test reliability in certain research settings. In this response, we take up the task of refuting Chalmers' critique. We identify three broad misconceptions that characterize Chalmers' criticisms: (1) confusing assumptions with…
Descriptors: Test Reliability, Statistical Analysis, Misconceptions, Mathematical Models
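Note: ordinal alpha is typically described as standardized alpha computed from the polychoric rather than the Pearson correlation matrix of the items; in conventional notation (not taken from the article), for k items with average off-diagonal polychoric correlation \bar{r}^{*},
\alpha_{\mathrm{ordinal}} = \frac{k\,\bar{r}^{*}}{1 + (k-1)\,\bar{r}^{*}}.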
Peer reviewed
Ferrando, Pere J.; Lorenzo-Seva, Urbano – Educational and Psychological Measurement, 2019
Measures initially designed to be single-trait often yield data that are compatible with both an essentially unidimensional factor-analysis (FA) solution and a correlated-factors solution. For these cases, this article proposes an approach aimed at providing information for deciding which of the two solutions is the most appropriate and useful.…
Descriptors: Factor Analysis, Computation, Reliability, Goodness of Fit
Peer reviewed
Ferrando, Pere Joan; Lorenzo-Seva, Urbano – Educational and Psychological Measurement, 2019
Many psychometric measures yield data that are compatible with (a) an essentially unidimensional factor analysis solution and (b) a correlated-factor solution. Deciding which of these structures is the most appropriate and useful is of considerable importance, and various procedures have been proposed to help in this decision. The only fully…
Descriptors: Validity, Models, Correlation, Factor Analysis