ERIC - Search Results

Publication Date

In 2024	16
Since 2023	65
Since 2020 (last 5 years)	214
Since 2015 (last 10 years)	457
Since 2005 (last 20 years)	1012

Descriptor

Test Validity	792
Correlation	621
Factor Analysis	602
Higher Education	571
Statistical Analysis	560
Test Reliability	513
Factor Structure	441
Comparative Analysis	433
Scores	426
Test Items	406
Item Response Theory	372
Test Construction	365
Models	311
Measurement Techniques	281
College Students	271
Psychometrics	271
Foreign Countries	265
Item Analysis	264
Predictive Validity	262
Rating Scales	257
Academic Achievement	252
Error of Measurement	249
Reliability	238
Computer Programs	235
Mathematical Models	227
More ▼

Source

Educational and Psychological…

3900

Author

Michael, William B.	66
Raykov, Tenko	46
Marcoulides, George A.	42
Thompson, Bruce	26
Krus, David J.	21
Zumbo, Bruno D.	21
Vegelius, Jan	20
Wilcox, Rand R.	20
Plake, Barbara S.	19
Wang, Wen-Chung	19
Aiken, Lewis R.	18
Powers, Stephen	18
Algina, James	17
Dimitrov, Dimiter M.	17
Finch, W. Holmes	14
Keselman, H. J.	14
Schriesheim, Chester A.	14
Hakstian, A. Ralph	13
Hancock, Gregory R.	13
Kaiser, Henry F.	13
Kromrey, Jeffrey D.	13
Lewis, John	13
Martin, John D.	13
Shine, Lester C., II	13
More ▼

Publication Type

Journal Articles	2880
Reports - Research	2093
Reports - Evaluative	591
Reports - Descriptive	172
Speeches/Meeting Papers	53
Information Analyses	21
Guides - Non-Classroom	19
Opinion Papers	19
Tests/Questionnaires	17
Book/Product Reviews	13
Numerical/Quantitative Data	12
Reference Materials - General	3
Historical Materials	1
Reference Materials -…	1
Reports - General	1
More ▼

Education Level

Higher Education	113
Postsecondary Education	62
Secondary Education	59
Elementary Education	53
Middle Schools	42
High Schools	40
Junior High Schools	30
Elementary Secondary Education	24
Grade 4	24
Grade 8	20
Grade 3	18
Intermediate Grades	18
Early Childhood Education	16
Grade 5	16
Grade 7	16
Grade 6	14
Primary Education	13
Grade 9	10
Grade 10	9
Kindergarten	6
Adult Education	5
Grade 2	5
Preschool Education	4
Grade 1	3
Grade 11	3
More ▼

Audience

Practitioners	4
Researchers	4
Students	2
Teachers	1

Location

Canada	35
Australia	33
Germany	25
United States	18
Israel	17
Spain	13
Netherlands	12
Taiwan	11
South Korea	10
United Kingdom	10
China	9
California	8
Hong Kong	8
New Zealand	8
Japan	7
Mexico	6
Saudi Arabia	6
Georgia	5
Indiana	5
Ireland	5
Philippines	5
Belgium	4
Brazil	4
Florida	4
India	4
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Comprehensive Employment and…	1
Education Amendments 1972	1
Equal Rights Amendment	1
Title IX Education Amendments…	1

What Works Clearinghouse Rating

Educational and Psychological Measurement X

Showing 16 to 30 of 3,900 results Save | Export

Artificial Neural Networks for Short-Form Development of Psychometric Tests: A Study on Synthetic Populations Using Autoencoders

Peer reviewed

Direct link

Monica Casella; Pasquale Dolce; Michela Ponticorvo; Nicola Milano; Davide Marocco – Educational and Psychological Measurement, 2024

Short-form development is an important topic in psychometric research, which requires researchers to face methodological choices at different steps. The statistical techniques traditionally used for shortening tests, which belong to the so-called exploratory model, make assumptions not always verified in psychological data. This article proposes a…

Descriptors: Artificial Intelligence, Test Construction, Test Format, Psychometrics

Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2023

The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…

Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making

Using Simulated Annealing to Investigate Sensitivity of SEM to External Model Misspecification

Peer reviewed

Direct link

Fisk, Charles L.; Harring, Jeffrey R.; Shen, Zuchao; Leite, Walter; Suen, King Yiu; Marcoulides, Katerina M. – Educational and Psychological Measurement, 2023

Sensitivity analyses encompass a broad set of post-analytic techniques that are characterized as measuring the potential impact of any factor that has an effect on some output variables of a model. This research focuses on the utility of the simulated annealing algorithm to automatically identify path configurations and parameter values of omitted…

Descriptors: Structural Equation Models, Algorithms, Simulation, Evaluation Methods

Awareness Is Bliss: How Acquiescence Affects Exploratory Factor Analysis

Peer reviewed

Direct link

D'Urso, E. Damiano; Tijmstra, Jesper; Vermunt, Jeroen K.; De Roover, Kim – Educational and Psychological Measurement, 2023

Assessing the measurement model (MM) of self-report scales is crucial to obtain valid measurements of individuals' latent psychological constructs. This entails evaluating the number of measured constructs and determining which construct is measured by which item. Exploratory factor analysis (EFA) is the most-used method to evaluate these…

Descriptors: Factor Analysis, Measurement Techniques, Self Evaluation (Individuals), Psychological Patterns

The Impact of Sample Size and Various Other Factors on Estimation of Dichotomous Mixture IRT Models

Peer reviewed

Direct link

Sen, Sedat; Cohen, Allan S. – Educational and Psychological Measurement, 2023

The purpose of this study was to examine the effects of different data conditions on item parameter recovery and classification accuracy of three dichotomous mixture item response theory (IRT) models: the Mix1PL, Mix2PL, and Mix3PL. Manipulated factors in the simulation included the sample size (11 different sample sizes from 100 to 5000), test…

Descriptors: Sample Size, Item Response Theory, Accuracy, Classification

Is the Area under Curve Appropriate for Evaluating the Fit of Psychometric Models?

Peer reviewed

Direct link

Han, Yuting; Zhang, Jihong; Jiang, Zhehan; Shi, Dexin – Educational and Psychological Measurement, 2023

In the literature of modern psychometric modeling, mostly related to item response theory (IRT), the fit of model is evaluated through known indices, such as X[superscript 2], M2, and root mean square error of approximation (RMSEA) for absolute assessments as well as Akaike information criterion (AIC), consistent AIC (CAIC), and Bayesian…

Descriptors: Goodness of Fit, Psychometrics, Error of Measurement, Item Response Theory

Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment

Peer reviewed

Direct link

Zhou, Todd; Jiao, Hong – Educational and Psychological Measurement, 2023

Cheating detection in large-scale assessment received considerable attention in the extant literature. However, none of the previous studies in this line of research investigated the stacking ensemble machine learning algorithm for cheating detection. Furthermore, no study addressed the issue of class imbalance using resampling. This study…

Descriptors: Cheating, Measurement, Artificial Intelligence, Algorithms

Relative Robustness of CDMs and (M)IRT in Measuring Growth in Latent Skills

Peer reviewed

Direct link

Huang, Qi; Bolt, Daniel M. – Educational and Psychological Measurement, 2023

Previous studies have demonstrated evidence of latent skill continuity even in tests intentionally designed for measurement of binary skills. In addition, the assumption of binary skills when continuity is present has been shown to potentially create a lack of invariance in item and latent ability parameters that may undermine applications. In…

Descriptors: Item Response Theory, Test Items, Skill Development, Robustness (Statistics)

A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning within a Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Educational and Psychological Measurement, 2023

The present paper introduces a general multidimensional model to measure individual differences in learning within a single administration of a test. Learning is assumed to result from practicing the operations involved in solving the items. The model accounts for the possibility that the ability to learn may manifest differently for correct and…

Descriptors: Bayesian Statistics, Learning Processes, Test Items, Item Analysis

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

Direct link

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

Fixed Effects or Mixed Effects Classifiers? Evidence from Simulated and Archival Data

Peer reviewed

Direct link

Mangino, Anthony A.; Bolin, Jocelyn H.; Finch, W. Holmes – Educational and Psychological Measurement, 2023

This study seeks to compare fixed and mixed effects models for the purposes of predictive classification in the presence of multilevel data. The first part of the study utilizes a Monte Carlo simulation to compare fixed and mixed effects logistic regression and random forests. An applied examination of the prediction of student retention in the…

Descriptors: Prediction, Classification, Monte Carlo Methods, Foreign Countries

On the Importance of Coefficient Alpha for Measurement Research: Loading Equality Is Not Necessary for Alpha's Utility as a Scale Reliability Index

Peer reviewed

Direct link

Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023

The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…

Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques

Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics

Peer reviewed

Direct link

Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023

When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…

Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation

Comparing the Psychometric Properties of a Scale across Three Likert and Three Alternative Formats: An Application to the Rosenberg Self-Esteem Scale

Peer reviewed

Direct link

Zhang, Xijuan; Zhou, Linnan; Savalei, Victoria – Educational and Psychological Measurement, 2023

Zhang and Savalei proposed an alternative scale format to the Likert format, called the Expanded format. In this format, response options are presented in complete sentences, which can reduce acquiescence bias and method effects. The goal of the current study was to compare the psychometric properties of the Rosenberg Self-Esteem Scale (RSES) in…

Descriptors: Psychometrics, Self Concept Measures, Self Esteem, Comparative Analysis

The NEAT Equating via Chaining Random Forests in the Context of Small Sample Sizes: A Machine-Learning Method

Peer reviewed

Direct link

Jiang, Zhehan; Han, Yuting; Xu, Lingling; Shi, Dexin; Liu, Ren; Ouyang, Jinying; Cai, Fen – Educational and Psychological Measurement, 2023

The part of responses that is absent in the nonequivalent groups with anchor test (NEAT) design can be managed to a planned missing scenario. In the context of small sample sizes, we present a machine learning (ML)-based imputation technique called chaining random forests (CRF) to perform equating tasks within the NEAT design. Specifically, seven…

Descriptors: Test Items, Equated Scores, Sample Size, Artificial Intelligence

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 260

Privacy | Copyright | Contact Us | Selection Policy | API

SAT (College Admission Test)	48
Wechsler Intelligence Scale…	38
Graduate Record Examinations	31
ACT Assessment	23
Dimensions of Self Concept	23
Minnesota Multiphasic…	21
Wechsler Adult Intelligence…	20
Raven Progressive Matrices	17
Myers Briggs Type Indicator	16
Program for International…	15
Stanford Achievement Tests	15
California Psychological…	14
Comprehensive Tests of Basic…	14
Coopersmith Self Esteem…	13
Iowa Tests of Basic Skills	13
Personal Orientation Inventory	13
Slosson Intelligence Test	13
Maslach Burnout Inventory	12
Metropolitan Readiness Tests	12
Piers Harris Childrens Self…	12
Rotter Internal External…	12
Sixteen Personality Factor…	11
California Achievement Tests	10
General Aptitude Test Battery	10
Adjective Check List	9
More ▼