ERIC - Search Results

Publication Date

In 2024	6
Since 2023	28
Since 2020 (last 5 years)	95
Since 2015 (last 10 years)	211
Since 2005 (last 20 years)	394

Descriptor

Test Items	150
Item Response Theory	112
Scores	92
Comparative Analysis	72
Mathematics Tests	62
Foreign Countries	58
Evaluation Methods	53
Test Bias	53
Test Construction	53
Models	47
Validity	44
Computer Assisted Testing	43
Scoring	42
Statistical Analysis	41
Computation	40
Error of Measurement	39
Simulation	39
Student Evaluation	38
Achievement Tests	37
Measurement	37
Difficulty Level	36
Multiple Choice Tests	35
Reading Tests	35
Test Validity	35
Equated Scores	34

Source

Applied Measurement in…	464

Publication Type

Journal Articles	464
Reports - Research	323
Reports - Evaluative	94
Reports - Descriptive	41
Information Analyses	12
Tests/Questionnaires	7
Opinion Papers	4
Speeches/Meeting Papers	4
Reports - General	1

Education Level

Secondary Education	60
Higher Education	53
Elementary Secondary Education	47
Elementary Education	45
Postsecondary Education	39
Middle Schools	35
High Schools	33
Grade 8	31
Junior High Schools	26
Grade 5	24
Grade 4	22
Grade 3	20
Grade 7	17
Intermediate Grades	17
Grade 6	15
Early Childhood Education	12
Primary Education	11
Grade 10	8
Grade 11	6
Grade 9	6
Grade 12	5
Grade 2	5
Grade 1	3
Kindergarten	2
Preschool Education	2

Audience

Researchers	3
Practitioners	2
Teachers	2
Administrators	1

Location

Canada	11
California	7
Netherlands	6
Australia	5
Germany	5
Israel	5
Massachusetts	5
New York	5
North Carolina	5
United States	5
Arizona	4
Florida	4
South Carolina	4
Texas	4
Virginia	4
Hawaii	3
United Kingdom	3
Belgium	2
Colorado	2
Europe	2
Finland	2
Georgia	2
Indiana	2
Iran	2
Michigan	2

Laws, Policies, & Programs

No Child Left Behind Act 2001	8
Every Student Succeeds Act…	1
Race to the Top	1

What Works Clearinghouse Rating

Since 2001 X

Showing 1 to 15 of 464 results

Computer-Based Listening Test with Full Video, Visual-Limited Video, and Audio: A Comparative Analysis Based on Difficulty, Discrimination Power, and Response Time

Peer reviewed

Takahiro Terao – Applied Measurement in Education, 2024

This study aimed to compare item characteristics and response time between stimulus conditions in computer-delivered listening tests. Listening materials had three variants: regular videos, frame-by-frame videos, and only audios without visuals. Participants were 228 Japanese high school students who were requested to complete one of nine…

Descriptors: Computer Assisted Testing, Audiovisual Aids, Reaction Time, High School Students

Modeling Dimensions Converging at the Upper Anchor in Learning Progressions: An Example of Micro-Evolution

Peer reviewed

Mingfeng Xue; Mark Wilson – Applied Measurement in Education, 2024

Multidimensionality is common in psychological and educational measurements. This study focuses on dimensions that converge at the upper anchor (i.e. the highest acquisition status defined in a learning progression) and compares different ways of dealing with them using the multidimensional random coefficients multinomial logit model and scale…

Descriptors: Learning Trajectories, Educational Assessment, Item Response Theory, Evolution

Don't Test after Lunch: The Relationship between Disengagement and the Time of Day That Low-Stakes Testing Occurs

Peer reviewed

Steven L. Wise; Megan R. Kuhfeld; Marlit Annalena Lindner – Applied Measurement in Education, 2024

When student achievement is assessed, we seek to elicit a student's maximum performance -- a goal requiring the assumption that the student is fully engaged. Otherwise, to the extent that disengagement occurs, test performance is likely to suffer. Effectively managing test-taking disengagement requires an understanding of the testing conditions…

Descriptors: Testing, Attention Span, Learner Engagement, Time Factors (Learning)

Comparing Examinee-Based and Response-Based Motivation Filtering Methods in Remote Low-Stakes Testing

Peer reviewed

Sarah Alahmadi; Christine E. DeMars – Applied Measurement in Education, 2024

Large-scale educational assessments are sometimes considered low-stakes, increasing the possibility of confounding true performance level with low motivation. These concerns are amplified in remote testing conditions. To remove the effects of low effort levels in responses observed in remote low-stakes testing, several motivation filtering methods…

Descriptors: Multiple Choice Tests, Item Response Theory, College Students, Scores

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

Are Online and Paper Tests Comparable? Evidence from Statewide K-12 Tests

Peer reviewed

Ben Backes; James Cowan – Applied Measurement in Education, 2024

We investigate two research questions using a recent statewide transition from paper to computer-based testing: first, the extent to which test mode effects found in prior studies can be eliminated; and second, the degree to which online and paper assessments offer different information about underlying student ability. We first find very small…

Descriptors: Computer Assisted Testing, Test Format, Differences, Academic Achievement

Using Bayesian Networks for Cognitive Assessment of Student Understanding of Buoyancy: A Granular Hierarchy Model

Peer reviewed

Wang, Ling Ling; Jian, Sun Xiao; Liu, Yan Lou; Xin, Tao – Applied Measurement in Education, 2023

Cognitive diagnostic assessment based on Bayesian networks (BN) is developed in this paper to evaluate student understanding of the physical concept of buoyancy. We propose a three-order granular-hierarchy BN model which accounts for both fine-grained attributes and high-level proficiencies. Conditional independence in the BN structure is tested…

Descriptors: Bayesian Statistics, Networks, Cognitive Measurement, Diagnostic Tests

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Accuracy and Sensitivity of Coefficient Alpha and Its Alternatives with Unidimensional and Contaminated Scales

Peer reviewed

Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023

We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…

Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length

Are Large Admissions Test Coaching Effects Widespread? A Longitudinal Analysis of Admissions Test Scores

Peer reviewed

Dahlke, Jeffrey A.; Sackett, Paul R.; Kuncel, Nathan R. – Applied Measurement in Education, 2023

We examine longitudinal data from 120,384 students who took a version of the PSAT/SAT in the 9th, 10th, 11th, and 12th grades. We investigate score changes over time and show that socioeconomic status (SES) is related to the degree of score improvement. We note that the 9th and 10th grade PSAT are low-stakes tests, while the operational SAT is a…

Descriptors: Scores, College Entrance Examinations, Socioeconomic Status, Test Preparation

Dissecting Knowledge, Guessing, and Blunder in Multiple Choice Assessments

Peer reviewed

Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023

Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…

Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models

A Census-Level, Multi-Grade Analysis of the Association between Testing Time, Breaks, and Achievement

Peer reviewed

Rutkowski, David; Rutkowski, Leslie; Valdivia, Dubravka Svetina; Canbolat, Yusuf; Underhill, Stephanie – Applied Measurement in Education, 2023

Several states in the US have removed time limits on their state assessments. In Indiana, where this study takes place, the state assessment is both untimed during the testing window and allows unlimited breaks during the testing session. Using grade 3 and 8 math and English state assessment data, in this paper we focus on time used for testing…

Descriptors: Testing, Time, Intervals, Academic Achievement

Keeping up the Pace: Evaluating Grade 8 Student Achievement Outcomes for New Hampshire's Innovative Assessment System

Peer reviewed

Perez, Alexandra Lane; Evans, Carla – Applied Measurement in Education, 2023

New Hampshire's Performance Assessment of Competency Education (PACE) innovative assessment system uses student scores from classroom performance assessments as well as other classroom tests for school accountability purposes. One concern is that not having annual state testing may incentivize schools and teachers away from teaching the breadth of…

Descriptors: Grade 8, Competency Based Education, Evaluation Methods, Educational Innovation

Measurement Invariance in Relation to First Language: An Evaluation of German Reading and Spelling Tests

Peer reviewed

Visser, Linda; Cartschau, Friederike; von Goldammer, Ariane; Brandenburg, Janin; Timmerman, Marieke; Hasselhorn, Marcus; Mähler, Claudia – Applied Measurement in Education, 2023

The growing number of children in primary schools in Germany who have German as their second language (L2) has raised questions about the fairness of performance assessment. Fair tests are a prerequisite for distinguishing between L2 learning delay and a specific learning disability. We evaluated five commonly used reading and spelling tests for…

Descriptors: Foreign Countries, Error of Measurement, Second Language Learning, German

Tracking Ordinal Development of Skills with a Longitudinal DINA Model with Polytomous Attributes

Peer reviewed

Zhan, Peida; Liu, Yaohui; Yu, Zhaohui; Pan, Yanfang – Applied Measurement in Education, 2023

Many educational and psychological studies have shown that the development of students is generally step-by-step (i.e. ordinal development) to a specific level. This study proposed a novel longitudinal learning diagnosis model with polytomous attributes to track students' ordinal development in learning. Using the concept of polytomous attributes…

Descriptors: Skill Development, Cognitive Measurement, Models, Educational Diagnosis

Author

Wise, Steven L.	12
Ercikan, Kadriye	9
Hambleton, Ronald K.	9
Sireci, Stephen G.	9
Lee, Won-Chan	8
Wells, Craig S.	8
Yin, Yue	7
Abedi, Jamal	6
Brandon, Paul R.	6
Bridgeman, Brent	6
Furtak, Erin Marie	6
Huynh, Huynh	6
Penfield, Randall D.	6
Sackett, Paul R.	6
Shavelson, Richard J.	6
Ayala, Carlos C.	5
Finch, Holmes	5
Kolen, Michael J.	5
Plake, Barbara S.	5
Pomplun, Mark	5
Ruiz-Primo, Maria Araceli	5
Wyse, Adam E.	5
Bolt, Daniel M.	4
Buckendahl, Chad W.	4
Carney, Michele	4

Assessments and Surveys

Program for International…	18
SAT (College Admission Test)	10
Trends in International…	9
Graduate Record Examinations	7
National Assessment of…	7
ACT Assessment	3
Iowa Tests of Basic Skills	3
Measures of Academic Progress	3
Test of English as a Foreign…	3
Progress in International…	2
United States Medical…	2
Wechsler Intelligence Scale…	2
Advanced Placement…	1
Bar Examinations	1
Big Five Inventory	1
California Achievement Tests	1
College Level Examination…	1
Early Childhood Longitudinal…	1
Florida Comprehensive…	1
Georgia Criterion Referenced…	1
International Adult Literacy…	1
Iowa Tests of Educational…	1
Major Field Achievement Test…	1
Massachusetts Comprehensive…	1
Medical College Admission Test	1