Showing 1 to 15 of 24 results
Peer reviewed
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We compare the classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
Peer reviewed
Kim, Nana; Bolt, Daniel M. – Educational and Psychological Measurement, 2021
This paper presents a mixture item response tree (IRTree) model for extreme response style. Unlike traditional applications of single IRTree models, a mixture approach provides a way of representing the mixture of respondents following different underlying response processes (between individuals), as well as the uncertainty present at the…
Descriptors: Item Response Theory, Response Style (Tests), Models, Test Items
Peer reviewed
Wyse, Adam E. – Educational and Psychological Measurement, 2021
An essential question when computing test-retest and alternate forms reliability coefficients is how many days there should be between tests. This article uses data from reading and math computerized adaptive tests to explore how the number of days between tests impacts alternate forms reliability coefficients. Results suggest that the highest…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Reliability, Reading Tests
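The alternate-forms reliability coefficient the abstract refers to is, at its core, a correlation between scores from two administrations. A minimal sketch with simulated data (the score scale, sample size, and noise model here are invented for illustration; the article analyzes real computerized adaptive test data):

```python
import random
import statistics

def pearson_r(x, y):
    """Pearson correlation, used here as an alternate-forms reliability coefficient."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

# Simulated scores: form B tracks form A, plus noise that grows with the
# number of days between the two test administrations.
random.seed(1)
true_scores = [random.gauss(200, 15) for _ in range(500)]
reliability = {}
for days, noise_sd in [(7, 5), (60, 12)]:
    form_a = [t + random.gauss(0, 5) for t in true_scores]
    form_b = [t + random.gauss(0, noise_sd) for t in true_scores]
    reliability[days] = pearson_r(form_a, form_b)
print(reliability)
```

In this toy setup the coefficient drops as the interval grows because more score noise accumulates between administrations; the article investigates the actual pattern empirically.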
Peer reviewed
McGrath, Kathleen V.; Leighton, Elizabeth A.; Ene, Mihaela; DiStefano, Christine; Monrad, Diane M. – Educational and Psychological Measurement, 2020
Survey research frequently involves the collection of data from multiple informants. Results, however, are usually analyzed by informant group, potentially ignoring important relationships across groups. When the same construct(s) are measured, integrative data analysis (IDA) allows pooling of data from multiple sources into one data set to…
Descriptors: Educational Environment, Meta Analysis, Student Attitudes, Teacher Attitudes
Peer reviewed
Marland, Joshua; Harrick, Matthew; Sireci, Stephen G. – Educational and Psychological Measurement, 2020
Student assessment nonparticipation (or opt out) has increased substantially in K-12 schools in states across the country. This increase in opt out has the potential to impact achievement and growth (or value-added) measures used for educator and institutional accountability. In this simulation study, we investigated the extent to which…
Descriptors: Value Added Models, Teacher Effectiveness, Teacher Evaluation, Elementary Secondary Education
Peer reviewed
Briggs, Derek C.; Alzen, Jessica L. – Educational and Psychological Measurement, 2019
Observation protocol scores are commonly used as status measures to support inferences about teacher practices. When multiple observations are collected for the same teacher over the course of a year, some portion of a teacher's score on each occasion may be attributable to the rater, lesson, and the time of year of the observation. All three of…
Descriptors: Observation, Inferences, Generalizability Theory, Scores
Konstantopoulos, Spyros; Li, Wei; Miller, Shazia; van der Ploeg, Arie – Educational and Psychological Measurement, 2019
This study discusses quantile regression methodology and its usefulness in education and social science research. First, quantile regression is defined and its advantages vis-à-vis ordinary least squares regression are illustrated. Second, specific comparisons are made between ordinary least squares and quantile regression methods. Third, the…
Descriptors: Regression (Statistics), Statistical Analysis, Educational Research, Social Science Research
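The contrast this abstract draws can be illustrated through the loss functions the two methods minimize: ordinary least squares minimizes squared error, whose constant minimizer is the mean, while quantile regression minimizes the asymmetric check (pinball) loss, whose constant minimizer at τ = 0.5 is the median. A minimal sketch with made-up scores (not data from the article):

```python
import statistics

def pinball_loss(residual, tau):
    """Check (pinball) loss: asymmetric absolute error minimized by the tau-quantile."""
    return tau * residual if residual >= 0 else (tau - 1) * residual

def mean_pinball(data, c, tau):
    """Average pinball loss of fitting the constant c at quantile level tau."""
    return statistics.fmean(pinball_loss(y - c, tau) for y in data)

# A right-skewed outcome: the mean (what an OLS intercept-only fit returns)
# and the median (what tau = 0.5 quantile regression returns) differ.
scores = [40, 45, 50, 52, 55, 58, 60, 95, 98, 100]
mean_c = statistics.fmean(scores)      # 65.3
median_c = statistics.median(scores)   # 56.5
print(mean_c, median_c)
print(mean_pinball(scores, median_c, 0.5), mean_pinball(scores, mean_c, 0.5))
```

The median attains a strictly lower median-level pinball loss than the mean here, which is the sense in which quantile regression targets the conditional quantile rather than the conditional mean.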
Peer reviewed
Hauser, Carl; Thum, Yeow Meng; He, Wei; Ma, Lingling – Educational and Psychological Measurement, 2015
When conducting item reviews, analysts evaluate an array of statistical and graphical information to assess the fit of a field test (FT) item to an item response theory model. The process can be tedious, particularly when the number of human reviews (HR) to be completed is large. Furthermore, such a process leads to decisions that are susceptible…
Descriptors: Test Items, Item Response Theory, Research Methodology, Decision Making
Peer reviewed
Wyse, Adam E.; Bunch, Michael B.; Deville, Craig; Viger, Steven G. – Educational and Psychological Measurement, 2014
This article describes a novel variation of the Body of Work method that uses construct maps to overcome problems of transparency, rater inconsistency, and scores gaps commonly occurring with the Body of Work method. The Body of Work method with construct maps was implemented to set cut-scores for two separate K-12 assessment programs in a large…
Descriptors: Standard Setting (Scoring), Educational Assessment, Elementary Secondary Education, Measurement
Peer reviewed
Ye, Meng; Xin, Tao – Educational and Psychological Measurement, 2014
The authors explored the effects of drifting common items on vertical scaling within the higher order framework of item parameter drift (IPD). The results showed that if IPD occurred between a pair of test levels, the scaling performance started to deviate from the ideal state, as indicated by bias of scaling. When there were two items drifting…
Descriptors: Scaling, Test Items, Equated Scores, Achievement Gains
Peer reviewed
Timmermans, Anneke C.; Snijders, Tom A. B.; Bosker, Roel J. – Educational and Psychological Measurement, 2013
In traditional studies on value-added indicators of educational effectiveness, students are usually treated as belonging to those schools where they took their final examination. However, in practice, students sometimes attend multiple schools and therefore it is questionable whether this assumption of belonging to the last school they attended…
Descriptors: School Effectiveness, Student Mobility, Elementary Schools, Secondary Schools
Peer reviewed
Schroeders, Ulrich; Wilhelm, Oliver – Educational and Psychological Measurement, 2011
Whether an ability test delivered on either paper or computer provides the same information is an important question in applied psychometrics. Besides the validity, it is also the fairness of a measure that is at stake if the test medium affects performance. This study provides a comprehensive review of existing equivalence research in the field…
Descriptors: Reading Comprehension, Listening Comprehension, English (Second Language), Language Tests
Peer reviewed
Stone, Gregory Ethan; Koskey, Kristin L. K.; Sondergeld, Toni A. – Educational and Psychological Measurement, 2011
Typical validation studies on standard setting models, most notably the Angoff and modified Angoff models, have ignored construct development, a critical aspect associated with all conceptualizations of measurement processes. Stone compared the Angoff and objective standard setting (OSS) models and found that Angoff failed to define a legitimate…
Descriptors: Cutting Scores, Standard Setting (Scoring), Models, Construct Validity
Peer reviewed
Furgol, Katherine E.; Ho, Andrew D.; Zimmerman, Dale L. – Educational and Psychological Measurement, 2010
Under the No Child Left Behind Act, large-scale test score trend analyses are widespread. These analyses often gloss over interesting changes in test score distributions and involve unrealistic assumptions. Further complications arise from analyses of unanchored, censored assessment data, or proportions of students lying within performance levels…
Descriptors: Trend Analysis, Sample Size, Federal Legislation, Simulation
Peer reviewed
Humphry, Stephen M. – Educational and Psychological Measurement, 2010
Discrimination has traditionally been parameterized for items but not other empirical factors. Consequently, if person factors affect discrimination they cause misfit. However, by explicitly formulating the relationship between discrimination and the unit of a metric, it is possible to parameterize discrimination for person groups. This article…
Descriptors: Discriminant Analysis, Models, Simulation, Reading Tests
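A hedged sketch of the general idea, not this article's actual formulation: in a 2PL-style item response function, the discrimination parameter can be indexed by person group rather than only by item, so two groups answering the same item can have response curves of different steepness. All numbers below are invented for illustration:

```python
import math

def p_correct(theta, b, a_group):
    """2PL-style item response function with a group-specific discrimination a_group."""
    return 1.0 / (1.0 + math.exp(-a_group * (theta - b)))

# Two hypothetical person groups answering the same item (difficulty b = 0):
# the higher-discrimination group has a steeper curve around theta = b.
for theta in (-1.0, 0.0, 1.0):
    print(theta,
          round(p_correct(theta, 0.0, 0.8), 3),   # group 1, a = 0.8
          round(p_correct(theta, 0.0, 1.6), 3))   # group 2, a = 1.6
```

If a single common discrimination were forced on both groups, the steeper group's responses would register as misfit, which is the problem the abstract describes.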