ERIC - Search Results

Publication Date

In 2024	5
Since 2023	26
Since 2020 (last 5 years)	89
Since 2015 (last 10 years)	160

Descriptor

Test Items	39
Scores	35
Computer Assisted Testing	23
Item Response Theory	22
Evaluation Methods	19
Models	18
Academic Achievement	17
Comparative Analysis	15
Correlation	15
Foreign Countries	15
Test Construction	15
Accuracy	14
Cutting Scores	13
Test Validity	13
Decision Making	12
Longitudinal Studies	12
Measurement	12
Grade Point Average	11
Mathematics Tests	11
Simulation	11
Achievement Tests	10
College Entrance Examinations	10
College Students	10
Difficulty Level	10
Gender Differences	10
More ▼

Source

Educational Measurement:…

160

Publication Type

Journal Articles	160
Reports - Research	160
Information Analyses	2
Tests/Questionnaires	1

Education Level

Higher Education	28
Secondary Education	26
Postsecondary Education	25
High Schools	12
Middle Schools	11
Elementary Education	10
Junior High Schools	9
Early Childhood Education	6
Elementary Secondary Education	6
Intermediate Grades	4
Grade 4	3
Primary Education	3
Adult Education	2
Grade 3	2
Grade 5	2
Grade 8	2
Grade 9	2
High School Equivalency…	2
Grade 1	1
Grade 10	1
Grade 6	1
Grade 7	1
Preschool Education	1
More ▼

Audience

Location

Canada	2
Greece	2
Florida	1
Germany	1
Hong Kong	1
Idaho	1
Massachusetts	1
Netherlands	1
New Hampshire	1
Saudi Arabia	1
United Kingdom (England)	1
Wisconsin	1
More ▼

Laws, Policies, & Programs

Every Student Succeeds Act…	2
No Child Left Behind Act 2001	1

Assessments and Surveys

SAT (College Admission Test)	7
Program for International…	5
ACT Assessment	2
High School Longitudinal…	1
Program for the International…	1
Progress in International…	1
Teaching and Learning…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 160 results Save | Export

Measuring Variability in Proctor Decision Making on High-Stakes Assessments: Improving Test Security in the Digital Age

Peer reviewed

Direct link

William Belzak; J. R. Lockwood; Yigal Attali – Educational Measurement: Issues and Practice, 2024

Remote proctoring, or monitoring test takers through internet-based, video-recording software, has become critical for maintaining test security on high-stakes assessments. The main role of remote proctors is to make judgments about test takers' behaviors and decide whether these behaviors constitute rule violations. Variability in proctor…

Descriptors: Computer Security, High Stakes Tests, English (Second Language), Second Language Learning

An Automated Item Pool Assembly Framework for Maximizing Item Utilization for CAT

Peer reviewed

Direct link

Hwanggyu Lim; Kyung T. Han – Educational Measurement: Issues and Practice, 2024

Computerized adaptive testing (CAT) has gained deserved popularity in the administration of educational and professional assessments, but continues to face test security challenges. To ensure sustained quality assurance and testing integrity, it is imperative to establish and maintain multiple stable item pools that are consistent in terms of…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

Using OpenAI GPT to Generate Reading Comprehension Items

Peer reviewed

Direct link

Ayfer Sayin; Mark Gierl – Educational Measurement: Issues and Practice, 2024

The purpose of this study is to introduce and evaluate a method for generating reading comprehension items using template-based automatic item generation. To begin, we describe a new model for generating reading comprehension items called the text analysis cognitive model assessing inferential skills across different reading passages. Next, the…

Descriptors: Algorithms, Reading Comprehension, Item Analysis, Man Machine Systems

Knowledge Integration in Science Learning: Tracking Students' Knowledge Development and Skill Acquisition with Cognitive Diagnosis Models

Peer reviewed

Direct link

Xin Xu; Shixiu Ren; Danhui Zhang; Tao Xin – Educational Measurement: Issues and Practice, 2024

In scientific literacy, knowledge integration (KI) is a scaffolding-based theory to assist students' scientific inquiry learning. To drive students to be self-directed, many courses have been developed based on KI framework. However, few efforts have been made to evaluate the outcome of students' learning under KI instruction. Moreover,…

Descriptors: Science Education, Knowledge Level, Learning, Students

Achievement and Growth on English Language Proficiency and Content Assessments for English Learners in Elementary Grades

Peer reviewed

Direct link

Heather M. Buzick; Mikyung Kim Wolf; Laura Ballard – Educational Measurement: Issues and Practice, 2024

English language proficiency (ELP) assessment scores are used by states to make high-stakes decisions related to linguistic support in instruction and assessment for English learner (EL) students and for EL student reclassification. Changes to both academic content standards and ELP academic standards within the last decade have resulted in…

Descriptors: English Language Learners, Elementary School Students, English (Second Language), Language Proficiency

Using Active Learning Methods to Strategically Select Essays for Automated Scoring

Peer reviewed

Direct link

Firoozi, Tahereh; Mohammadi, Hamid; Gierl, Mark J. – Educational Measurement: Issues and Practice, 2023

Research on Automated Essay Scoring has become increasing important because it serves as a method for evaluating students' written responses at scale. Scalable methods for scoring written responses are needed as students migrate to online learning environments resulting in the need to evaluate large numbers of written-response assessments. The…

Descriptors: Active Learning, Automation, Scoring, Essays

Machine Learning-Based Profiling in Test Cheating Detection

Peer reviewed

Direct link

Meng, Huijuan; Ma, Ye – Educational Measurement: Issues and Practice, 2023

In recent years, machine learning (ML) techniques have received more attention in detecting aberrant test-taking behaviors due to advantages when compared to traditional data forensics methods. However, defining "True Test Cheaters" is challenging--different than other fraud detection tasks such as flagging forged bank checks or credit…

Descriptors: Artificial Intelligence, Cheating, Testing, Information Technology

A Machine Learning Approach for the Simultaneous Detection of Preknowledge in Examinees and Items When Both Are Unknown

Peer reviewed

Direct link

Pan, Yiqin; Wollack, James A. – Educational Measurement: Issues and Practice, 2023

Pan and Wollack (PW) proposed a machine learning method to detect compromised items. We extend the work of PW to an approach detecting compromised items and examinees with item preknowledge simultaneously and draw on ideas in ensemble learning to relax several limitations in the work of PW. The suggested approach also provides a confidence score,…

Descriptors: Artificial Intelligence, Prior Learning, Item Analysis, Test Content

Digital Module 31: Testing Accommodations for Students with Disabilities

Peer reviewed

Direct link

Lovett, Benjamin J. – Educational Measurement: Issues and Practice, 2023

Students with disabilities often take tests under different conditions than their peers do. Testing accommodations, which involve changes to test administration that maintain test content, include extending time limits, presenting written text through auditory means, and taking a test in a private room with fewer distractions. For some students…

Descriptors: Students with Disabilities, Testing Accommodations, Psychometrics, Student Needs

Do Subject Matter Experts' Judgments of Multiple-Choice Format Suitability Predict Item Quality?

Peer reviewed

Direct link

Berenbon, Rebecca F.; McHugh, Bridget C. – Educational Measurement: Issues and Practice, 2023

To assemble a high-quality test, psychometricians rely on subject matter experts (SMEs) to write high-quality items. However, SMEs are not typically given the opportunity to provide input on which content standards are most suitable for multiple-choice questions (MCQs). In the present study, we explored the relationship between perceived MCQ…

Descriptors: Test Items, Multiple Choice Tests, Standards, Difficulty Level

Hierarchical Agglomerative Clustering to Detect Test Collusion on Computer-Based Tests

Peer reviewed

Direct link

Ingrisone, Soo Jeong; Ingrisone, James N. – Educational Measurement: Issues and Practice, 2023

There has been a growing interest in approaches based on machine learning (ML) for detecting test collusion as an alternative to the traditional methods. Clustering analysis under an unsupervised learning technique appears especially promising to detect group collusion. In this study, the effectiveness of hierarchical agglomerative clustering…

Descriptors: Identification, Cooperation, Computer Assisted Testing, Artificial Intelligence

Applying a Mixture Rasch Model-Based Approach to Standard Setting

Peer reviewed

Direct link

Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023

The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…

Descriptors: Item Response Theory, Standard Setting, Testing, Sampling

Defining Test-Score Interpretation, Use, and Claims: Delphi Study for the Validity Argument

Peer reviewed

Direct link

Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023

Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…

Descriptors: Test Interpretation, Scores, Test Use, Test Validity

A Probabilistic Filtering Approach to Non-Effortful Responding

Peer reviewed

Direct link

Ulitzsch, Esther; Domingue, Benjamin W.; Kapoor, Radhika; Kanopka, Klint; Rios, Joseph A. – Educational Measurement: Issues and Practice, 2023

Common response-time-based approaches for non-effortful response behavior (NRB) in educational achievement tests filter responses that are associated with response times below some threshold. These approaches are, however, limited in that they require a binary decision on whether a response is classified as stemming from NRB; thus ignoring…

Descriptors: Reaction Time, Responses, Behavior, Achievement Tests

The Role of Response Style Adjustments in Cross-Country Comparisons--A Case Study Using Data from the PISA 2015 Questionnaire

Peer reviewed

Direct link

Ulitzsch, Esther; Lüdtke, Oliver; Robitzsch, Alexander – Educational Measurement: Issues and Practice, 2023

Country differences in response styles (RS) may jeopardize cross-country comparability of Likert-type scales. When adjusting for rather than investigating RS is the primary goal, it seems advantageous to impose minimal assumptions on RS structures and leverage information from multiple scales for RS measurement. Using PISA 2015 background…

Descriptors: Response Style (Tests), Comparative Analysis, Achievement Tests, Foreign Countries

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11

Privacy | Copyright | Contact Us | Selection Policy | API

Kuncel, Nathan R.	9
Sackett, Paul R.	8
Wyse, Adam E.	7
Rios, Joseph A.	6
Sinharay, Sandip	4
Babcock, Ben	3
Feinberg, Richard A.	3
Keehner, Madeleine	3
Kostal, Jack W.	3
McCaffrey, Daniel F.	3
Wind, Stefanie A.	3
Almusharraf, Norah	2
Arslan, Burcu	2
Beatty, Adam S.	2
Bennett, Randy E.	2
Castellano, Katherine E.	2
Circi, Ruhan	2
Clauser, Brian E.	2
Deane, Paul	2
Domingue, Benjamin W.	2
Frey, Andreas	2
Gierl, Mark J.	2
Grammatikopoulos, Vasilis	2
Gregoriadis, Athanasios	2
Guo, Hongwen	2
More ▼