ERIC - Search Results

Publication Date

In 2024	0
Since 2023	0
Since 2020 (last 5 years)	4
Since 2015 (last 10 years)	7
Since 2005 (last 20 years)	11

Descriptor

Test Construction	30
Test Validity	30
Test Items	9
Testing Problems	9
Elementary Secondary Education	7
Test Use	7
Computer Assisted Testing	6
Standards	6
Test Reliability	5
Item Analysis	4
Psychometrics	4
Test Interpretation	4
Testing Programs	4
Accountability	3
Achievement Tests	3
Criterion Referenced Tests	3
Culture Fair Tests	3
Educational Assessment	3
Program Evaluation	3
School Districts	3
Scores	3
State Programs	3
Test Bias	3
Test Content	3
College Entrance Examinations	2
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	30
Reports - Research	10
Opinion Papers	9
Reports - Descriptive	7
Reports - Evaluative	7
Information Analyses	5
Speeches/Meeting Papers	1

Education Level

Grade 9	1
High Schools	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Israel	1
Texas	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1
Teacher Performance…	1
Watson Glaser Critical…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 30 results Save | Export

Supporting the Interpretive Validity of Student-Level Claims in Science Assessment with Tiered Claim Structures

Peer reviewed

Direct link

Student, Sanford R.; Gong, Brian – Educational Measurement: Issues and Practice, 2022

We address two persistent challenges in large-scale assessments of the Next Generation Science Standards: (a) the validity of score interpretations that target the standards broadly and (b) how to structure claims for assessments of this complex domain. The NGSS pose a particular challenge for specifying claims about students that evidence from…

Descriptors: Science Tests, Test Validity, Test Items, Test Construction

The Effect of Drag-and-Drop Item Features on Test-Taker Performance and Response Strategies

Peer reviewed

Direct link

Arslan, Burcu; Jiang, Yang; Keehner, Madeleine; Gong, Tao; Katz, Irvin R.; Yan, Fred – Educational Measurement: Issues and Practice, 2020

Computer-based educational assessments often include items that involve drag-and-drop responses. There are different ways that drag-and-drop items can be laid out and different choices that test developers can make when designing these items. Currently, these decisions are based on experts' professional judgments and design constraints, rather…

Descriptors: Test Items, Computer Assisted Testing, Test Format, Decision Making

Using the "Joint Standards" to Design Postsecondary Assessments with Evidence of Validity and Reliability: An Approach to CAEP Accreditation

Peer reviewed

Direct link

Wilkerson, Judy R. – Educational Measurement: Issues and Practice, 2020

Validity and reliability are a major focus in teacher education accreditation by the Council for Accreditation of Educator Preparation (CAEP). CAEP requires the use of "accepted research standards," but many faculty and administrators are unsure how to meet this requirement. The Standards of Educational and Psychological Testing…

Descriptors: Test Construction, Test Validity, Test Reliability, Teacher Education Programs

Examining Effectiveness and Validity of Accommodations for English Language Learners in Mathematics: An Evidence-Based Computer Accommodation Decision System

Peer reviewed

Direct link

Abedi, Jamal; Zhang, Yu; Rowe, Susan E.; Lee, Hansol – Educational Measurement: Issues and Practice, 2020

Research indicates that the performance-gap between English Language Learners (ELLs) and their non-ELL peers is partly due to ELLs' difficulty in understanding assessment language. Accommodations have been shown to narrow this performance-gap, but many accommodations studies have not used a randomized design and are based on relatively small…

Descriptors: English Language Learners, Achievement Gap, Mathematics Tests, Standards

Digital Module 09: Sociocognitive Assessment for Diverse Populations

Peer reviewed

Direct link

Mislevy, Robert J.; Oliveri, Maria Elena – Educational Measurement: Issues and Practice, 2019

In this digital ITEMS module, Dr. Robert [Bob] Mislevy and Dr. Maria Elena Oliveri introduce and illustrate a sociocognitive perspective on educational measurement, which focuses on a variety of design and implementation considerations for creating fair and valid assessments for learners from diverse populations with diverse sociocultural…

Descriptors: Educational Testing, Reliability, Test Validity, Test Reliability

Using Evidence-Centered Design to Create a Special Educator Observation System

Peer reviewed

Direct link

Johnson, Evelyn S.; Crawford, Angela; Moylan, Laura A.; Zheng, Yuzhu – Educational Measurement: Issues and Practice, 2018

The evidence-centered design framework was used to create a special education teacher observation system, Recognizing Effective Special Education Teachers. Extensive reviews of research informed the domain analysis and modeling stages, and led to the conceptual framework in which effective special education teaching is operationalized as the…

Descriptors: Evidence Based Practice, Special Education Teachers, Observation, Disabilities

A Process for Reviewing and Evaluating Generated Test Items

Peer reviewed

Direct link

Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2016

Testing organization needs large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…

Descriptors: Test Items, Test Construction, Psychometrics, Models

Consequences of Test Score Use as Validity Evidence: Roles and Responsibilities

Peer reviewed

Direct link

Nichols, Paul D.; Williams, Natasha – Educational Measurement: Issues and Practice, 2009

This article has three goals. The first goal is to clarify the role that the consequences of test score use play in validity judgments by reviewing the role that modern writers on validity have ascribed for consequences in supporting validity judgments. The second goal is to summarize current views on who is responsible for collecting evidence of…

Descriptors: Tests, Test Validity, Scores, Data Collection

Universal Design and Multimethod Approaches to Item Review

Peer reviewed

Direct link

Johnstone, Christopher J.; Thompson, Sandra J.; Bottsford-Miller, Nicole A.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2008

Test items undergo multiple iterations of review before states and vendors deem them acceptable to be placed in a live statewide assessment. This article reviews three approaches that can add validity evidence to states' item review processes. The first process is a structured sensitivity review process that focuses on universal design…

Descriptors: Test Items, Disabilities, Test Construction, Testing Programs

Test Design with Cognition in Mind

Peer reviewed

Direct link

Gorin, Joanna S. – Educational Measurement: Issues and Practice, 2006

One of the primary themes of the National Research Council's 2001 book "Knowing What Students Know" was the importance of cognition as a component of assessment design and measurement theory (NRC, 2001). One reaction to the book has been an increased use of sophisticated statistical methods to model cognitive information available in test data.…

Descriptors: Test Construction, Student Evaluation, Academic Ability, Evaluation Methods

Use of Knowledge, Skill, and Ability Statements in Developing Licensure and Certification Examinations

Peer reviewed

Direct link

Wang, Ning; Schnipke, Deborah; Witt, Elizabeth A. – Educational Measurement: Issues and Practice, 2005

The task inventory approach is commonly used in job analysis for establishing content validity evidence supporting the use and interpretation of licensure and certification examinations. Although the results of a task inventory survey provide job task-related information that can be used as a reliable and valid source for test development, it is…

Descriptors: Nursing, Test Construction, Job Skills, Knowledge Level

Implications of the Golden Rule Settlement for Test Construction.

Peer reviewed

Linn, Robert L.; Drasgow, Fritz – Educational Measurement: Issues and Practice, 1987

This article discusses the application of the Golden Rule procedure to items of the Scholastic Aptitude Test. Using item response theory, the analyses indicate that the Golden Rule procedures are ineffective in detecting biased items and may undermine the reliability and validity of tests. (Author/JAZ)

Descriptors: College Entrance Examinations, Difficulty Level, Item Analysis, Latent Trait Theory

The Golden Rule Settlement: A Minority Perspective.

Peer reviewed

Bond, Lloyd – Educational Measurement: Issues and Practice, 1987

This article suggests that mechanical application of Golden Rule-like procedures is inappropriate. The fundamental idea embodied in them, namely, that of taking issues of equity into account in test construction, may reasonably be done without doing violence to test validity. (JAZ)

Descriptors: Court Litigation, Item Analysis, Minority Groups, Standards

Test-Wiseness for Teachers and Students.

Peer reviewed

Carter, Kathy – Educational Measurement: Issues and Practice, 1986

This article discusses the validity issue in teacher-made tests. Seventh-grade students' comments about their responses to a test designed to illustrate faulty items suggests students are quite proficient in using secondary clues to figure out correct answers. Teacher comments suggest teachers are unaware they provide such clues. (Author/JAZ)

Descriptors: Cues, Grade 7, Item Analysis, Junior High Schools

Coming of Age: Systematic Performance Evaluation.

Peer reviewed

Capie, William – Educational Measurement: Issues and Practice, 1985

State legislative action to control and reward teaching quality has resulted in the coming of age of systematic teacher performance evaluation for certification, merit pay, or career ladder advancement decisions. This article overviews issues of credibility and practicality--including test design, validity, and reliability--which arise when…

Descriptors: Accountability, Elementary Secondary Education, Interaction Process Analysis, Job Performance

Previous Page | Next Page »

Pages: 1 | 2

Privacy | Copyright | Contact Us | Selection Policy | API

Abedi, Jamal	1
Arslan, Burcu	1
Beller, Michal	1
Bond, Lloyd	1
Bottsford-Miller, Nicole A.	1
Capie, William	1
Carter, Kathy	1
Cole, Nancy S.	1
Crawford, Angela	1
Cronbach, Lee J.	1
Drasgow, Fritz	1
Flanagan, John C.	1
Forsyth, Robert A.	1
Frisbie, David A.	1
Gierl, Mark J.	1
Gong, Brian	1
Gong, Tao	1
Gorin, Joanna S.	1
Gramenz, Gary W.	1
Jiang, Yang	1
Johnson, Evelyn S.	1
Johnstone, Christopher J.	1
Jolly, S. Jean	1
Katz, Irvin R.	1
More ▼