ERIC - Search Results

Publication Date

In 2024	0
Since 2023	0
Since 2020 (last 5 years)	0
Since 2015 (last 10 years)	0
Since 2005 (last 20 years)	1

Descriptor

Criterion Referenced Tests	5
Test Items	5
Difficulty Level	3
Achievement Tests	2
Comparative Analysis	2
Guessing (Tests)	2
Scoring	2
Test Construction	2
Context Effect	1
Cutting Scores	1
Grade 11	1
Grade 5	1
Higher Education	1
Mastery Tests	1
Models	1
Multiple Choice Tests	1
Norm Referenced Tests	1
Objective Tests	1
Probability	1
Psychometrics	1
Reading Comprehension	1
Reading Tests	1
Reliability	1
Science Tests	1
Standard Setting (Scoring)	1
More ▼

Source

Educational and Psychological…

Author

Bennett, Judith A.	1
Haladyna, Thomas M.	1
Hanna, Gerald S.	1
Roid, G. H.	1
Suen, Hoi K.	1
Tsai, Fu-Ju	1
Wilcox, Rand R.	1
Wyse, Adam E.	1

Publication Type

Journal Articles	4
Reports - Research	2
Opinion Papers	1
Reports - Evaluative	1

Education Level

Grade 11	1
Grade 5	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Peer reviewed

Direct link

Wyse, Adam E. – Educational and Psychological Measurement, 2011

Standard setting is a method used to set cut scores on large-scale assessments. One of the most popular standard setting methods is the Bookmark method. In the Bookmark method, panelists are asked to envision a response probability (RP) criterion and move through a booklet of ordered items based on a RP criterion. This study investigates whether…

Descriptors: Testing Programs, Standard Setting (Scoring), Cutting Scores, Probability

A Comparison of Objective-Based and Modified-Bormuth Item Writing Techniques

Peer reviewed

Roid, G. H.; Haladyna, Thomas M. – Educational and Psychological Measurement, 1978

Two techniques for writing achievement test items to accompany instructional materials are contrasted: writing items from statements of instructional objectives, and writing items from semi-automated rules for transforming instructional statements. Both systems resulted in about the same number of faulty items. (Author/JKS)

Descriptors: Achievement Tests, Comparative Analysis, Criterion Referenced Tests, Difficulty Level

Instructional Sensitivity Expanded.

Peer reviewed

Hanna, Gerald S.; Bennett, Judith A. – Educational and Psychological Measurement, 1984

The presently viewed role and utility of measures of instructional sensitivity are summarized. A case is made that the rationale for the assessment of instructional sensitivity can be applied to all achievement tests and should not be restricted to criterion-referenced mastery tests. (Author/BW)

Descriptors: Achievement Tests, Context Effect, Criterion Referenced Tests, Mastery Tests

Determining the Length of Multiple Choice Criterion-Referenced Tests When an Answer-Until-Correct Scoring Procedure Is Used.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1982

When determining criterion-referenced test length, problems of guessing are shown to be more serious than expected. A new method of scoring is presented that corrects for guessing without assuming that guessing is random. Empirical investigations of the procedure are examined. Test length can be substantially reduced. (Author/CM)

Descriptors: Criterion Referenced Tests, Guessing (Tests), Multiple Choice Tests, Scoring

A Brief Report on a Comparison of Six Scoring Methods for Multiple True-False Items.

Peer reviewed

Tsai, Fu-Ju; Suen, Hoi K. – Educational and Psychological Measurement, 1993

Six methods of scoring multiple true-false items were compared in terms of reliabilities, difficulties, and discrimination. Results suggest that, for norm-referenced score interpretations, there is insufficient evidence to support any one of the methods as superior. For criterion-referenced score interpretations, effects of scoring method must be…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Difficulty Level, Guessing (Tests)

Privacy | Copyright | Contact Us | Selection Policy | API