Publication Date
| In 2015 | 0 |
| Since 2014 | 1 |
| Since 2011 (last 5 years) | 8 |
| Since 2006 (last 10 years) | 31 |
| Since 1996 (last 20 years) | 32 |
Descriptor
Source
| Measurement:… | 32 |
Author
| Hill, Heather C. | 3 |
| Briggs, Derek C. | 2 |
| Kane, Michael T. | 2 |
| Mislevy, Robert J. | 2 |
| Mroch, Andrew A. | 2 |
| Ripkey, Douglas R. | 2 |
| Suh, Youngsuk | 2 |
| Alonzo, Alicia C. | 1 |
| Beguin, Anton | 1 |
| Bejar, Isaac I. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 32 |
| Opinion Papers | 24 |
| Reports - Evaluative | 5 |
| Reports - Research | 3 |
| Reports - Descriptive | 2 |
Education Level
| Elementary Secondary Education | 20 |
| Elementary Education | 2 |
Audience
Showing 1 to 15 of 32 results
West, Stephen G.; Grimm, Kevin J. – Measurement: Interdisciplinary Research and Perspectives, 2014
These authors agree with Bainter and Bollen that causal effects represents a useful measurement structure in some applications. The structure of the science of the measurement problem should determine the model; the measurement model should not determine the science. They also applaud Bainter and Bollen's important reminder that the full…
Descriptors: Causal Models, Measurement, Test Theory, Statistical Analysis
Skaggs, Gary – Measurement: Interdisciplinary Research and Perspectives, 2013
The construct map is a particularly good way to approach instrument development, and this author states that he was delighted to read Adam Wyse's thoughts about how to use construct maps for standard setting. For a number of popular standard-setting methods, Wyse shows how typical feedback to panelists fits within a construct map framework.…
Descriptors: Standard Setting (Scoring), Maps, Test Construction, Measurement
Briggs, Derek C. – Measurement: Interdisciplinary Research and Perspectives, 2013
In his focus article "How Is Testing Supposed to Improve Schooling?" Ed Haertel distinguishes between seven uses of educational tests as a function of the intended action and what or who will be influenced by the intended action. He then applies Mike Kane's interpretive argument approach (Kane, 2006) as a basis for speculating about the validity…
Descriptors: Educational Testing, Accountability, Educational Improvement, Teacher Evaluation
Mislevy, Robert J. – Measurement: Interdisciplinary Research and Perspectives, 2013
Measurement is a semantic frame, a constellation of relationships and concepts that correspond to recurring patterns in human activity, highlighting typical roles, processes, and viewpoints (e.g., the "commercial event") but not others. One uses semantic frames to reason about unique and complex situations--sometimes intuitively, sometimes…
Descriptors: Educational Assessment, Measurement, Feedback (Response), Evidence
Shepard, Lorrie A. – Measurement: Interdisciplinary Research and Perspectives, 2013
In his article, Haertel (this issue) asks a fundamental question about how use of a test is expected to cause improvements in the educational system and in learning. He also considers how test validity should be investigated and argues for a more expansive view of validity that does not stop with scoring or generalization (the more technical and…
Descriptors: Educational Testing, Test Validity, Test Results, Test Construction
Pollitt, Alastair – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's article is valuable in many ways, especially for clarifying confusions and inconsistencies in the assessment business. Most importantly, he points out confusions that persist and where open discussion will help us understand what we say and what we mean to say. But I will focus here on the only faults I find in the article: three…
Descriptors: Validity, Evaluation, Definitions, Test Construction
Hood, S. Brian – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton argues in favor of a conception of validity, viz, "the consensus definition of validity," according to which the extension of the predicate "is valid" is a subset of "assessment-based decision-making procedure[s], which [are] underwritten by an argument that the assessment procedure can be used to measure the attribute entailed by…
Descriptors: Validity, Test Construction, Definitions, Psychological Testing
Lissitz, Robert W.; Calico, Tiago – Measurement: Interdisciplinary Research and Perspectives, 2012
This paper presents the authors' critique on "Clarifying the Consensus Definition of Validity" by Paul E. Newton (this issue). There are serious differences of opinion regarding the topic of validity. Newton is aware of these differences, as made clear by his choice of references and particularly his effort to respond to the various Borsboom…
Descriptors: Concept Formation, Test Construction, Test Validity, Scores
Brandt, Steffen – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's commentary on "Updating the Duplex Design for Test-Based Accountability in the Twenty-First Century," in which Isaac I. Bejar and E. Aurora Graf propose the application of a test design--the duplex design (which was proposed in 1988 by Bock and Mislevy) for application in current accountability assessments.…
Descriptors: Accountability, Educational Testing, Test Construction, Computer Assisted Testing
Alonzo, Alicia C. – Measurement: Interdisciplinary Research and Perspectives, 2010
In their article "Innovations in Setting Performance Standards for K-12 Test-Based Accountability," Kristen Huff and Barbara S. Plake (2010) lay out three preconditions for continued investment in standard-setting methodology and practice, all focused on the sound development and use of achievement level descriptors (ALDs). Among these…
Descriptors: Standard Setting (Scoring), Achievement, Elementary Secondary Education, Accountability
Mislevy, Robert J. – Measurement: Interdisciplinary Research and Perspectives, 2010
In "Updating the Duplex Design for Test-Based Accountability in the Twenty-First Century," Bejar and Graf (2010) propose extensions to the duplex design for large-scale assessment presented in Bock and Mislevy (1988). Examining the range of people who use assessment results--from students, teachers, administrators, curriculum designers,…
Descriptors: Measurement, Test Construction, Educational Testing, Data Collection
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In his article…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test linkings, in…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Huff, Kristen; Plake, Barbara S. – Measurement: Interdisciplinary Research and Perspectives, 2010
Standard setting is a systematic process that uses a combination of judgmental and empirical procedures to make recommendations about where on the score continuum "cut scores" should be placed. Cut scores divide the score scale into categories consistent with the descriptions of student performance associated with multiple levels of achievement.…
Descriptors: Accountability, Educational Testing, Elementary Secondary Education, Standard Setting (Scoring)
Koretz, Daniel; Beguin, Anton – Measurement: Interdisciplinary Research and Perspectives, 2010
Test-based accountability is now the cornerstone of U.S. education policy, and it is becoming more important in many other nations as well. Educators sometimes respond to test-based accountability in ways that produce score inflation. In the past, score inflation has usually been evaluated by comparing trends in scores on a high-stakes test to…
Descriptors: Accountability, High Stakes Tests, Test Construction, Scores

Peer reviewed
Direct link
