Publication Date
| In 2015 | 0 |
| Since 2014 | 0 |
| Since 2011 (last 5 years) | 11 |
| Since 2006 (last 10 years) | 21 |
| Since 1996 (last 20 years) | 22 |
Descriptor
| Evaluation | 22 |
| Validity | 9 |
| Measurement | 8 |
| Scores | 7 |
| Definitions | 5 |
| Item Response Theory | 5 |
| Tests | 5 |
| Models | 4 |
| Test Validity | 4 |
| Equated Scores | 3 |
| More ▼ | |
Source
| Measurement:… | 22 |
Author
| Behizadeh, Nadia | 1 |
| Bejar, Isaac I. | 1 |
| Black, Paul | 1 |
| Borsboom, Denny | 1 |
| Bramley, Tom | 1 |
| Briggs, Derek C. | 1 |
| Engelhard, George, Jr. | 1 |
| Gerstorf, Denis | 1 |
| Gregus, Michelle | 1 |
| Hardy, Sam A. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 22 |
| Opinion Papers | 15 |
| Reports - Evaluative | 4 |
| Reports - Descriptive | 3 |
| Reports - Research | 1 |
Education Level
| Higher Education | 2 |
| Elementary Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 5 | 1 |
| Intermediate Grades | 1 |
Audience
Showing 1 to 15 of 22 results
Wyse, Adam E. – Measurement: Interdisciplinary Research and Perspectives, 2013
Construct maps are tools that display how the underlying achievement construct upon which one is trying to set cut-scores is related to other information used in the process of standard setting. This article reviews what construct maps are, uses construct maps to provide a conceptual framework to view commonly used standard-setting procedures (the…
Descriptors: Standard Setting (Scoring), Maps, Cutting Scores, Methods
McClarty, Katie Larsen – Measurement: Interdisciplinary Research and Perspectives, 2013
The construct map is a promising tool for organizing the data standard-setting panelists interpret. The challenge in applying construct maps to standard-setting procedures will be the judicious selection of data to include within this organizing framework. Therefore, this commentary focuses on decisions about what to include in the construct map.…
Descriptors: Standard Setting (Scoring), Maps, Validity, Evidence
Koretz, Daniel – Measurement: Interdisciplinary Research and Perspectives, 2013
Haertel's argument is that one must "expand the scope of test validation to include indirect testing effects" because these effects are often the "rationale for the entire testing program." The author strongly agrees that this is essential. However, he maintains that Haertel's argument does not go far enough and that there are two additional…
Descriptors: Educational Testing, Test Validity, Test Results, Testing Programs
Zand Scholten, Annemarie – Measurement: Interdisciplinary Research and Perspectives, 2012
This paper presents the author's critique to Paul E. Newton's article titled "Clarifying the consensus definition of validity." In his article, Newton not only clarifies but also redefines the consensus definition of validity. In this redefinition he omits the term "construct" and introduces the term "measurement." Both omission and introduction…
Descriptors: Validity, Definitions, Evaluation, Test Use
Pollitt, Alastair – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's article is valuable in many ways, especially for clarifying confusions and inconsistencies in the assessment business. Most importantly, he points out confusions that persist and where open discussion will help us understand what we say and what we mean to say. But I will focus here on the only faults I find in the article: three…
Descriptors: Validity, Evaluation, Definitions, Test Construction
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2012
This focus article provided the author with an opportunity to unpack the consensus definition of validity and to explore its implications in the light of recent debates. He proposed an elaboration of the consensus definition, which was intended to express the spirit of the "Standards for Educational and Psychological Testing" with increased…
Descriptors: Validity, Educational Testing, Psychological Testing, Definitions
Black, Paul – Measurement: Interdisciplinary Research and Perspectives, 2012
Insofar as the title of this piece might call for a straightforward answer, it seems obvious that EPMA professionals are servants. Viewed in this perspective, Paul E. Newton's analysis is carefully balanced, in that it respects the complex history of the concerns of the professionals, whilst moving towards conclusions that place the needs of the…
Descriptors: Validity, Measurement, Test Results, Evaluation
Bramley, Tom – Measurement: Interdisciplinary Research and Perspectives, 2012
There is something about the topic of validity that seems to provoke dissatisfaction in many of those who encounter it--a sense that something is not right, and that something needs to be done to sort it out. Paul E. Newton in his target essay does not attempt a radical reconstruction of the validity edifice. His position is that the "consensus…
Descriptors: Validity, Measurement, Construct Validity, Definitions
Engelhard, George, Jr.; Behizadeh, Nadia – Measurement: Interdisciplinary Research and Perspectives, 2012
In his article, Paul E. Newton has conducted a review of selected perspectives on validity theory with the goal of disambiguating the definition of validity and describing a consensus definition of validity. Newton provides a nuanced discussion of the evolution of the concept of validity over the years. His Focus article has two major goals: (1)…
Descriptors: Validity, Psychological Testing, Researchers, Definitions
Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's article on the consensus definition of validity tackles a number of big issues and makes a number of strong claims. I agreed with much of what he said, and I disagreed with a number of his claims, but I found his article to be consistently interesting and thought provoking (whether I agreed or not). I will focus on three general…
Descriptors: Validity, Construct Validity, Tests, Testing
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012
Considering consequences in the evaluation of validity is not new although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…
Descriptors: Educational Opportunities, Accountability, Validity, Inferences
van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010
The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the journal try…
Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends
Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2010
Mroch, Suh, Kane, & Ripkey (2009); Suh, Mroch, Kane, & Ripkey (2009); and Kane, Mroch, Suh, & Ripkey (2009) provided elucidating discussions on critical properties of linear equating methods under the nonequivalent groups with anchor test (NEAT) design. In this popular equating design, two test forms are administered to different groups of…
Descriptors: Equated Scores, Test Items, Factor Analysis, Models
Newton, Paul – Measurement: Interdisciplinary Research and Perspectives, 2010
Despite over a century of aligning test and examination scales, the theory of linking has received relatively little attention. Recently, though, frameworks for classifying linking relationships have proliferated, both in England and the United States. Limitations of U.S. frameworks, particularly the idea that linking relationships ought to be…
Descriptors: Foreign Countries, Models, Comparative Analysis, Evaluation
Maris, Gunter; Schmittmann, Verena D.; Borsboom, Denny – Measurement: Interdisciplinary Research and Perspectives, 2010
Test equating under the NEAT design is, at best, a necessary evil. At bottom, the procedure aims to reach a conclusion on what a tested person would have done, if he or she were administered a set of items that were in fact never administered. It is not possible to infer such a conclusion from the data, because one simply has not made the required…
Descriptors: Equated Scores, Inferences, Item Response Theory, Error of Measurement
Previous Page | Next Page ยป
Pages: 1 | 2
Peer reviewed
Direct link
