Showing 1 to 15 of 46 results
Koch, Martha J. – Educational Measurement: Issues and Practice, 2014
Implications of the multiple-use of accountability assessments for the process of validation are examined. Multiple-use refers to the simultaneous use of results from a single administration of an assessment for its intended use and for one or more additional uses. A theoretical discussion of the issues for validation which emerge from…
Descriptors: Foreign Countries, Test Use, Accountability, Validity
Camara, Wayne – Educational Measurement: Issues and Practice, 2014
This article reviews the intended uses of these college- and career-readiness assessments with the goal of articulating an appropriate validity argument to support such uses. These assessments differ fundamentally from today's state assessments employed for state accountability. Current assessments are used to determine if students have…
Descriptors: College Readiness, Career Readiness, Aptitude Tests, Test Use
Tiffin-Richards, Simon P.; Pant, Hans Anand; Koller, Olaf – Educational Measurement: Issues and Practice, 2013
Cut-scores were set by expert judges on assessments of reading and listening comprehension of English as a foreign language (EFL), using the bookmark standard-setting method to differentiate proficiency levels defined by the Common European Framework of Reference (CEFR). Assessments contained stratified item samples drawn from extensive item…
Descriptors: Foreign Countries, English (Second Language), Language Tests, Standard Setting (Scoring)
Cui, Ying; Roberts, Mary Roduta – Educational Measurement: Issues and Practice, 2013
The goal of this study was to investigate the usefulness of person-fit analysis in validating student score inferences in a cognitive diagnostic assessment. In this study, a two-stage procedure was used to evaluate person fit for a diagnostic test in the domain of statistical hypothesis testing. In the first stage, the person-fit statistic, the…
Descriptors: Scores, Validity, Cognitive Tests, Diagnostic Tests
Williamson, David M.; Xi, Xiaoming; Breyer, F. Jay – Educational Measurement: Issues and Practice, 2012
A framework for evaluation and use of automated scoring of constructed-response tasks is provided that entails both evaluation of automated scoring as well as guidelines for implementation and maintenance in the context of constantly evolving technologies. Consideration of validity issues and challenges associated with automated scoring are…
Descriptors: Automation, Scoring, Evaluation, Guidelines
Huggins, Anne C.; Penfield, Randall D. – Educational Measurement: Issues and Practice, 2012
A goal for any linking or equating of two or more tests is that the linking function be invariant to the population used in conducting the linking or equating. Violations of population invariance in linking and equating jeopardize the fairness and validity of test scores, and pose particular problems for test-based accountability programs that…
Descriptors: Equated Scores, Tests, Test Bias, Validity
Informing in the Information Age: How to Communicate Measurement Concepts to Education Policy Makers
Sireci, Stephen G.; Forte, Ellen – Educational Measurement: Issues and Practice, 2012
Current educational policies rely on educational assessments. However, the technical aspects of assessments are often unknown to policy makers, which is dangerous because sound assessment policy requires knowledge of the strengths and limitations of educational tests. In this article, we discuss the importance of informing policy makers of…
Descriptors: Educational Assessment, Psychometrics, Educational Policy, Educational Testing
Bandalos, Deborah L.; Kopp, Jason P. – Educational Measurement: Issues and Practice, 2012
In this article, we discuss the importance of measurement literacy and some issues encountered in teaching introductory measurement courses. We present results from a survey of introductory measurement instructors, including information about the topics included in such courses and the amount of time spent on each. Topics that were included by the…
Descriptors: Class Activities, Motivation Techniques, Item Analysis, Test Theory
Suto, Irenka – Educational Measurement: Issues and Practice, 2012
Internationally, many assessment systems rely predominantly on human raters to score examinations. Arguably, this facilitates the assessment of multiple sophisticated educational constructs, strengthening assessment validity. It can introduce subjectivity into the scoring process, however, engendering threats to accuracy. The present objectives…
Descriptors: Evaluation Methods, Scoring, Qualitative Research, Protocol Analysis
Myford, Carol M. – Educational Measurement: Issues and Practice, 2012
Over the last several decades, researchers have studied many and varied aspects of rater cognition. Those interested in pursuing basic research have focused on gaining an understanding of raters' thought processes as they score different types of performances and products, striving to understand how raters' mental representations and the cognitive…
Descriptors: Evidence, Validity, Cognitive Processes, Models
Bejar, Isaac I. – Educational Measurement: Issues and Practice, 2012
The scoring process is critical in the validation of tests that rely on constructed responses. Documenting that readers carry out the scoring in ways consistent with the construct and measurement goals is an important aspect of score validity. In this article, rater cognition is approached as a source of support for a validity argument for scores…
Descriptors: Scores, Inferences, Validity, Scoring
Polikoff, Morgan S. – Educational Measurement: Issues and Practice, 2010
Standards-based reform, as codified by the No Child Left Behind Act, relies on the ability of assessments to accurately reflect the learning that takes place in U.S. classrooms. However, this property of assessments--their instructional sensitivity--is rarely, if ever, investigated by test developers, states, or researchers. In this paper, the…
Descriptors: Federal Legislation, Psychometrics, Accountability, Teaching Methods
Geisinger, Kurt F.; McCormick, Carina M. – Educational Measurement: Issues and Practice, 2010
Standard-setting studies utilizing procedures such as the Bookmark or Angoff methods are just one component of the complete standard-setting process. Decision makers ultimately must determine what they believe to be the most appropriate standard or cut score to use, employing the input of the standard-setting panelists as one piece of information…
Descriptors: Standard Setting (Scoring), Measurement, Cutting Scores, Educational Policy
Chapelle, Carol A.; Enright, Mary K.; Jamieson, Joan – Educational Measurement: Issues and Practice, 2010
Drawing on experience between 2000 and 2007 in developing a validity argument for the high-stakes Test of English as a Foreign Language™ (TOEFL®), this paper evaluates the differences between the argument-based approach to validity as presented by Kane (2006) and that described in the 1999 AERA/APA/NCME Standards for Educational and…
Descriptors: Psychological Testing, Validity, High Stakes Tests, English (Second Language)
Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009
Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…
Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods

