NotesFAQContact Us
Collection
Advanced
Search Tips
50 Years of ERIC
50 Years of ERIC
The Education Resources Information Center (ERIC) is celebrating its 50th Birthday! First opened on May 15th, 1964 ERIC continues the long tradition of ongoing innovation and enhancement.

Learn more about the history of ERIC here. PDF icon

Showing 1 to 15 of 18 results
Peer reviewed Peer reviewed
Direct linkDirect link
Almond, Russell G.; Kim, Yoon Jeon; Velasquez, Gertrudes; Shute, Valerie J. – Measurement: Interdisciplinary Research and Perspectives, 2014
One of the key ideas of evidence-centered assessment design (ECD) is that task features can be deliberately manipulated to change the psychometric properties of items. ECD identifies a number of roles that task-feature variables can play, including determining the focus of evidence, guiding form creation, determining item difficulty and…
Descriptors: Educational Games, Simulation, Psychometrics, Educational Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
DiCerbo, Kristen E. – Measurement: Interdisciplinary Research and Perspectives, 2014
The Task Model is a description of each task in a workflow. It defines attributes associated with that task. The creation of task models becomes increasingly important as the assessment tasks become more complex. Explicitly delineating the impact of task variables on the ability to collect evidence and make inferences demands thoughtfulness from…
Descriptors: Task Analysis, Models, Taxonomy, Design
Peer reviewed Peer reviewed
Direct linkDirect link
Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013
In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…
Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Cramer, Angelique O. J. – Measurement: Interdisciplinary Research and Perspectives, 2012
What is validity? A simple question but apparently one with many answers, as Paul Newton highlights in his review of the history of validity. The current definition of validity, as entertained in the 1999 "Standards for Educational and Psychological Testing" is indeed a consensus, one between the classical notion of attributes, and measures…
Descriptors: Validity, Educational Testing, Depression (Psychology), Psychology
Peer reviewed Peer reviewed
Direct linkDirect link
Haig, Brian D. – Measurement: Interdisciplinary Research and Perspectives, 2012
Lee Cronbach once expressed the view that all roads lead to construct validity. In looking to clarify the consensus definition of validity, and its place in assessment, Newton is also led to the troublesome idea of construct validity. To be sure, he addresses other validity issues, but in this commentary, I will restrict my attention to construct…
Descriptors: Validity, Educational Assessment, Construct Validity, Definitions
Peer reviewed Peer reviewed
Direct linkDirect link
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2012
The 1999 "Standards for Educational and Psychological Testing" defines validity as the degree to which evidence and theory support the interpretations of test scores entailed by proposed uses of tests. Although quite explicit, there are ways in which this definition lacks precision, consistency, and clarity. The history of validity has taught us…
Descriptors: Evidence, Validity, Educational Testing, Risk
Peer reviewed Peer reviewed
Direct linkDirect link
Dunne, Timothy T. – Measurement: Interdisciplinary Research and Perspectives, 2011
The notion of the road map, advocated by Black, Wilson, and Yao (2011), and the associated minutiae of the construct map have several powerful features. At one level these notions assist the teacher to select and embody a suitable sequence of constructs within a specified curriculum. Whatever disparate sequenced pathways individual learners may…
Descriptors: Student Evaluation, Concept Mapping, Learning Processes, Learning Strategies
Peer reviewed Peer reviewed
Direct linkDirect link
Black, Paul; Wilson, Mark; Yao, Shih-Ying – Measurement: Interdisciplinary Research and Perspectives, 2011
In this rejoinder, the authors provide their thoughts on each of the commentaries of the seven respondents to their article. They find that the response of Kyngdon differs markedly from the others in questioning some basic elements of the methods of analysis that they propose for the construction of a "road map." The authors emphasize that they…
Descriptors: Teaching Methods, Data Analysis, Measurement Techniques, Educational Improvement
Peer reviewed Peer reviewed
Direct linkDirect link
Coburn, Cynthia E.; Turner, Erica O. – Measurement: Interdisciplinary Research and Perspectives, 2011
Data use is a phenomenon that spans boundaries of disciplines, implicating issues of measurement and assessment, issues of learning and cognition, issues of organizational context and change, and issues of power and politics, among others. Traditionally, scholarship on these different aspects of data use lives in different disciplinary homes. In…
Descriptors: Data, Information Utilization, Models, Intervention
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael T.; Mroch, Andrew A.; Suh, Youngsuk; Ripkey, Douglas R. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the authors' rejoinder to commentaries on linear equating and the NEAT design. The authors appreciate the insightful work of the commentary writers. Each has made a number of interesting points, many of which the authors had not considered at all. Before responding to some of those points, the authors reiterate what they see…
Descriptors: Weighted Scores, Equated Scores, Models, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010
The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the journal try…
Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends
Peer reviewed Peer reviewed
Direct linkDirect link
Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2010
Mroch, Suh, Kane, & Ripkey (2009); Suh, Mroch, Kane, & Ripkey (2009); and Kane, Mroch, Suh, & Ripkey (2009) provided elucidating discussions on critical properties of linear equating methods under the nonequivalent groups with anchor test (NEAT) design. In this popular equating design, two test forms are administered to different groups of…
Descriptors: Equated Scores, Test Items, Factor Analysis, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Newton, Paul – Measurement: Interdisciplinary Research and Perspectives, 2010
Despite over a century of aligning test and examination scales, the theory of linking has received relatively little attention. Recently, though, frameworks for classifying linking relationships have proliferated, both in England and the United States. Limitations of U.S. frameworks, particularly the idea that linking relationships ought to be…
Descriptors: Foreign Countries, Models, Comparative Analysis, Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Huff, Kristen; Plake, Barbara S. – Measurement: Interdisciplinary Research and Perspectives, 2010
Standard setting is a systematic process that uses a combination of judgmental and empirical procedures to make recommendations about where on the score continuum "cut scores" should be placed. Cut scores divide the score scale into categories consistent with the descriptions of student performance associated with multiple levels of achievement.…
Descriptors: Accountability, Educational Testing, Elementary Secondary Education, Standard Setting (Scoring)
Peer reviewed Peer reviewed
Direct linkDirect link
Borsboom, Denny – Measurement: Interdisciplinary Research and Perspectives, 2008
Achilles and the tortoise want to predict peoples' running performance from their age. In determining age the tortoise has cleverly asked his participants for their passports and has noted their date of birth, before letting them run half the distance between Achilles and himself. Achilles, however, has given his participants a questionnaire…
Descriptors: Psychometrics, Measurement, Models, Applied Linguistics
Previous Page | Next Page ยป
Pages: 1  |  2