Showing 1 to 15 of 220 results
Peer reviewed
Visser, Linda; Cartschau, Friederike; von Goldammer, Ariane; Brandenburg, Janin; Timmerman, Marieke; Hasselhorn, Marcus; Mähler, Claudia – Applied Measurement in Education, 2023
The growing number of children in primary schools in Germany who have German as their second language (L2) has raised questions about the fairness of performance assessment. Fair tests are a prerequisite for distinguishing between L2 learning delay and a specific learning disability. We evaluated five commonly used reading and spelling tests for…
Descriptors: Foreign Countries, Error of Measurement, Second Language Learning, German
Peer reviewed
Poe, Mya; Oliveri, Maria Elena; Elliot, Norbert – Applied Measurement in Education, 2023
Since 1952, the "Standards for Educational and Psychological Testing" has provided criteria for developing and evaluating educational and psychological tests and testing practice. Yet, we argue that the foundations, operations, and applications in the "Standards" are no longer sufficient to meet the current U.S. testing demands…
Descriptors: Racism, Social Justice, Standards, Psychological Testing
Peer reviewed
Lederman, Josh – Applied Measurement in Education, 2023
Given its centrality to assessment, until the concept of validity includes concern for racial justice, such matters will be seen as residing outside the "real" work of validation, rendering them powerless to count against the apparent scientific merit of the test. As the definition of validity has evolved, however, it holds great…
Descriptors: Educational Assessment, Validity, Social Justice, Race
Peer reviewed
Russell, Michael – Applied Measurement in Education, 2023
In recent years, issues of race, racism and social justice have garnered increased attention across the nation. Although some aspects of social justice, particularly cultural sensitivity and test bias, have received similar attention within the field of educational measurement, a sharp focus on racism has eluded the field. This manuscript focuses…
Descriptors: Racism, Social Justice, Theories, Race
Peer reviewed
Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020
Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…
Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries
Peer reviewed
Peabody, Michael R. – Applied Measurement in Education, 2020
The purpose of the current article is to introduce the equating and evaluation methods used in this special issue. Although a comprehensive review of all existing models and methodologies would be impractical given the format, a brief introduction to some of the more popular models will be provided. A brief discussion of the conditions required…
Descriptors: Evaluation Methods, Equated Scores, Sample Size, Item Response Theory
Peer reviewed
Geisinger, Kurt F. – Applied Measurement in Education, 2019
This brief article introduces the topic of intelligence as highly appropriate for educational measurement professionals. It describes some of the uses of intelligence tests both historically and currently. It argues why knowledge of intelligence theory and intelligence testing is important for educational measurement professionals. The articles…
Descriptors: Intelligence Tests, Intelligence, Models, Educational Assessment
Peer reviewed
Canivez, Gary L.; Youngstrom, Eric A. – Applied Measurement in Education, 2019
The Cattell-Horn-Carroll (CHC) taxonomy of cognitive abilities married John Horn and Raymond Cattell's Extended Gf-Gc theory with John Carroll's Three-Stratum Theory. While there are some similarities in arrangements or classifications of tasks (observed variables) within similar broad or narrow dimensions, other salient theoretical features and…
Descriptors: Taxonomy, Cognitive Ability, Intelligence, Cognitive Tests
Peer reviewed
McGill, Ryan J.; Dombrowski, Stefan C. – Applied Measurement in Education, 2019
The Cattell-Horn-Carroll (CHC) model presently serves as a blueprint for both test development and a taxonomy for clinical interpretation of modern tests of cognitive ability. Accordingly, the trend among test publishers has been toward creating tests that provide users with an ever-increasing array of scores that comport with CHC. However, an…
Descriptors: Models, Cognitive Ability, Intelligence Tests, Intelligence
Peer reviewed
Thompson, W. Jake; Clark, Amy K.; Nash, Brooke – Applied Measurement in Education, 2019
As the use of diagnostic assessment systems transitions from research applications to large-scale assessments for accountability purposes, reliability methods that provide evidence at each level of reporting are needed. The purpose of this paper is to summarize one simulation-based method for estimating and reporting reliability for an…
Descriptors: Test Reliability, Diagnostic Tests, Classification, Computation
Peer reviewed
Confrey, Jere; Toutkoushian, Emily; Shah, Meetal – Applied Measurement in Education, 2019
Fully articulating validation arguments in the context of classroom assessment requires connecting evidence from multiple sources and addressing multiple types of validity in a coherent chain of reasoning. This type of validation argument is particularly complex for assessments that function in close proximity to instruction, address the fine…
Descriptors: Test Validity, Item Response Theory, Middle School Students, Mathematics Instruction
Peer reviewed
Jacobson, Erik; Svetina, Dubravka – Applied Measurement in Education, 2019
Contingent argument-based approaches to validity require a unique argument for each use, in contrast to more prescriptive approaches that identify the common kinds of validity evidence researchers should consider for every use. In this article, we evaluate our use of an approach that is both prescriptive "and" argument-based to develop a…
Descriptors: Test Validity, Test Items, Test Construction, Test Interpretation
Peer reviewed
Carney, Michele; Crawford, Angela; Siebert, Carl; Osguthorpe, Rich; Thiede, Keith – Applied Measurement in Education, 2019
The "Standards for Educational and Psychological Testing" recommend an argument-based approach to validation that involves a clear statement of the intended interpretation and use of test scores, the identification of the underlying assumptions and inferences in that statement--termed the interpretation/use argument, and gathering of…
Descriptors: Inquiry, Test Interpretation, Validity, Scores
Peer reviewed
Gotwals, Amelia Wenk – Applied Measurement in Education, 2018
In this commentary, I consider the three empirical studies in this special issue based on two main aspects: (a) the nature of the learning progressions and (b) what formative assessment practice(s) were investigated. Specifically, I describe differences among the learning progressions in terms of scope and grain size. I also identify three…
Descriptors: Skill Development, Behavioral Objectives, Formative Evaluation, Evaluation Methods
Peer reviewed
Shermis, Mark D. – Applied Measurement in Education, 2018
This article employs the Common European Framework Reference for Language Acquisition (CEFR) as a basis for evaluating writing in the context of machine scoring. The CEFR was designed as a framework for evaluating proficiency levels of speaking for the 49 languages comprising the European Union. The intent was to impact language instruction so…
Descriptors: Scoring, Automation, Essays, Language Proficiency