Publication Date
| In 2015 | 0 |
| Since 2014 | 3 |
| Since 2011 (last 5 years) | 11 |
| Since 2006 (last 10 years) | 25 |
| Since 1996 (last 20 years) | 37 |
Descriptor
| Test Items | 37 |
| Mathematics Tests | 12 |
| Difficulty Level | 11 |
| Science Tests | 10 |
| Grade 4 | 9 |
| Scores | 9 |
| Test Construction | 9 |
| English (Second Language) | 7 |
| Multiple Choice Tests | 7 |
| Item Response Theory | 6 |
Source
| Educational Assessment | 37 |
Author
| Lee, Hee-Sun | 3 |
| Linn, Marcia C. | 3 |
| Liu, Ou Lydia | 3 |
| Solano-Flores, Guillermo | 3 |
| Huff, Kristen L. | 2 |
| Plake, Barbara S. | 2 |
| Sireci, Stephen G. | 2 |
| Taylor, Catherine S. | 2 |
| Abedi, Jamal | 1 |
| Alonzo, Alicia C. | 1 |
Publication Type
| Journal Articles | 37 |
| Reports - Research | 19 |
| Reports - Evaluative | 14 |
| Reports - Descriptive | 4 |
| Information Analyses | 1 |
| Speeches/Meeting Papers | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Grade 4 | 12 |
| Elementary Education | 10 |
| Elementary Secondary Education | 7 |
| Middle Schools | 7 |
| Grade 5 | 6 |
| Grade 7 | 6 |
| Grade 8 | 6 |
| Intermediate Grades | 6 |
| Junior High Schools | 5 |
| Secondary Education | 5 |
Showing 1 to 15 of 37 results
Pae, Hye K. – Educational Assessment, 2014
This study investigated the role of item formats in the performance of 206 nonnative speakers of English on expressive skills (i.e., speaking and writing). Test scores were drawn from the field test of the "Pearson Test of English Academic" for Chinese, French, Hebrew, and Korean native speakers. Four item formats, including…
Descriptors: Test Items, Test Format, Speech Skills, Writing Skills
Lakin, Joni M. – Educational Assessment, 2014
The purpose of test directions is to familiarize examinees with a test so that they respond to items in the manner intended. However, changes in educational measurement as well as the U.S. student population present new challenges to test directions and increase the impact that differential familiarity could have on the validity of test score…
Descriptors: Test Content, Test Construction, Best Practices, Familiarity
Solano-Flores, Guillermo; Wang, Chao; Kachchaf, Rachel; Soltero-Gonzalez, Lucinda; Nguyen-Le, Khanh – Educational Assessment, 2014
We address valid testing for English language learners (ELLs)--students in the United States who are schooled in English while they are still acquiring English as a second language. Also, we address the need for procedures for systematically developing ELL testing accommodations--changes in tests intended to support ELLs to gain access to the…
Descriptors: English Language Learners, Testing Accommodations, Illustrations, Educational Testing
Solano-Flores, Guillermo; Barnett-Clarke, Carne; Kachchaf, Rachel R. – Educational Assessment, 2013
We examined the performance of English language learners (ELLs) and non-ELLs on Grade 4 and Grade 5 mathematics content knowledge (CK) and academic language (AL) tests. CK and AL items had different semiotic loads (numbers of different types of semiotic features) and different semiotic structures (relative frequencies of different semiotic…
Descriptors: English Language Learners, Performance, Mathematics Tests, Semiotics
Cawthon, Stephanie; Leppo, Rachel; Carr, Therese; Kopriva, Rebecca – Educational Assessment, 2013
When do item adaptations veer from their intent and, instead of increasing access, modify the construct being measured? This study analyzed early elementary student achievement data from a statewide field test containing both standard and adapted science items. Four student groups were included in this analysis: English language learners, students…
Descriptors: Testing Accommodations, Test Items, Adaptive Testing, Science Tests
Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Jung, EunJu; Liu, Kimy; Geller, Josh – Educational Assessment, 2013
In this article, we highlight the need for a precisely defined construct in score-based validation and discuss the contribution of cognitive theories to accurately and comprehensively defining the construct. We propose a framework for integrating cognitively based theoretical and empirical evidence to specify and evaluate the construct. We apply…
Descriptors: Test Validity, Construct Validity, Scores, Evidence
Schneider, M. Christina; Huff, Kristen L.; Egan, Karla L.; Gaines, Margie L.; Ferrara, Steve – Educational Assessment, 2013
A primary goal of standards-based statewide achievement tests is to classify students into achievement levels that enable valid inferences about student content area knowledge and skill. Explicating how knowledge and skills are expected to differ in complexity in achievement level descriptors, and how that complexity is related to empirical item…
Descriptors: Test Items, Difficulty Level, Achievement Tests, Test Interpretation
Sparfeldt, Jorn R.; Kimmel, Rumena; Lowenkamp, Lena; Steingraber, Antje; Rost, Detlef H. – Educational Assessment, 2012
Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N₁ = 230, N₂ = 340, N₃ = 194) worked on three…
Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4
Taylor, Catherine S.; Lee, Yoonsun – Educational Assessment, 2011
This article presents a study of ethnic Differential Item Functioning (DIF) for 4th-, 7th-, and 10th-grade reading items on a state criterion-referenced achievement test. The tests, administered 1997 to 2001, were composed of multiple-choice and constructed-response items. Item performance by focal groups (i.e., students from Asian/Pacific Island,…
Descriptors: Test Bias, Test Items, Pacific Islanders, American Indians
Wyse, Adam E.; Viger, Steven G. – Educational Assessment, 2011
An important part of test development is ensuring alignment between test forms and content standards. One common way of measuring alignment is the Webb (1997, 2007) alignment procedure. This article investigates (a) how well item writers understand components of the definition of Depth of Knowledge (DOK) from the Webb alignment procedure and (b)…
Descriptors: Test Items, Difficulty Level, Test Construction, Alignment (Education)
Liu, Ou Lydia; Lee, Hee-Sun; Linn, Marcia C. – Educational Assessment, 2011
Both multiple-choice and constructed-response items have known advantages and disadvantages in measuring scientific inquiry. In this article we explore the function of explanation multiple-choice (EMC) items and examine how EMC items differ from traditional multiple-choice and constructed-response items in measuring scientific reasoning. A group…
Descriptors: Science Tests, Multiple Choice Tests, Responses, Test Items
Kim, Do-Hong; Huynh, Huynh – Educational Assessment, 2010
This study investigated whether scores obtained from the online and paper-and-pencil administrations of the statewide end-of-course English test were equivalent for students with and without disabilities. Score comparability was evaluated by examining equivalence of factor structure (measurement invariance) and differential item and bundle…
Descriptors: Computer Assisted Testing, Language Tests, English, Scores
Liu, Ou Lydia; Lee, Hee-Sun; Linn, Marcia C. – Educational Assessment, 2010
To improve student science achievement in the United States we need inquiry-based instruction that promotes coherent understanding and assessments that are aligned with the instruction. Instead, current textbooks often offer fragmented ideas and most assessments only tap recall of details. In this study we implemented 10 inquiry-based science…
Descriptors: Inquiry, Active Learning, Science Achievement, Science Instruction
Wolf, Mikyung Kim; Leon, Seth – Educational Assessment, 2009
The purpose of the present study is to examine the language characteristics of a few states' large-scale assessments of mathematics and science and investigate whether the language demands of the items are associated with the degree of differential item functioning (DIF) for English language learner (ELL) students. A total of 542 items from 11…
Descriptors: Mathematics Tests, Science Tests, Measurement, Test Bias
Abedi, Jamal – Educational Assessment, 2009
This study compared performance of both English language learners (ELLs) and non-ELL students in Grades 4 and 8 under accommodated and nonaccommodated testing conditions. The accommodations used in this study included a computerized administration of a math test with a pop-up glossary, a customized English dictionary, extra testing time, and…
Descriptors: Computer Assisted Testing, Testing Accommodations, Mathematics Tests, Grade 4

