Publication Date
| In 2015 | 0 |
| Since 2014 | 1 |
| Since 2011 (last 5 years) | 7 |
| Since 2006 (last 10 years) | 16 |
| Since 1996 (last 20 years) | 23 |
Descriptor
| Test Validity | 23 |
| Student Evaluation | 7 |
| Scores | 6 |
| Test Reliability | 6 |
| Mathematics Tests | 5 |
| Academic Achievement | 4 |
| Foreign Countries | 4 |
| Models | 4 |
| Performance Based Assessment | 4 |
| Test Bias | 4 |
Author
| Baker, Eva L. | 4 |
| Wise, Steven L. | 2 |
| Young, John W. | 2 |
| Abedi, Jamal | 1 |
| Bancroft, Kim | 1 |
| Bhola, Dennison S. | 1 |
| Boone, Williame | 1 |
| Burstein, Leigh | 1 |
| Buschang, Rebecca E. | 1 |
| Chen, Chun-ya Becky | 1 |
Publication Type
| Journal Articles | 23 |
| Reports - Research | 9 |
| Reports - Evaluative | 8 |
| Reports - Descriptive | 5 |
| Information Analyses | 1 |
Education Level
| Elementary Secondary Education | 8 |
| Elementary Education | 5 |
| Higher Education | 4 |
| Postsecondary Education | 4 |
| Grade 4 | 3 |
| Grade 5 | 2 |
| Grade 8 | 2 |
| Grade 6 | 1 |
| Grade 7 | 1 |
| Intermediate Grades | 1 |
Showing 1 to 15 of 23 results
Daniels, Lia M.; Poth, Cheryl; Papile, Chiara; Hutchison, Marnie – Educational Assessment, 2014
The purpose of this study was to test the validity of the Teachers' Conceptions of Assessment Scale III-Abridged Version (CoA-IIIA; Brown, 2006), a measure created, validated, and applied outside of North America, in a sample of Canadian preservice teachers (n = 436). This work is important because although we have long known that…
Descriptors: Foreign Countries, Preservice Teachers, Attitude Measures, Test Validity
Dodeen, Hamzeh – Educational Assessment, 2013
Students' opinions continue to be a significant factor in the evaluation of teaching in higher education institutions. The purpose of this study was to psychometrically assess short students evaluation of teaching (SET) forms using the UAE University form as a model. The study evaluated the form validity, reliability, the overall question,…
Descriptors: Foreign Countries, Student Evaluation of Teacher Performance, Test Validity, Test Reliability
Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Jung, EunJu; Liu, Kimy; Geller, Josh – Educational Assessment, 2013
In this article, we highlight the need for a precisely defined construct in score-based validation and discuss the contribution of cognitive theories to accurately and comprehensively defining the construct. We propose a framework for integrating cognitively based theoretical and empirical evidence to specify and evaluate the construct. We apply…
Descriptors: Test Validity, Construct Validity, Scores, Evidence
Buschang, Rebecca E.; Chung, Gregory K. W. K.; Delacruz, Girlie C.; Baker, Eva L. – Educational Assessment, 2012
The purpose of this study was to validate inferences about scores of one task designed to measure subject matter knowledge and three tasks designed to measure aspects of pedagogical content knowledge. Evidence for the validity of inferences was based on two expectations. First, if tasks were sensitive to expertise, we would find group differences.…
Descriptors: Algebra, Mathematics Teachers, Teacher Characteristics, Knowledge Base for Teaching
Sparfeldt, Jorn R.; Kimmel, Rumena; Lowenkamp, Lena; Steingraber, Antje; Rost, Detlef H. – Educational Assessment, 2012
Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N₁ = 230, N₂ = 340, N₃ = 194) worked on three…
Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4
Cheng, Liying; DeLuca, Christopher – Educational Assessment, 2011
Test-takers' interpretations of validity as related to test constructs and test use have been widely debated in large-scale language assessment. This study contributes further evidence to this debate by examining 59 test-takers' perspectives in writing large-scale English language tests. Participants wrote about their test-taking experiences in…
Descriptors: Language Tests, Test Validity, Test Use, English
Karelitz, Tzur M.; Parrish, Deborah Montgomery; Yamada, Hiroyuki; Wilson, Mark – Educational Assessment, 2010
Assessment systems that track children's progress across time need to be sensitive to the variegated nature of development. Although instruments are commonly designed to assess behaviors within a specific age range, some children advance slower or faster than others and, as a result, often show behaviors from a younger or older age group. This…
Descriptors: Age Groups, Inferences, Test Validity, Test Reliability
Young, John W.; Steinberg, Jonathan; Cline, Fred; Stone, Elizabeth; Martiniello, Maria; Ling, Guangming; Cho, Yeonsuk – Educational Assessment, 2010
To date, assessment validity research on non-native English speaking students in the United States has focused exclusively on those who are presently English language learners (ELLs). However, little, if any, research has been conducted on two other sizable groups of language minority students: (a) bilingual or multilingual students who were…
Descriptors: Test Validity, English (Second Language), Multilingualism, Bilingualism
Young, John W. – Educational Assessment, 2009
In this article, I specify a conceptual framework for test validity research on content assessments taken by English language learners (ELLs) in U.S. schools in grades K-12. This framework is modeled after one previously delineated by Willingham et al. (1988), which was developed to guide research on students with disabilities. In this framework…
Descriptors: Test Validity, Evaluation Research, Achievement Tests, Elementary Secondary Education
Abedi, Jamal – Educational Assessment, 2009
This study compared performance of both English language learners (ELLs) and non-ELL students in Grades 4 and 8 under accommodated and nonaccommodated testing conditions. The accommodations used in this study included a computerized administration of a math test with a pop-up glossary, a customized English dictionary, extra testing time, and…
Descriptors: Computer Assisted Testing, Testing Accommodations, Mathematics Tests, Grade 4
Juttner, Melanie; Boone, Williame; Park, Soonhye; Neuhaus, Birgit J. – Educational Assessment, Evaluation and Accountability, 2013
Research on teachers' professionalism and professional development has increased in the last two decades. A main focus of this line of research has been the cognitive component of teacher professionalism, i.e., professional knowledge. Most of the previous studies on teacher knowledge--such as the Learning Mathematics for Teaching (LMT) (Hill et…
Descriptors: Science Teachers, Biology, Teacher Characteristics, Knowledge Base for Teaching
Falk, Beverly; Ort, Suzanne Wichterle; Moirs, Katie – Educational Assessment, 2007
This article describes the findings of studies conducted on a large-scale, classroom-based performance assessment of literacy for the early grades designed to provide information that is useful for reporting, as well as teaching. Technical studies found the assessment to be a promising instrument that is reliable and valid. Follow-up studies of…
Descriptors: Program Effectiveness, Performance Based Assessment, Student Evaluation, Evaluation Research
Goldschmidt, Pete; Martinez, Jose Felipe; Niemi, David; Baker, Eva L. – Educational Assessment, 2007
In this article we examine empirical evidence on the criterion, predictive, transfer, and fairness aspects of validity of a large-scale language arts performance assessment, referred to as the Performance Assignment (PA). We use multilevel models to avoid biased inferences that might result from the naturally nested data. Specifically, we examine…
Descriptors: Language Arts, Performance Based Assessment, Academic Achievement, Performance Tests
Wise, Vicki L.; Wise, Steven L.; Bhola, Dennison S. – Educational Assessment, 2006
Accountability for educational quality is a priority at all levels of education. Low-stakes testing is one way to measure the quality of education that students receive and make inferences about what students know and can do. Aggregate test scores from low-stakes testing programs are suspect, however, to the degree that these scores are influenced…
Descriptors: Motivation, Scores, Test Validity, Accountability
Bancroft, Kim – Educational Assessment, Evaluation and Accountability, 2010
As mandated by No Child Left Behind, schools must find ways to improve test scores. How do benchmark tests fare as a means of informing teachers in order to raise achievement for low-income students? This study of English language arts instruction at a low-income high school investigates the administration's use of standardized benchmark…
Descriptors: Academic Achievement, Benchmarking, Program Implementation, Language Arts
