Publication Date
| In 2015 | 0 |
| Since 2014 | 0 |
| Since 2011 (last 5 years) | 15 |
| Since 2006 (last 10 years) | 61 |
| Since 1996 (last 20 years) | 97 |
Author
| Abrami, Philip C. | 2 |
| Chen, Weiyun | 2 |
| Cliffordson, Christina | 2 |
| Johnson, Martin | 2 |
| Kuyper, Hans | 2 |
| Luyten, Hans | 2 |
| Marcoulides, George A. | 2 |
| Watt, Helen M. G. | 2 |
| Aarnoutse, Cor | 1 |
| Archibald, Kelsi | 1 |
Publication Type
| Journal Articles | 97 |
| Reports - Evaluative | 97 |
| Numerical/Quantitative Data | 3 |
| Information Analyses | 2 |
Education Level
| Secondary Education | 20 |
| Elementary Secondary Education | 10 |
| Higher Education | 7 |
| Elementary Education | 6 |
| Grade 8 | 6 |
| Grade 9 | 6 |
| Grade 10 | 5 |
| Grade 11 | 5 |
| High Schools | 4 |
| Postsecondary Education | 4 |
Showing 1 to 15 of 97 results
Sireci, Stephen G.; Rios, Joseph A. – Educational Research and Evaluation, 2013
There are numerous statistical procedures for detecting items that function differently across subgroups of examinees that take a test or survey. However, in endeavouring to detect items that may function differentially, selection of the statistical method is only one of many important decisions. In this article, we discuss the important decisions…
Descriptors: Effect Size, Test Bias, Item Analysis, Statistical Analysis
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Cainey, Jill; Bowker, Robert; Humphrey, Lauren; Murray, Nicola – Educational Research and Evaluation, 2012
This article discusses learning that occurred during primary school visits to the UK National Marine Aquarium (Aquarium). Before visiting, children were asked to independently create a drawing of marine life off the Devon coast and a drawing of tropical reef marine life, allowing an assessment of prior knowledge. Post-visit, children created…
Descriptors: Informal Education, Prior Learning, Recreational Facilities, Adult Learning
Wiberg, Marie – Educational Research and Evaluation, 2012
The aim of this study was to evaluate possible consequences of using unidimensional item response theory (UIRT) on a multidimensional college admission test. The test consists of 5 subscales and can be divided into two sections, that is, it can be considered both as a unidimensional and a multidimensional test. The test was examined with both UIRT…
Descriptors: College Entrance Examinations, Item Response Theory, Factor Analysis, Goodness of Fit
Martin, Stewart – Educational Research and Evaluation, 2012
This article reports a quasi-experimental study on the effects of multimedia teaching and learning in English Literature--a subject which places high cognitive load on students. A large-scale study was conducted in 4 high-achieving secondary schools to examine the differences made to students' learning and performance by the use of multimedia and…
Descriptors: English Literature, Multimedia Materials, Statistical Significance, English Instruction
Lam, Chi-Ming – Educational Research and Evaluation, 2012
This article reports the results of the first systematic, though only exploratory, study that assesses the effectiveness of the Philosophy for Children (commonly known as P4C) programme in promoting children's critical thinking in Hong Kong. Forty-two Secondary 1 students volunteered for this study, from whom 28 students were randomly selected and…
Descriptors: Foreign Countries, Critical Thinking, Skill Development, Philosophy
Johnson, Martin; Hopkin, Rebecca; Shiell, Hannah; Bell, John F. – Educational Research and Evaluation, 2012
In the UK and elsewhere, large-scale educational assessment agencies are shifting the mode of school examination marking towards having examiners mark examination scripts on screen rather than on paper. This shift has prompted questions about whether the mode of marking might influence examiner marking accuracy, particularly in relation to…
Descriptors: Correlation, Essays, Student Evaluation, Validity
Thorsen, Cecilia; Cliffordson, Christina – Educational Research and Evaluation, 2012
Research has found that grades are the most valid instruments for predicting educational success. Why grades have better predictive validity than, for example, standardized tests is not yet fully understood. One possible explanation is that grades reflect not only subject-specific knowledge and skills but also individual differences in other…
Descriptors: Grades (Scholastic), Predictive Validity, Grading, Criteria
Willis, Paul; Bland, Robert; Manka, Louise; Craft, Cec – Educational Research and Evaluation, 2012
Cross-age peer mentoring is an educational model that builds on peer support and mentoring to assist young people to enhance social relationships, develop cognitive skills, and promote positive identity development. In this article, we outline the evaluation process of a cross-age peer-mentoring program implemented in an Australian secondary…
Descriptors: Mentors, Student Attitudes, Focus Groups, School Support
Johnson, Martin; Nadas, Rita – Educational Research and Evaluation, 2012
Comparability between different educational qualifications is an important issue within policy discourse in the UK. In this context, the comparability of qualification demands has been explored through the use of expert human judgement. The involvement of human judgement in estimating assessment demands has consequences for methodology. This…
Descriptors: Educational Assessment, Evaluation Methods, Comparative Analysis, Research Methodology
Lohmeier, Jill Hendrickson; Lee, Steven W. – Educational Research and Evaluation, 2011
Evaluators are frequently asked to assess the effectiveness of school programs implemented to improve academic achievement. School connectedness has been shown to be directly related to academic achievement (McNeely, Nonnemaker, & Blum, 2002) and is therefore of interest to evaluators. The construct of school connectedness has been shown to…
Descriptors: Urban Schools, Suburban Schools, Evaluators, Academic Achievement
Chen, Weiyun; Hendricks, Kristin; Archibald, Kelsi – Educational Research and Evaluation, 2011
The purpose of this study was to design and validate the Assessing Quality Teaching Rubrics (AQTR) that assesses the pre-service teachers' quality teaching practices in a live lesson or a videotaped lesson. Twenty-one lessons taught by 13 Physical Education Teacher Education (PETE) students were videotaped. The videotaped lessons were evaluated…
Descriptors: Preservice Teachers, Teaching Skills, Physical Education, Construct Validity
Advantages of the Rasch Measurement Model in Analysing Educational Tests: An Applicator's Reflection
Tormakangas, Kari – Educational Research and Evaluation, 2011
Educational achievement is a very important issue for parents, teachers, and the government. An accurate measurement plays a very important role in evaluating achievement fairly, and, therefore, analysis methods have been developed considerably in recent years. Education based on long-time learning processes forms a fruitful base for item tests,…
Descriptors: Test Items, Item Analysis, Learning Processes, Item Response Theory
Eggen, Theo J. H. M. – Educational Research and Evaluation, 2011
If classification in a limited number of categories is the purpose of testing, computerized adaptive tests (CATs) with algorithms based on sequential statistical testing perform better than estimation-based CATs (e.g., Eggen & Straetmans, 2000). In these computerized classification tests (CCTs), the Sequential Probability Ratio Test (SPRT) (Wald,…
Descriptors: Test Length, Adaptive Testing, Classification, Item Analysis
Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011
Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models" to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…
Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing

