Publication Date
| In 2024 | 32 |
| Since 2023 | 53 |
| Since 2020 (last 5 years) | 107 |
| Since 2015 (last 10 years) | 281 |
| Since 2005 (last 20 years) | 610 |
Descriptor
Source
Author
| Keselman, H. J. | 13 |
| Yuan, Ke-Hai | 10 |
| Algina, James | 9 |
| Zhang, Zhiyong | 8 |
| Bentler, Peter M. | 7 |
| Lix, Lisa M. | 7 |
| Tipton, Elizabeth | 6 |
| Wilcox, Rand R. | 6 |
| Gorard, Stephen | 5 |
| Blankmeyer, Eric | 4 |
| Goldhaber, Dan | 4 |
| More ▼ | |
Publication Type
Education Level
Location
| Australia | 17 |
| United Kingdom | 16 |
| United Kingdom (England) | 16 |
| Germany | 13 |
| United States | 11 |
| California | 10 |
| Canada | 10 |
| Netherlands | 10 |
| Italy | 9 |
| China | 6 |
| Florida | 6 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 13 |
| Individuals with Disabilities… | 2 |
| Race to the Top | 2 |
| American Recovery and… | 1 |
| Social Security Act | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 2 |
Sass, Tim R. – National Center for Analysis of Longitudinal Data in Education Research, 2008
There is little doubt that teacher quality is a key determinant of student achievement, but finding ways to identify and reward the best teachers has proven illusive. This research brief considers the stability of value-added measures of teacher effectiveness over time and the resulting implications for the design and implementation of…
Descriptors: Teacher Effectiveness, Academic Achievement, Compensation (Remuneration), Personnel Policy
Lavy, Victor – Centre for the Economics of Education (NJ1), 2010
There are large differences across countries in instructional time in public schooling institutions. For example, among European countries such as Belgium, France and Greece, pupils aged 15 have an average of over a thousand hours per year of total compulsory classroom instruction while in England, Luxembourg and Sweden the average is only 750…
Descriptors: Time on Task, Time Factors (Learning), Academic Achievement, Achievement Gap
Sahin, Sami – Online Submission, 2008
The purpose of this study is to practice peer evaluation and to determine if the result of the evaluation shows similarity with the lecturer evaluation, thus to make assumption about the validity of peer evaluation in higher education. For this purpose, students of "Specific Teaching Methods I" class, which is included in the 3. Class of…
Descriptors: Higher Education, Peer Evaluation, Educational Technology, Teaching Methods
Gorard, Stephen – Adults Learning, 2008
When governments and pressure groups attend only to research that suits their political purposes, it makes a mockery of the idea of evidence-informed policy and practice. In this article, the author cites examples of research that picked up weak and misleading evidence for political purposes. The author argues that researchers are tied in to the…
Descriptors: Politics of Education, Educational Policy, Evidence, Theory Practice Relationship
Almond, Douglas; Mazumder, Bhashkar; van Ewijk, Reyn – Centre for the Economics of Education (NJ1), 2012
We consider the effects of daytime fasting by pregnant women during the lunar month of Ramadan on their children's test scores at age seven. Using English register data, we find that scores are 0.05 to 0.08 standard deviations lower for Pakistani and Bangladeshi students exposed to Ramadan in early pregnancy. These estimates are downward biased to…
Descriptors: Foreign Countries, Pregnancy, Eating Habits, Islam
Darling-Hammond, Linda – Stanford Center for Opportunity Policy in Education, 2009
Recent findings from a Mathematica study comparing the performance of teachers prepared via alternative and traditional routes have been interpreted to suggest that policymakers and practitioners should expand the use of fast-entry alternative routes and seek teachers trained through such programs, as they presumably perform as well in the…
Descriptors: Evidence, Traditional Schools, Educational Opportunities, Alternative Teacher Certification
Vassar, Matt; Hale, William – Journal of Interpersonal Violence, 2009
Empirical research on anger and hostility has pervaded the academic literature for more than 50 years. Accurate measurement of anger/hostility and subsequent interpretation of results requires that the instruments yield strong psychometric properties. For consistent measurement, reliability estimates must be calculated with each administration,…
Descriptors: Research Methodology, Psychometrics, Psychological Patterns, Affective Behavior
Neal, Derek; Schanzenbach, Diane Whitmore – Urban Institute (NJ1), 2009
Many test-based accountability systems, including the No Child Left Behind Act of 2001 (NCLB), place great weight on the numbers of students who score at or above specified proficiency levels in various subjects. Accountability systems based on these metrics often provide incentives for teachers and principals to target children near current…
Descriptors: Federal Legislation, Metric System, Standardized Tests, Grade 6
Royal, Kenneth D. – Online Submission, 2009
Quality measurement is essential in every form of research, including institutional research and assessment. Unfortunately, most survey research today (both published and unpublished) is lacking with regards to quality measurement. Reporting means and standard deviations based on ordinal measures is an inappropriate, yet widespread practice in the…
Descriptors: Higher Education, Institutional Research, Measurement Techniques, Item Response Theory
Sak, Ugur – Roeper Review, 2009
In this study, psychometric properties of the test of the three-mathematical minds (M3) were investigated. The M3 test was developed based on a multidimensional conception of giftedness to identify mathematically talented students. Participants included 291 middle-school students. Data analysis indicated that the M3 had a 0.73 coefficient as a…
Descriptors: Academically Gifted, Factor Analysis, Psychometrics, Ability Identification
A Generally Robust Approach for Testing Hypotheses and Setting Confidence Intervals for Effect Sizes
Keselman, H. J.; Algina, James; Lix, Lisa M.; Wilcox, Rand R.; Deering, Kathleen N. – Psychological Methods, 2008
Standard least squares analysis of variance methods suffer from poor power under arbitrarily small departures from normality and fail to control the probability of a Type I error when standard assumptions are violated. This article describes a framework for robust estimation and testing that uses trimmed means with an approximate degrees of…
Descriptors: Intervals, Testing, Least Squares Statistics, Effect Size
Meath, Sian E.; Aye, Lu; Haritos, Nicholas – Bulletin of Science, Technology & Society, 2008
This article focuses on the accuracy of satellite data, which may then be used in wave power applications. The satellite data are compared to data from wave buoys, which are currently considered to be the most accurate of the devices available for measuring wave characteristics. This article presents an analysis of satellite- (Topex/Poseidon) and…
Descriptors: Spectroscopy, Structural Analysis (Science), Satellites (Aerospace), Program Validation
Rice, Jennifer King – National Education Policy Center, 2012
Schools and school systems throughout the nation are increasingly experimenting with using various instructional technologies to improve productivity and decrease costs, but evidence on both the effectiveness and the costs of education technology is limited. A recent report published by the Thomas B. Fordham Institute sets out to describe "the…
Descriptors: Evidence, Electronic Learning, Distance Education, Online Courses
Yuan, Ke-Hai; Bentler, Peter M. – Psychometrika, 2006
An extension of multiple correspondence analysis is proposed that takes into account cluster-level heterogeneity in respondents' preferences/choices. The method involves combining multiple correspondence analysis and k-means in a unified framework. The former is used for uncovering a low-dimensional space of multivariate categorical variables…
Descriptors: Robustness (Statistics), Statistics, Item Response Theory
Setzer, J. Carl; He, Yi – GED Testing Service, 2009
Reliability Analysis for the Internationally Administered 2002 Series GED (General Educational Development) Tests Reliability refers to the consistency, or stability, of test scores when the authors administer the measurement procedure repeatedly to groups of examinees (American Educational Research Association [AERA], American Psychological…
Descriptors: Educational Research, Error of Measurement, Scores, Test Reliability

Direct link
Peer reviewed
