Kim, Youngdeok; Park, Ilhyeok; Kang, Minsoo – Adapted Physical Activity Quarterly, 2012
The purpose of this study was to investigate rater effects on the TGMD-2 when it is applied to children with intellectual disability. A total of 22 children with intellectual disabilities participated in this study. Children's performances in each of 12 subtests of the TGMD-2 were recorded via video and scored by three adapted physical activity…
Descriptors: Children, Mental Retardation, Motor Development, Performance Tests
Attali, Yigal – Journal of Educational Measurement, 2010
Generalizability theory and analysis of variance methods are employed, together with the concept of objective time pressure, to estimate response time distributions and the degree of time pressure in timed tests. By estimating response time variance components due to person, item, and their interaction, and fixed effects due to item types and…
Descriptors: Generalizability Theory, Statistical Analysis, Reaction Time, Timed Tests
Yelboga, Atilla; Tavsancil, Ezel – Educational Sciences: Theory and Practice, 2010
In this research, the classical test theory and generalizability theory analyses were carried out with the data obtained by a job performance scale for the years 2005 and 2006. The reliability coefficients obtained (estimated) from the classical test theory and generalizability theory analyses were compared. In classical test theory, test retest…
Descriptors: Test Theory, Generalizability Theory, Job Performance, Measures (Individuals)
Kelcey, Ben; McGinn, Daniel; Hill, Heather – Society for Research on Educational Effectiveness, 2013
Recent policy has charged schools and districts with maintaining highly qualified teachers and differentiating among teachers in terms of their effectiveness (U.S. Department of Education, 2009). This emphasis has driven the development and implementation of teacher quality measures which are increasingly being used to evaluate teachers with…
Descriptors: Teacher Effectiveness, Measures (Individuals), Observation, Teacher Evaluation
Solano-Flores, Guillermo; Li, Min – Educational Research and Evaluation, 2013
We discuss generalizability (G) theory and the fair and valid assessment of linguistic minorities, especially emergent bilinguals. G theory allows examination of the relationship between score variation and language variation (e.g., variation of proficiency across languages, language modes, and social contexts). Studies examining score variation…
Descriptors: Measurement, Testing, Language Proficiency, Test Construction
Volpe, Robert J.; Briesch, Amy M.; Gadow, Kenneth D. – Journal of School Psychology, 2011
Although the efficiency with which a wide range of behavioral data can be obtained makes behavior rating scales particularly attractive tools for the purposes of screening and evaluation, feasibility concerns arise in the context of formative assessment. Specifically, informant load, or the amount of time informants are asked to contribute to the…
Descriptors: Generalizability Theory, Formative Evaluation, Behavior Rating Scales, Measures (Individuals)
Praetorius, Anna-Katharina; Lenske, Gerlinde; Helmke, Andreas – Learning and Instruction, 2012
Despite considerable interest in the topic of instructional quality in research as well as practice, little is known about the quality of its assessment. Using generalizability analysis as well as content analysis, the present study investigates how reliably and validly instructional quality is measured by observer ratings. Twelve trained raters…
Descriptors: Student Teachers, Interrater Reliability, Content Analysis, Observation
Huerta, Margarita; Lara-Alecio, Rafael; Tong, Fuhui; Irby, Beverly J. – International Journal of Science Education, 2014
We present the development and validation of a science notebook rubric intended to measure the academic language and conceptual understanding of non-mainstream students, specifically fifth-grade male and female economically disadvantaged Hispanic English language learner (ELL) and African-American or Hispanic native English-speaking students. The…
Descriptors: Scoring Rubrics, Science Instruction, Student Journals, Academic Discourse
Keller, Lisa A.; Clauser, Brian E.; Swanson, David B. – Advances in Health Sciences Education, 2010
In recent years, demand for performance assessments has continued to grow. However, performance assessments are notorious for lower reliability, and in particular, low reliability resulting from task specificity. Since reliability analyses typically treat the performance tasks as randomly sampled from an infinite universe of tasks, these estimates…
Descriptors: Generalizability Theory, Test Reliability, Performance Based Assessment, Error of Measurement
Alkahtani, Saif F. – ProQuest LLC, 2012
The principal aim of the present study was to better guide the Quranic recitation appraisal practice by presenting an application of Generalizability theory and Many-facet Rasch Measurement Model for assessing the dependability and fit of two suggested rubrics. Recitations of 93 students were rated holistically and analytically by 3 independent…
Descriptors: Generalizability Theory, Item Response Theory, Verbal Tests, Islam
Maier, Kimberly S.; Maiti, Tapabrata; Dass, Sarat C.; Lim, Chae Young – Society for Research on Educational Effectiveness, 2012
The purpose of this study is to develop an estimate of Adequate Yearly Progress (AYP) that will allow for reliable and valid comparisons among student subgroups, schools, and districts. A shrinkage-type estimator of AYP using the Bayesian framework is described. Using simulated data, the performance of the Bayes estimator will be compared to…
Descriptors: Educational Improvement, Federal Programs, Academic Achievement, Educational Indicators
Heilmann, John; DeBrock, Lindsay; Riley-Tillman, T. Chris – American Journal of Speech-Language Pathology, 2013
Purpose: The purpose of this study was to examine the reliability of, and sources of variability in, language measures from interviews collected from young school-age children. Method: Two 10-min interviews were collected from 20 at-risk kindergarten children by an examiner using a standardized set of questions. Test-retest reliability…
Descriptors: Measures (Individuals), Structured Interviews, Reliability, Kindergarten
Carman, Carol A. – Journal of Advanced Academics, 2013
The lack of a unified definition of giftedness leads researchers to use very different operationalizations when selecting a sample of gifted individuals for use in research. We found 104 empirical articles from 38 journals that differentiated between gifted and nongifted students which were analyzed to determine the most common methods of…
Descriptors: Gifted, Educational Research, Educational History, Bibliometrics
Thipwiwatpotjana, Phantipa – ProQuest LLC, 2010
Uncertainty occurs when there is more than one realization that can represent a piece of information. This dissertation concerns only discrete realizations of an uncertainty. Different interpretations of an uncertainty and their relationships are addressed when the uncertainty is not a probability of each realization. A well-known model that can handle…
Descriptors: Intervals, Programming, Mathematical Applications, Probability
Orem, Chris D. – ProQuest LLC, 2012
Meta-assessment, or the assessment of assessment, can provide meaningful information about the trustworthiness of an academic program's assessment results (Bresciani, Gardner, & Hickmott, 2009; Palomba & Banta, 1999; Suskie, 2009). Many institutions conduct meta-assessments for their academic programs (Fulcher, Swain, & Orem, 2012),…
Descriptors: Validity, Evidence, Evaluation Methods, Meta Analysis

