van Daal, Tine; Lesterhuis, Marije; Coertjens, Liesje; Donche, Vincent; De Maeyer, Sven – Assessment in Education: Principles, Policy & Practice, 2019
Recently, comparative judgement has been introduced as an alternative method for scoring essays. Although this method is promising in terms of obtaining reliable scores, empirical evidence concerning its validity is lacking. The current study examines implications resulting from two critical assumptions underpinning the use of comparative…
Descriptors: Academic Discourse, Validity, Writing Evaluation, Value Judgment
Manzano, Dexter L. – International Journal of Language Testing, 2022
The increasing popularity of self-assessment prompted several scholars to investigate its effectiveness and accuracy in relation to teacher assessment. However, most of these studies focused only on the consistency estimate perspective. Thus, the current study investigated the interrater reliability between self- and teacher assessment of…
Descriptors: Oral Language, Self Evaluation (Individuals), College Students, Interrater Reliability
Stager, Sheila V.; Gupta, Simran; Amdur, Richard; Bielamowicz, Steven A. – Journal of Speech, Language, and Hearing Research, 2021
Purpose: The purpose of this study was to use objective measures of glottal gap, bowing, and supraglottic compression from selected images of laryngoscopic examinations from adults over 60 years of age with voice complaints and signs of aging to test current hypotheses on whether degree of severity impacts treatment recommendations and potential…
Descriptors: Older Adults, Patients, Aging (Individuals), Voice Disorders
Saritas Akyol, Seyhan; Karakaya, Ismail – Eurasian Journal of Educational Research, 2021
Purpose: To assess students' problem-solving skills, this study aims to investigate the consistency between self- and peer-ratings in consideration of the teachers' ratings in the process. Method: This study was a descriptive study which examines the mathematical problem-solving skills with the MFRM model concerning self-, peer- and teachers'…
Descriptors: Problem Solving, Item Response Theory, Self Evaluation (Individuals), Peer Evaluation
Doosti, Mehdi; Ahmadi Safa, Mohammad – International Journal of Language Testing, 2021
This study examined the effect of rater training on promoting inter-rater reliability in oral language assessment. It also investigated whether rater training and the consideration of the examinees' expectations by the examiners have any effect on test-takers' perceptions of being fairly evaluated. To this end, four raters scored 31 Iranian…
Descriptors: Oral Language, Language Tests, Interrater Reliability, Training
Dillon, Emily; Holingue, Calliope; Herman, Dana; Landa, Rebecca J. – Journal of Speech, Language, and Hearing Research, 2021
Purpose: Social communication or pragmatic skills are continuously distributed in the general population. Impairment in these skills is associated with two clinical disorders, autism spectrum disorder (ASD) and social (pragmatic) communication disorder. Such impairment can impact a child's peer acceptance, school performance, and current and later…
Descriptors: Psychometrics, Pragmatics, Rating Scales, Elementary School Students
Lamprianou, Iasonas; Tsagari, Dina; Kyriakou, Nansia – Language Testing, 2021
This longitudinal study (2002-2014) investigates the stability of rating characteristics of a large group of raters over time in the context of the writing paper of a national high-stakes examination. The study uses one measure of rater severity and two measures of rater consistency. The results suggest that the rating characteristics of…
Descriptors: Longitudinal Studies, Evaluators, High Stakes Tests, Writing Evaluation
Al-Salmani, Fatema; Thacker, Beth – Physical Review Physics Education Research, 2021
We designed a rubric to assess free-response exam problems in order to compare thinking skills evidenced in exams in classes taught by different pedagogies. The rubric was designed based on Bloom's taxonomy and then used to code exam problems. We have analyzed historical and recent exam problems in both algebra-based and calculus-based exams. In…
Descriptors: Inquiry, Thinking Skills, Scoring Rubrics, Algebra
Kaharu, Sarintan N.; Mansyur, Jusman – Pegem Journal of Education and Instruction, 2021
This study aims to develop a test that can be used to explore mental models and representation patterns of objects in liquid fluid. The test developed by adapting the Reeves's Development Model was carried out in several stages, namely: determining the orientation and test segments; initial survey; preparation of the initial draft; try out;…
Descriptors: Test Construction, Schemata (Cognition), Scientific Concepts, Water
Shin, Sangeun; Park, HyunJu; Hill, Katya – Journal of Speech, Language, and Hearing Research, 2021
Purpose: This study is aimed to identify the high-frequency vocabulary (HFV), otherwise termed "core vocabulary" for adults with complex communication needs. Method: Three major characteristics of the HFV--a relatively small number of different words (NDW), a relatively high word frequency, and a high word commonality across…
Descriptors: Word Frequency, Vocabulary Skills, Adults, Age Differences
Wang, Peiyu; Coetzee, Karen; Strachan, Andrea; Monteiro, Sandra; Cheng, Liying – Canadian Journal of Applied Linguistics / Revue canadienne de linguistique appliquée, 2020
Internationally educated nurses' (IENs) English language proficiency is critical to professional licensure as communication is a key competency for safe practice. The Canadian English Language Benchmark Assessment for Nurses (CELBAN) is Canada's only Canadian Language Benchmarks (CLB) referenced examination used in the context of healthcare…
Descriptors: Item Response Theory, Language Tests, English (Second Language), Nurses
Koriakin, Taylor A.; McKee, Sarah L.; Schwartz, Marlene B.; Chafouleas, Sandra M. – Journal of School Health, 2020
Background: Stakeholders increasingly recognize the role of policy in implementing Whole School, Whole Community, Whole Child (WSCC) frameworks in schools; however, few tools are currently available to assess alignment between district policies and WSCC concepts. The purpose of this study was to expand the Wellness School Assessment Tool (WellSAT)…
Descriptors: School Policy, Health Services, Health Promotion, Wellness
McQuade, Richard; Kometa, Simon; Brown, Jeremy; Bevitt, Debra; Hall, Judith – Assessment & Evaluation in Higher Education, 2020
Research project modules are a key part of UK undergraduate and postgraduate bioscience degree programmes. Report marking invariably uses two assessors, but marking models are mixed with some institutions using two independent markers and others using the project supervisor as one of the assessors. This latter model is controversial with critics…
Descriptors: Foreign Countries, Research Projects, Student Research, Supervisors
Wang, Lifeng; Khalaf, Ahmad Taha; Lei, Dongyu; Gale, Mengke; Li, Jing; Jiang, Ping; Du, Jing; Yinayeti, Xuehereti; Abudureheman, Mayinuer; Wei, Yuanyuan – Advances in Physiology Education, 2020
Traditional oral examination (TOE) is criticized for the shortage of objectivity, standardization, and reliability. These perceived limitations can be mitigated by the introduction of structured oral examination (SOE). There is little evidence of the implementation of SOE in physiology laboratory courses. The purpose of this study was to…
Descriptors: Verbal Tests, Evaluation Methods, Science Laboratories, Physiology
Rogers, Kimberly Cervello; Petrulis, Robert; Yee, Sean P.; Deshler, Jessica – International Journal of Research in Undergraduate Mathematics Education, 2020
This paper presents the development and validation of the 17-item mathematics Graduate Student Instructor Observation Protocol (GSIOP) at two universities. The development of this instrument attended to some unique needs of novice undergraduate mathematics instructors while building on an existing instrument that focused on classroom interactions…
Descriptors: Measures (Individuals), Observation, Test Construction, Test Validity

