Publication Date
| In 2024 | 19 |
| Since 2023 | 40 |
| Since 2020 (last 5 years) | 133 |
| Since 2015 (last 10 years) | 325 |
| Since 2005 (last 20 years) | 695 |
Descriptor
| Cutting Scores | 1704 |
| Test Validity | 634 |
| Test Reliability | 571 |
| Evaluation Criteria | 483 |
| Aptitude Tests | 445 |
| Norms | 441 |
| Job Skills | 434 |
| Personnel Evaluation | 431 |
| Job Applicants | 429 |
| Career Guidance | 425 |
| Standard Setting (Scoring) | 228 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Elementary Education | 149 |
| Higher Education | 132 |
| Postsecondary Education | 105 |
| Elementary Secondary Education | 103 |
| Secondary Education | 101 |
| Middle Schools | 88 |
| Grade 3 | 86 |
| Grade 4 | 81 |
| Grade 8 | 81 |
| Grade 5 | 79 |
| Grade 6 | 68 |
| More ▼ | |
Audience
| Researchers | 58 |
| Practitioners | 14 |
| Policymakers | 11 |
| Teachers | 11 |
| Administrators | 5 |
| Students | 4 |
| Parents | 1 |
Location
| California | 29 |
| Florida | 28 |
| Texas | 20 |
| Canada | 16 |
| New York | 15 |
| Massachusetts | 14 |
| North Carolina | 14 |
| United Kingdom | 14 |
| Washington | 13 |
| Pennsylvania | 12 |
| New Jersey | 11 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 3 |
Schmidgall, Jonathan – Educational Testing Service, 2021
The redesigned "TOEIC Bridge"® tests are designed to measure the reading, listening, speaking, and writing proficiency of beginning to low-intermediate English learners in the context of everyday adult life. This report describes the comprehensive and multifaceted process used to enhance the meaningfulness of TOEIC Bridge test score…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Language Proficiency
Elizabeth H. Park – ProQuest LLC, 2021
Scholars have long explored the lack of diversity in gifted-and-talented education and specifically the role that gifted-and-talented test performance plays as a barrier to access. However, there is limited work, particularly quantitative work, examining the ways in which policies perpetuate racial/ethnic and socioeconomic inequalities within the…
Descriptors: Student Diversity, Academically Gifted, Gifted Education, Racism
Kampa, Nele; Wagner, Helene; Köller, Olaf – Large-scale Assessments in Education, 2019
Background: Stakeholders' interpretations of the findings of large-scale educational assessments can influence important decisions. In the context of educational assessment, standard-setting remains an especially critical element, because it is complex and largely unstandardized. Instruments established by means of standard-setting procedures such…
Descriptors: Standard Setting (Scoring), Test Interpretation, Stakeholders, Validity
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
Wu, Chin-Chin; Chu, Ching-Lin; Stewart, Lydia; Chiang, Chung-Hsin; Hou, Yuh-Ming; Liu, Jiun-Horng – Journal of Autism and Developmental Disorders, 2020
The present longitudinal study examined the utility of the screening tool for autism in 2-year-olds (STAT) in detecting autism spectrum disorder (ASD) in toddlers who are less than 24 months of age. The study sample, which consisted of 119 toddlers with developmental problems, were assessed when they were between 16 and 24 months of age (Time 1)…
Descriptors: Foreign Countries, Longitudinal Studies, Toddlers, Cutting Scores
Kara, Hakan; Cetin, Sevda – International Journal of Assessment Tools in Education, 2020
In this study, the efficiency of various random sampling methods to reduce the number of items rated by judges in an Angoff standard-setting study was examined and the methods were compared with each other. Firstly, the full-length test was formed by combining Placement Test 2012 and 2013 mathematics subsets. After then, simple random sampling…
Descriptors: Cutting Scores, Standard Setting (Scoring), Sampling, Error of Measurement
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments, that are generated from a large test-item database, maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity it is important that all instances of an assessment, that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
DeHondt, Benjamin G.; Madi, Samar A.; Drignei, Dorin; Buchan, Duncan S.; Brown, Elise C. – Measurement in Physical Education and Exercise Science, 2023
Identification of cardiometabolic risk (CMR) in U.S. younger population by assessing muscular strength via handgrip (HG) dynamometry may aid in prevention efforts. Currently, no nationally representative HG cut-points are available for identifying increased CMR in U.S. adolescents or young adults. In this study, we propose normalized grip strength…
Descriptors: Muscular Strength, Adolescents, Young Adults, Screening Tests
Wyse, Adam E. – Applied Measurement in Education, 2018
An important consideration in standard setting is recruiting a group of panelists with different experiences and backgrounds to serve on the standard-setting panel. This study uses data from 14 different Angoff standard settings from a variety of medical imaging credentialing programs to examine whether people with different professional roles and…
Descriptors: Standard Setting (Scoring), Test Construction, Cutting Scores, Accuracy
Courteau, Émilie; Loignon, Guillaume; Steinhauer, Karsten; Royle, Phaedra – Journal of Speech, Language, and Hearing Research, 2023
Purpose: This research aimed to identify reliable tasks discriminating French-speaking adolescents with developmental language disorder (DLD) from their peers with typical language (TL) and to assess which linguistic domains represent areas of particular weakness in DLD. Unlike English, morphosyntax has not been identified as a special area of…
Descriptors: French, Language Impairments, Developmental Delays, Morphology (Languages)
Eunice Eunhee Jang; Christie Barron; Hyunah Kim; Bruce Russell – Language Teaching Research Quarterly, 2023
Research on the use of standardized test scores in higher education reveals significant variations in attitudes and perceptions of language proficiency tests among test score users. Most test score users have limited knowledge about test score interpretations in terms of what English as additional language (EAL) students typically know and can do…
Descriptors: Scores, Standardized Tests, English (Second Language), Second Language Learning
Bramley, Tom – Research Matters, 2020
The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…
Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy
Peabody, Michael R.; Wind, Stefanie A. – Journal of Educational Measurement, 2019
Setting performance standards is a judgmental process involving human opinions and values as well as technical and empirical considerations. Although all cut score decisions are by nature somewhat arbitrary, they should not be capricious. Judges selected for standard-setting panels should have the proper qualifications to make the judgments asked…
Descriptors: Standard Setting, Decision Making, Performance Based Assessment, Evaluators
Insufficient Effort Responding in Surveys Assessing Self-Regulated Learning: Nuisance or Fatal Flaw?
Iaconelli, Ryan; Wolters, Christopher A. – Frontline Learning Research, 2020
Despite concerns about their validity, self-report surveys remain the primary data collection method in the research of self-regulated learning (SRL). To address some of these concerns, we took a data set comprised of college students' self-reported beliefs and behaviours related to SRL, assessed across three surveys, and examined it for instances…
Descriptors: Metacognition, College Students, Student Attitudes, Validity
Lee, Helen Y.; Vigen, Cheryl; Zwaigenbaum, Lonnie; Bryson, Susan; Smith, Isabel; Brian, Jessica; Watson, Linda R.; Crais, Elizabeth R.; Turner-Brown, Lauren; Reznick, J. Steven; Baranek, Grace T. – Journal of Autism and Developmental Disorders, 2019
This study examined the performance of the First Year Inventory (FYI; version 2.0), a community-normed parent-reported screening instrument, in a high-risk (HR) sample of 12-month-olds with older siblings diagnosed with autism spectrum disorder (ASD). The FYI 2.0 was completed by parents of 86 HR infants and 35 low-risk control infants at age…
Descriptors: Toddlers, Screening Tests, At Risk Persons, Autism

Direct link
Peer reviewed
