Publication Date
| In 2024 | 19 |
| Since 2023 | 40 |
| Since 2020 (last 5 years) | 133 |
| Since 2015 (last 10 years) | 325 |
| Since 2005 (last 20 years) | 695 |
Descriptor
| Cutting Scores | 1704 |
| Test Validity | 634 |
| Test Reliability | 571 |
| Evaluation Criteria | 483 |
| Aptitude Tests | 445 |
| Norms | 441 |
| Job Skills | 434 |
| Personnel Evaluation | 431 |
| Job Applicants | 429 |
| Career Guidance | 425 |
| Standard Setting (Scoring) | 228 |
Education Level
| Elementary Education | 149 |
| Higher Education | 132 |
| Postsecondary Education | 105 |
| Elementary Secondary Education | 103 |
| Secondary Education | 101 |
| Middle Schools | 88 |
| Grade 3 | 86 |
| Grade 4 | 81 |
| Grade 8 | 81 |
| Grade 5 | 79 |
| Grade 6 | 68 |
Audience
| Researchers | 58 |
| Practitioners | 14 |
| Policymakers | 11 |
| Teachers | 11 |
| Administrators | 5 |
| Students | 4 |
| Parents | 1 |
Location
| California | 29 |
| Florida | 28 |
| Texas | 20 |
| Canada | 16 |
| New York | 15 |
| Massachusetts | 14 |
| North Carolina | 14 |
| United Kingdom | 14 |
| Washington | 13 |
| Pennsylvania | 12 |
| New Jersey | 11 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 3 |
Orr, Margaret Terry; Hollingworth, Liz – Journal of Educational Administration, 2023
Purpose: This paper explores the school leadership career outcomes, timing and educator evaluation of those who complete the Massachusetts Performance Assessment for Leaders (PAL) in comparison with others who did not. It also compares outcomes for those with different PAL score completion requirements. Design/methodology/approach: Using PAL…
Descriptors: Labor Market, State Policy, State Licensing Boards, Certification
Eldeeb, Sherief Y.; Ludwig, Natasha N.; Wieckowski, Andrea Trubanova; Dieckhaus, Mary F. S.; Algur, Yasemin; Ryan, Victoria; Dufek, Sarah; Stahmer, Aubyn; Robins, Diana L. – Autism: The International Journal of Research and Practice, 2023
Males are more likely to be diagnosed with autism than females, and at earlier ages, yet few studies examine sex differences in screening. This study explored sex differences in psychometric properties, recommended cutoff scores, and overall scores of the Modified Checklist for Autism in Toddlers, Revised, with Follow-Up. Participants were 28,088…
Descriptors: Gender Differences, Autism Spectrum Disorders, Screening Tests, Disability Identification
Yang, Shuran; Han, Dong; Zhou, Huizhi; Yang, Chen; Zhang, Kun; Chen, Shi; Yang, Runxu; Cao, Xia; Grodberg, David; Zhao, Xudong; Kang, Chuanyuan – Journal of Autism and Developmental Disorders, 2023
The Autism Mental Status exam (AMSE) has demonstrated excellent sensitivity and specificity in Western high-risk populations with suspected autism spectrum disorder (ASD). This study aimed to evaluate the psychometric properties of the AMSE in a sample of high-risk Chinese children, and to determine the optimal cutoff score of the Chinese version…
Descriptors: Validity, Cutting Scores, Accuracy, Diagnostic Tests
Baldwin, Peter; Margolis, Melissa J.; Clauser, Brian E.; Mee, Janet; Winward, Marcia – Educational Measurement: Issues and Practice, 2020
Evidence of the internal consistency of standard-setting judgments is a critical part of the validity argument for tests used to make classification decisions. The bookmark standard-setting procedure is a popular approach to establishing performance standards, but there is relatively little research that reflects on the internal consistency of the…
Descriptors: Standard Setting (Scoring), Probability, Cutting Scores, Evaluation Methods
Wyse, Adam E. – Educational Measurement: Issues and Practice, 2020
One commonly used compromise standard-setting method is the Beuk (1984) method. A key assumption of the Beuk method is that the emphasis given to the pass rate and the percent correct ratings should be proportional to the extent that the panelists agree on their ratings. However, whether the slope of the Beuk line reflects the emphasis that panelists…
Descriptors: Standard Setting (Scoring), Cutting Scores, Weighted Scores, Evaluation Methods
Wyse, Adam E. – Applied Measurement in Education, 2020
This article compares cut scores from two variations of the Hofstee and Beuk methods, which determine cut scores by resolving inconsistencies in panelists' judgments about cut scores and pass rates, with the Angoff method. The first variation uses responses to the Hofstee and Beuk percentage correct and pass rate questions to calculate cut scores.…
Descriptors: Cutting Scores, Evaluation Methods, Standard Setting (Scoring), Equations (Mathematics)
Karrie A. Shogren; Jesse R. Pace; Tyler A. Hicks; Sheida K. Raley; Kathleen Lynne Lane – Psychology in the Schools, 2024
This study used standard setting to establish cutscores for the fidelity of implementation of an evidence-based intervention, the Self-Determined Learning Model of Instruction (SDLMI), designed to enhance goal-directed actions in secondary students with and without disabilities. Cutscores were then applied to fidelity data from a large,…
Descriptors: Cutting Scores, Fidelity, Program Implementation, Evidence Based Practice
Morris, Nicole M.; Ingram, Paul B.; Mitchell, Sean M.; Victor, Sarah E. – Measurement and Evaluation in Counseling and Development, 2023
We investigated the validity and screening effectiveness of the PHQ-2 and PHQ-9 scores in 229 college students in a cross-sectional design. PHQ associations with Minnesota Multiphasic Personality Inventory-3 internalizing scales suggest PHQ scores are effective screening tools for college students and may aid in effective triage and service needs.
Descriptors: Personality Measures, Test Validity, College Students, Screening Tests
Oliva, Jose M.; Blanco, Ángel – European Journal of Science and Mathematics Education, 2023
A questionnaire was recently developed for use with Spanish speakers, and evidence has been provided about its internal construct validity by means of structural equation modelling. In this paper, two research questions were considered: (i) What new evidence does application of the Rasch model provide regarding the validity of this…
Descriptors: Spanish Speaking, High School Students, College Students, Item Response Theory
Weiss, Brandi A.; Dardick, William – Journal of Experimental Education, 2021
Classification measures and entropy variants can be used as indicators of model fit for logistic regression. These measures rely on a cut-point, "c," to determine predicted group membership. While recommendations exist for determining the location of the cut-point, these methods are primarily anecdotal. The current study used Monte Carlo…
Descriptors: Cutting Scores, Regression (Statistics), Classification, Monte Carlo Methods
Alexis Clyde; Danna Bismar; Gabrielle Agnew; Laura E. Kuper – Journal of Autism and Developmental Disorders, 2024
Autism spectrum disorder (ASD) and ASD symptoms are overrepresented among gender-diverse youth across studies. Gender-diverse and ASD youth are at risk for anxiety, but anxiety is unclear among gender-diverse youth with ASD. The Social Communication Questionnaire (SCQ) is a commonly used ASD screener, including in multidisciplinary…
Descriptors: Autism Spectrum Disorders, Identification, Accuracy, Interpersonal Competence
Prentza, Alexandra; Tafiadis, Dionysios; Chondrogianni, Vasiliki; Tsimpli, Ianthi-Maria – Journal of Psycholinguistic Research, 2022
This study provides a preliminary validation of a Greek Sentence Repetition Task (SRT) with a sample of 110 monolingual and bilingual typically developing (TLD) children and examines the test's ability to distinguish between Greek monolingual children and age-matched Albanian-Greek bilinguals using a Receiver Operating Characteristics (ROC)…
Descriptors: Greek, Sentences, Repetition, Monolingualism
Melissa G. Wolf; Daniel McNeish – Grantee Submission, 2023
To evaluate the fit of a confirmatory factor analysis model, researchers often rely on fit indices such as SRMR, RMSEA, and CFI. These indices are frequently compared to benchmark values of 0.08, 0.06, and 0.96, respectively, established by Hu and Bentler (1999). However, these indices are affected by model characteristics and their sensitivity to…
Descriptors: Programming Languages, Cutting Scores, Benchmarking, Factor Analysis
Acar, Selcuk; Branch, Marcus J.; Burnett, Cyndi; Cabra, John F. – Gifted Child Quarterly, 2021
Originality is scored based on standard zero-originality lists (ZOLs) in the Torrance Tests of Creative Thinking (TTCT). The applicability of those ZOLs to diverse groups has not been examined. We examined the consistency of TTCT-Figural's sample-based (SB) ZOLs and the published ZOLs based on a sample of predominantly African American college…
Descriptors: Creative Thinking, Creativity Tests, African American Students, College Students
Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020
In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…
Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles

