Belzak, William; Lockwood, J. R.; Attali, Yigal – Educational Measurement: Issues and Practice, 2024
Remote proctoring, or monitoring test takers through internet-based, video-recording software, has become critical for maintaining test security on high-stakes assessments. The main role of remote proctors is to make judgments about test takers' behaviors and decide whether these behaviors constitute rule violations. Variability in proctor…
Descriptors: Computer Security, High Stakes Tests, English (Second Language), Second Language Learning
Lovett, Benjamin J. – Educational Measurement: Issues and Practice, 2023
Students with disabilities often take tests under different conditions than their peers do. Testing accommodations, which involve changes to test administration that maintain test content, include extending time limits, presenting written text through auditory means, and taking a test in a private room with fewer distractions. For some students…
Descriptors: Students with Disabilities, Testing Accommodations, Psychometrics, Student Needs
Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023
The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…
Descriptors: Item Response Theory, Standard Setting, Testing, Sampling
Camara, Wayne J.; Mattern, Krista – Educational Measurement: Issues and Practice, 2022
In 2020, the onset of COVID-19 greatly restricted access to admissions testing in higher education and required innovative solutions and flexibility such as at-home testing with remote proctoring, reducing testing time, pop-up locations, and additional testing dates. Increased focus on social justice, diversity, and fairness continued to concern…
Descriptors: College Entrance Examinations, College Admission, Decision Making, COVID-19
Sireci, Stephen G.; Suarez-Alvarez, Javier – Educational Measurement: Issues and Practice, 2022
The COVID-19 pandemic negatively affected the quality of data from educational testing programs. These data were previously used for many important purposes ranging from placing students in instructional programs to school accountability. In this article, we draw from the research design literature to point out the limitations inherent in…
Descriptors: Decision Making, Data Use, COVID-19, Pandemics
Feinberg, Richard A. – Educational Measurement: Issues and Practice, 2021
Unforeseen complications during the administration of large-scale testing programs are inevitable and can prevent examinees from accessing all test material. For classification tests in which the primary purpose is to yield a decision, such as a pass/fail result, the current study investigated a model-based standard error approach, Bayesian…
Descriptors: High Stakes Tests, Classification, Decision Making, Bayesian Statistics
Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021
Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…
Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making
Woolverton, Genevieve Alice; Pollastri, Alisha R. – Educational Measurement: Issues and Practice, 2021
Within classrooms, psychologists and teachers use direct behavior observation methods, systematic behavior observations (SBOs) and direct behavior ratings (DBRs), to gather information about students' behaviors for the purposes of making decisions related to diagnosis and classroom management or behavioral feedback, respectively. Observers use SBOs…
Descriptors: Student Behavior, Classroom Observation Techniques, Behavior Rating Scales, Behavior Patterns
Jones, Andrew T.; Kopp, Jason P.; Ong, Thai Q. – Educational Measurement: Issues and Practice, 2020
Studies investigating invariance have often been limited to measurement or prediction invariance. Selection invariance, wherein the use of test scores for classification results in equivalent classification accuracy between groups, has received comparatively little attention in the psychometric literature. Previous research suggests that some form…
Descriptors: Test Construction, Test Bias, Classification, Accuracy
Arslan, Burcu; Jiang, Yang; Keehner, Madeleine; Gong, Tao; Katz, Irvin R.; Yan, Fred – Educational Measurement: Issues and Practice, 2020
Computer-based educational assessments often include items that involve drag-and-drop responses. There are different ways that drag-and-drop items can be laid out and different choices that test developers can make when designing these items. Currently, these decisions are based on experts' professional judgments and design constraints, rather…
Descriptors: Test Items, Computer Assisted Testing, Test Format, Decision Making
Welch, Catherine J.; Dunbar, Stephen B. – Educational Measurement: Issues and Practice, 2020
The use of assessment results to inform school accountability relies on the assumption that the test design appropriately represents the content and cognitive emphasis reflected in the state's standards. Since the passage of the Every Student Succeeds Act and the certification of accountability assessments through federal peer review practices,…
Descriptors: Accountability, Test Construction, State Standards, Content Validity
Abedi, Jamal; Zhang, Yu; Rowe, Susan E.; Lee, Hansol – Educational Measurement: Issues and Practice, 2020
Research indicates that the performance gap between English Language Learners (ELLs) and their non-ELL peers is partly due to ELLs' difficulty in understanding assessment language. Accommodations have been shown to narrow this performance gap, but many accommodations studies have not used a randomized design and are based on relatively small…
Descriptors: English Language Learners, Achievement Gap, Mathematics Tests, Standards
Aray, Henry; Pedauga, Luis – Educational Measurement: Issues and Practice, 2019
This article presents a novel experimental methodology in which groups of students were offered the option to choose between two equivalent scoring rules to assess a multiple-choice test. The effect of choosing the scoring rule on marks is tested. Two major contributions arise from this research. First, it contributes to the literature on the…
Descriptors: Multiple Choice Tests, Scoring, Student Attitudes, Decision Making
Zwick, Rebecca – Educational Measurement: Issues and Practice, 2019
Selection decisions have a major impact on our education, occupation, and quality of life, and the role of standardized tests in selection has always been a source of controversy. Here, I consider various definitions of fairness in measurement and selection--those emerging from within educational measurement and statistics, those from philosophy,…
Descriptors: Culture Fair Tests, Decision Making, Standardized Tests, Selection Criteria
Attali, Yigal – Educational Measurement: Issues and Practice, 2019
Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…
Descriptors: Evaluators, Certification, High Stakes Tests, Scoring