NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 256 to 270 of 3,630 results Save | Export
Steedle, Jeffrey T. – ACT, Inc., 2018
Debilitating test anxiety is a general threat to validity if it biases assessment scores. Moreover, if bias differs between demographic groups, anxiety also raises concerns about test fairness. This study applied structural equation modeling to investigate possible measurement bias due to anxiety on the ACT® assessment and relationships among…
Descriptors: Test Anxiety, College Entrance Examinations, Test Bias, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Roberson, Nathan D.; Zumbo, Bruno D. – International Journal of Testing, 2019
This paper investigates measurement invariance as it relates to migration background using the Program for International Student Assessment measure of social belonging. We explore how the use of two measurement invariance techniques provide insights into differential item functioning using the alignment method in conjunction with logistic…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Woods, Kevin; McCaldin, Tamsin; Hipkiss, Amanda; Tyrrell, Beverley; Dawes, Megan – Oxford Review of Education, 2019
This paper presents a novel explanation for the continued absence of a children's rights strategy within high-stakes educational assessment with reference to the competing purposes of high-stakes assessments and group-based constructions of fairness in assessment. We provide an original critique of group-based perspectives on the validity of…
Descriptors: Childrens Rights, Student Rights, Student Evaluation, High Stakes Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Lang, David – Grantee Submission, 2019
Whether high-stakes exams such as the SAT or College Board AP exams should penalize incorrect answers is a controversial question. In this paper, we document that penalty functions can have differential effects depending on a student's risk tolerance. Moreover, literature shows that risk aversion tends to vary along other areas of concern such as…
Descriptors: High Stakes Tests, Risk, Item Response Theory, Test Bias
Fager, Meghan L. – ProQuest LLC, 2019
Recent research in multidimensional item response theory has introduced within-item interaction effects between latent dimensions in the prediction of item responses. The objective of this study was to extend this research to bifactor models to include an interaction effect between the general and specific latent variables measured by an item.…
Descriptors: Test Items, Item Response Theory, Factor Analysis, Simulation
Pearson, 2019
Pearson Test of English Academic (PTE Academic) is a computer-based international English language test. Pearson developed PTE Academic in response to demand from higher education, governments, and other customers for a test that could more accurately measure the English communication skills of international students in an academic environment.…
Descriptors: Language Tests, English for Academic Purposes, Computer Assisted Testing, Communication Skills
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Doosti, Mehdi; Ahmadi Safa, Mohammad – International Journal of Language Testing, 2021
This study examined the effect of rater training on promoting inter-rater reliability in oral language assessment. It also investigated whether rater training and the consideration of the examinees' expectations by the examiners have any effect on test-takers' perceptions of being fairly evaluated. To this end, four raters scored 31 Iranian…
Descriptors: Oral Language, Language Tests, Interrater Reliability, Training
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Uyar, Seyma – Eurasian Journal of Educational Research, 2020
Purpose: This study aimed to compare the performance of latent class differential item functioning (DIF) approach and IRT based DIF methods using manifest grouping. With this study, it was thought to draw attention to carry out latent class DIF studies in Turkey. The purpose of this study was to examine DIF in PISA 2015 science data set. Research…
Descriptors: Item Response Theory, Foreign Countries, Cross Cultural Studies, Item Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Houri, Alaa K.; Miller, Faith G. – Early Education and Development, 2020
Research Findings: Universal screening practices that utilize reliable and valid screening measures are vital for identifying social-emotional and behavioral (SEB) concerns for students at-risk for future behavioral and academic difficulties. Screening procedures implemented at the start of kindergarten can result in early identification and…
Descriptors: School Readiness, Kindergarten, Screening Tests, Social Development
Peer reviewed Peer reviewed
Direct linkDirect link
Drackert, Anastasia; Timukova, Anna – Language Testing, 2020
In view of the ubiquitous increase in the use of C-tests, which are almost unanimously believed to measure general language proficiency, this study investigates whether the aspects of language proficiency tapped into by the C-test format are the same when the test is taken by a learner population other than that of foreign language learners.…
Descriptors: Cloze Procedure, Language Tests, Russian, Second Language Learning
Peer reviewed Peer reviewed
Direct linkDirect link
Heritage, Brody; Mancini, Vincent; Rigoli, Daniela; Piek, Jan – British Journal of Educational Psychology, 2020
Background: The self-concept of children has an impact on later behavioural development and psychopathology; therefore, evidence of the accurate measurement of self-concept is important. Harter and Pike's (1984, "Child Development," 55, 1969) commonly used measure of self-concept, the Pictorial Scale of Perceived Competence and Social…
Descriptors: Measures (Individuals), Competence, Peer Acceptance, Self Concept
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
Peer reviewed Peer reviewed
Direct linkDirect link
Geiger, Tray; Amerein-Beardsley, Audrey – AERA Online Paper Repository, 2017
The Education Value-Added Assessment System (EVAAS), the value-added model (VAM) sold by the business analytics software company SAS Institute Inc., is advertised as offering "precise, reliable and unbiased results that go far beyond what other simplistic [value-added] models found in the market today can provide." In this study, we…
Descriptors: Value Added Models, Test Validity, Test Reliability, Teacher Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Wells, James; Henderson, Rachel; Stewart, John; Stewart, Gay; Yang, Jie; Traxler, Adrienne – Physical Review Physics Education Research, 2019
Module analysis for multiple-choice responses (MAMCR) was applied to a large sample of Force Concept Inventory (FCI) pretest and post-test responses (N[subscript pre] = 4509 and N[subscript post] = 4716) to replicate the results of the original MAMCR study and to understand the origins of the gender differences reported in a previous study of this…
Descriptors: Physics, Misconceptions, Science Tests, Scientific Concepts
Pages: 1  |  ...  |  14  |  15  |  16  |  17  |  18  |  19  |  20  |  21  |  22  |  ...  |  242