NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
No Child Left Behind Act 20011
What Works Clearinghouse Rating
Showing 1 to 15 of 509 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Saskia van Laar; Johan Braeken – International Journal of Testing, 2024
This study examined the impact of two questionnaire characteristics, scale position and questionnaire length, on the prevalence of random responders in the TIMSS 2015 eighth-grade student questionnaire. While there was no support for an absolute effect of questionnaire length, we did find a positive effect for scale position, with an increase of…
Descriptors: Middle School Students, Grade 8, Questionnaires, Test Length
Peer reviewed Peer reviewed
Direct linkDirect link
Xiaowen Liu – International Journal of Testing, 2024
Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…
Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation
Peer reviewed Peer reviewed
Direct linkDirect link
Dragos Iliescu; Dave Bartram; Pia Zeinoun; Matthias Ziegler; Paula Elosua; Stephen Sireci; Kurt F. Geisinger; Aletta Odendaal; Maria Elena Oliveri; Jon Twing; Wayne Camara – International Journal of Testing, 2024
The "Test Adaptation Reporting Standards" (TARES), or "TARES statement" was developed to alleviate the problems arising from inadequate reporting of test adaptation procedures. The TARES contains a short preamble and a checklist, that comprises an evidence-based minimum set of information for reporting in test adaptations. The…
Descriptors: Test Use, Outcome Measures, Check Lists, Evidence Based Practice
Peer reviewed Peer reviewed
Direct linkDirect link
Voss, Nathaniel M.; Chlevin-Thiele, Cassandra; Lake, Christopher J.; Warren, Chi-Leigh – International Journal of Testing, 2023
The goal of this study was to extend research on scale contextualization (i.e., frame-of-reference effect) to the decision making styles construct, compare the effects of contextualization across three unique decision style scales, and examine the consequences of scale contextualization within an item response theory framework. Based on a mixed…
Descriptors: Item Response Theory, Decision Making, Decision Making Skills, College Students
Peer reviewed Peer reviewed
Direct linkDirect link
Sarac, Merve; Loken, Eric – International Journal of Testing, 2023
This study is an exploratory analysis of examinee behavior in a large-scale language proficiency test. Despite a number-right scoring system with no penalty for guessing, we found that 16% of examinees omitted at least one answer and that women were more likely than men to omit answers. Item-response theory analyses treating the omitted responses…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Second Language Learning
Peer reviewed Peer reviewed
Direct linkDirect link
Cassiani-Miranda, Carlos Arturo; Pedrozo-Pupo, John Carlos; Campo-Arias, Adalberto – International Journal of Testing, 2023
The study aimed to adapt and evaluate a scale to measure COVID-19-CED in COVID-19 survivors. A sample of 330 COVID-19 survivors filled out the COVID-19 Perceived Discrimination Scale (C-19-PDS). C-19-PDS was adapted from the Tuberculosis Perceived Discrimination Scale (11 items). Confirmatory factor analysis showed poor goodness-of-fit indicators.…
Descriptors: COVID-19, Pandemics, Test Construction, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Bulut, Hatice Cigdem; Bulut, Okan; Arikan, Serkan – International Journal of Testing, 2023
This study examined group differences in online reading comprehension (ORC) using student data from the 2016 administration of the Progress in International Reading Literacy Study (ePIRLS). An explanatory item response modeling approach was used to explore the effects of item properties (i.e., item format, text complexity, and cognitive…
Descriptors: International Assessment, Achievement Tests, Grade 4, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
de Francisco Carvalho, Lucas; Santos, Camila Grillo; Fernandes, Nelson, Junior; da Rocha, Rafael Moreton Alves; Flores, Talita Meireles; Machado, Gisele Magarotto – International Journal of Testing, 2023
We aimed to refine the previously proposed antisocial subscale for the Dimensional Clinical Personality Inventory 2 (IDCP-ASPD). The sample involved 628 Brazilian adults between 18 and 81 years old. We administered the revised ASPD subscale (IDCP-ASPD-R), the Affective and Cognitive Measure of Empathy (ACME), the Crime and Analogous Behavior Scale…
Descriptors: Personality Measures, Personality Traits, Antisocial Behavior, Empathy
Peer reviewed Peer reviewed
Direct linkDirect link
de Francisco Carvalho, Lucas; Gonçalves, André Pereira; Romano, Amanda Rizzieri; Montes, Antônio da Conceição; Machado, Gisele Magarotto; Pianowski, Giselle – International Journal of Testing, 2023
We developed and validated a self-report scale for screening pathological traits of dependent personality disorder (DPD) from the Hierarchical Taxonomy of psychopathology (HiTOP) perspective. The sample was 693 adults who answered the new scale, the Dimensional Clinical Personality Inventory DPD (IDCP-DPD), the PID-5, the FFDI, and the FFBI. The…
Descriptors: Adults, Personality Problems, Pathology, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Badham, Louise; Furlong, Antony – International Journal of Testing, 2023
Multilingual summative assessments face significant challenges due to tensions that exist between multiple language provision and comparability. Yet, conventional approaches for investigating comparability in multilingual assessments fail to accommodate assessments that comprise extended responses that target complex constructs. This article…
Descriptors: Summative Evaluation, Multilingualism, Comparative Analysis, Literature
Peer reviewed Peer reviewed
Direct linkDirect link
Wu, Rongxiu; Chiu, Chungyi; Dueber, David; Park, Mirang; Lange, Dustin; Umucu, Emre; Strauser, David – International Journal of Testing, 2023
The current study examined the factor structure, measurement invariance, and construct validity of the 14-item Revised Developmental Work Personality Scale (RDWPS) using a sample of 603 college students in a Midwest university of the United States. Exploratory and confirmatory factor analysis results indicated that the 11-item RDWPS resulted in a…
Descriptors: Test Validity, College Students, Gender Differences, Personality Traits
Peer reviewed Peer reviewed
Direct linkDirect link
Streckert, Nico; Kurtz, Lara; Kajonius, Petri J. – International Journal of Testing, 2023
The Dark Factor of Personality (D) measures the latent core of antagonistic traits. The present study evaluated the psychometric properties of the Swedish version of the full (D70) and the brief (D16) versions, concerning structural validity, item information, and convergent validity. An online sample (N = 294) was analyzed using CFA (Maximum…
Descriptors: Foreign Countries, Personality, Measures (Individuals), Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Martí Valls, Carla; Balazadeh, Kitty; Kajonius, Petri – International Journal of Testing, 2023
The Alternative DSM-5 Model for Personality Disorders (AMPD) consists of level of personality functioning (Criterion A) and maladaptive personality traits (Criterion B). The brief scale versions of these are understudied, while often being used by clinicians and researchers. In this study, we wanted to investigate the overlap and predictive…
Descriptors: Personality Problems, Factor Analysis, Personality Traits, Guides
Peer reviewed Peer reviewed
Direct linkDirect link
Estaji, Masoomeh; Banitalebi, Zahra – International Journal of Testing, 2023
This study used Latent Growth Curve Modeling (LGCM) to examine the overtime patterns of the score and test-taking strategy changes in an international high-stakes standardized proficiency test. To this end, the test records of 178 Iranian IELTS repeaters were analyzed, using close- and open-ended questionnaires to measure test scores as a function…
Descriptors: Foreign Countries, Language Tests, Second Language Learning, English (Second Language)
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  34