Publication Date
| In 2015 | 4 |
| Since 2014 | 20 |
| Since 2011 (last 5 years) | 79 |
| Since 2006 (last 10 years) | 177 |
| Since 1996 (last 20 years) | 278 |
Descriptor
| Foreign Countries | 86 |
| Test Items | 61 |
| Item Response Theory | 51 |
| Psychometrics | 50 |
| Comparative Analysis | 47 |
| Scores | 46 |
| Measures (Individuals) | 42 |
| Models | 41 |
| Test Bias | 38 |
| Evaluation Methods | 36 |
| More ▼ | |
Source
| International Journal of… | 278 |
Author
| Bartram, Dave | 7 |
| Ercikan, Kadriye | 7 |
| Zumbo, Bruno D. | 7 |
| Byrne, Barbara M. | 5 |
| Oakland, Thomas | 5 |
| Sireci, Stephen G. | 5 |
| Buckendahl, Chad W. | 4 |
| Evers, Arne | 4 |
| Gregoire, Jacques | 4 |
| Hambleton, Ronald K. | 4 |
| More ▼ | |
Publication Type
Education Level
| Higher Education | 41 |
| Postsecondary Education | 18 |
| Elementary Secondary Education | 15 |
| Secondary Education | 14 |
| Elementary Education | 12 |
| High Schools | 11 |
| Grade 4 | 7 |
| Grade 8 | 6 |
| Intermediate Grades | 6 |
| Grade 3 | 4 |
| More ▼ | |
Audience
| Administrators | 1 |
| Counselors | 1 |
| Parents | 1 |
| Teachers | 1 |
Showing 106 to 120 of 278 results
Sijtsma, Klaas – International Journal of Testing, 2009
This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…
Descriptors: Construct Validity, Reliability, Classification, Test Theory
Jang, Eunice Eunhee; Roussos, Louis – International Journal of Testing, 2009
In this article we present results of a Differential Item Functioning (DIF) study using Shealy and Stout's (1993) multidimensionality-based DIF analysis framework. In this framework, differences in test score distributions across different groups of examinees may be a result of multidimensionality if secondary dimensions (not the primary dimension…
Descriptors: Test Bias, Vocabulary, English (Second Language), Scores
Le, Luc T. – International Journal of Testing, 2009
This study uses PISA cycle 3 field trial data to investigate the relationships between gender differential item functioning (DIF) across countries and test languages for science items and their formats and the four other dimensions defined in PISA framework: focus, context, competency, and scientific knowledge. The data used were collected from 60…
Descriptors: Test Bias, Gender Bias, Science Tests, Test Items
Yildirim, Huseyin Husnu; Berberoglu, Giray – International Journal of Testing, 2009
Comparisons of human characteristics across different language groups and cultures become more important in today's educational assessment practices as evidenced by the increasing interest in international comparative studies. Within this context, the fairness of the results across different language and cultural groups draws the attention of…
Descriptors: Test Bias, Cross Cultural Studies, Comparative Analysis, Factor Analysis
Kahraman, Nilufer; De Boeck, Paul; Janssen, Rianne – International Journal of Testing, 2009
This study introduces an approach for modeling multidimensional response data with construct-relevant group and domain factors. The item level parameter estimation process is extended to incorporate the refined effects of test dimension and group factors. Differences in item performances over groups are evaluated, distinguishing two levels of…
Descriptors: Test Bias, Test Items, Groups, Interaction
Schechtman, Edna; Yitzhaki, Shlomo – International Journal of Testing, 2009
The huge technological improvement in data processing and the globalization have increased the demand for and the supply of indices that quantify the consequences of a policy. However, there are certain cases in which quantification may be misleading in the sense that it gives the impression of an accurate measurement while in reality it is not.…
Descriptors: Ability, Measurement, Classification, Students
Rotsika, V.; Vlassopoulos, M.; Legaki, L.; Sini, A.; Rogakou, E.; Sakellariou, K.; Pehlivanidou, H.; Anagnostopoulos, D. C. – International Journal of Testing, 2009
This study investigates the WISC-III profile in Greek children with learning disabilities (LD). The sample consisted of 180 children diagnosed with learning disability (136 boys, 44 girls) aged 6.11 to 14.4 years. The Mean Full-scale IQ is 96.08, Mean Verbal IQ is 96.38, and Mean Performance IQ is 96.61. On individual subtests, the lowest mean…
Descriptors: Intelligence Tests, Speech Communication, Learning Disabilities, Intelligence Quotient
Solano-Flores, Guillermo; Backhoff, Eduardo; Contreras-Nino, Luis Angel – International Journal of Testing, 2009
In this article, we present a theory of test translation whose intent is to provide the conceptual foundation for effective, systematic work in the process of test translation and test translation review. According to the theory, translation error is multidimensional; it is not simply the consequence of defective translation but an inevitable fact…
Descriptors: Test Items, Investigations, Semantics, Translation
Paquet, Stephanie L.; Kline, Theresa J. B. – International Journal of Testing, 2009
Cross-cultural research in many psychology-related fields is becoming commonplace. To further the research in a methodologically rigorous fashion it is critical to be able to measure adequately the constructs under investigation. This study (N = 238) examined three measures used to assess individualist and collectivist orientations. The internal…
Descriptors: Psychometrics, Individualism, Attitude Measures, Self Concept Measures
Allalouf, Avi; Rapp, Joel; Stoller, Reuven – International Journal of Testing, 2009
When a test is adapted from a source language (SL) into a target language (TL), the two forms are usually not psychometrically equivalent. If linking between test forms is necessary, those items that have had their psychometric characteristics altered by the translation (differential item functioning [DIF] items) should be eliminated from the…
Descriptors: Test Items, Test Format, Verbal Tests, Psychometrics
Furlan, Luis Alberto; Cassady, Jerrell C.; Perez, Edgardo Raul – International Journal of Testing, 2009
A new Spanish version of the Cognitive Test Anxiety Scale (CTAS) was created to be used explicitly with Argentinean university students. The scale was translated and verified through blind back translation and given to a large sample of students majoring in psychology or chemistry (N = 752). Exploratory Factor Analysis (N = 376) showed an internal…
Descriptors: Factor Structure, Cognitive Tests, Measures (Individuals), Factor Analysis
Wiberg, Marie – International Journal of Testing, 2009
The aim of this study was to examine log linear modelling (LLM) compared with logistic regression (LR) and Mantel-Haenszel (MH) test for detecting Differential Item Functioning (DIF) in a mastery test. The three methods were chosen because they have similar components. The results showed fairly high matching percentages together with high…
Descriptors: Test Bias, Mastery Tests, Comparative Analysis, Regression (Statistics)
Liu, Ou Lydia; Wilson, Mark – International Journal of Testing, 2009
Differential gender performance in standardized mathematics assessment has long been a heated topic. Gender gaps of varied magnitude have been identified on large-scale assessments in the United States. To continue the investigation, this study examined male and female performance on the Programme for International Student Assessment (PISA) 2003…
Descriptors: Foreign Countries, Probability, Gender Differences, Standardized Tests
Khoshouei, Mahdieh Sadat – International Journal of Testing, 2009
The purpose of this study was to evaluate the psychometric properties of the Persian version of the Connor-Davidson Resilience Scale (CD-RISC). The CD-RISC was completed by a sample of 323 Isfahan university students (168 females, 155 males) aged 19-34 years. A maximum likelihood method with an oblique solution resulted in four factors…
Descriptors: Achievement Need, Student Motivation, Measures (Individuals), Maximum Likelihood Statistics
Stone, Gregory Ethan; Beltyukova, Svetlana; Fox, Christine M. – International Journal of Testing, 2008
Judge-mediated examinations are defined as those for which expert evaluation (using rubrics) is required to determine correctness, completeness, and reasonability of test-taker responses. The use of multifaceted Rasch modeling has led to improvements in the reliability of scoring such examinations. The establishment of criterion-referenced…
Descriptors: Interrater Reliability, High Stakes Tests, Standard Setting, Minimum Competencies

Peer reviewed
Direct link
