Publication Date
| In 2015 | 0 |
| Since 2014 | 0 |
| Since 2011 (last 5 years) | 6 |
| Since 2006 (last 10 years) | 52 |
| Since 1996 (last 20 years) | 70 |
Descriptor
| Foreign Countries | 19 |
| Comparative Analysis | 16 |
| Item Response Theory | 16 |
| Scores | 15 |
| Test Items | 15 |
| Psychometrics | 14 |
| Measures (Individuals) | 13 |
| Factor Analysis | 11 |
| Test Bias | 10 |
| Testing | 10 |
| More ▼ | |
Source
| International Journal of… | 70 |
Author
| Cascallar, Alicia S. | 2 |
| Dorans, Neil J. | 2 |
| Evers, Arne | 2 |
| Gorin, Joanna S. | 2 |
| Hau, Kit-Tai | 2 |
| Sijtsma, Klaas | 2 |
| Sireci, Stephen G. | 2 |
| Tatsuoka, Kikumi K. | 2 |
| Veldkamp, Bernard P. | 2 |
| Abdelfattah, Faisal | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 70 |
| Reports - Evaluative | 70 |
| Information Analyses | 2 |
Education Level
| Higher Education | 6 |
| Secondary Education | 4 |
| High Schools | 3 |
| Grade 8 | 2 |
| Adult Education | 1 |
| Elementary Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 3 | 1 |
| Grade 5 | 1 |
| Grade 7 | 1 |
| More ▼ | |
Audience
| Administrators | 1 |
| Counselors | 1 |
| Parents | 1 |
| Teachers | 1 |
Showing 1 to 15 of 70 results
Dodeen, Hamzeh; Abdelfattah, Faisal; Shumrani, Saleh; Hilal, Maher Abu – International Journal of Testing, 2012
This study focused on comparing mathematics teachers' qualifications, practices, and perceptions between Saudi and Taiwanese schools. Data analyzed in this study were the responses of mathematics teachers to the Teacher Background Questionnaire--8th Grade from the Trends in International Mathematics and Science Study (TIMSS) in 2007. The Saudi…
Descriptors: Grade 8, Teacher Background, Mathematics Teachers, Educational Environment
Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012
The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…
Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling
Davis-Becker, Susan L.; Buckendahl, Chad W.; Gerrow, Jack – International Journal of Testing, 2011
Throughout the world, cut scores are an important aspect of a high-stakes testing program because they are a key operational component of the interpretation of test scores. One method for setting standards that is prevalent in educational testing programs--the Bookmark method--is intended to be a less cognitively complex alternative to methods…
Descriptors: Standard Setting (Scoring), Cutting Scores, Educational Testing, Licensing Examinations (Professions)
Svetina, Dubravka; Gorin, Joanna S.; Tatsuoka, Kikumi K. – International Journal of Testing, 2011
As a construct definition, the current study develops a cognitive model describing the knowledge, skills, and abilities measured by critical reading test items on a high-stakes assessment used for selection decisions in the United States. Additionally, in order to establish generalizability of the construct meaning to other similarly structured…
Descriptors: Reading Tests, Reading Comprehension, Critical Reading, Test Items
Chulu, Bob Wajizigha; Sireci, Stephen G. – International Journal of Testing, 2011
Many examination agencies, policy makers, media houses, and the public at large make high-stakes decisions based on test scores. Unfortunately, in some cases educational tests are not statistically equated to account for test differences over time, which leads to inappropriate interpretations of students' performance. In this study we illustrate…
Descriptors: Classification, Foreign Countries, Item Response Theory, High Stakes Tests
Hjemdal, Odin; Friborg, Oddgeir; Braun, Stephanie; Kempenaers, Chantal; Linkowski, Paul; Fossion, Pierre – International Journal of Testing, 2011
The Resilience Scale for Adults (RSA) was developed and has been extensively validated in Norwegian samples. The purpose of this study was to explore the construct validity of the Resilience Scale for Adults in a French-speaking Belgian sample and test measurement invariance between the Belgian and a Norwegian sample. A Belgian student sample (N =…
Descriptors: Measurement Techniques, Construct Validity, French, Adults
Barry, Carol L.; Horst, S. Jeanne; Finney, Sara J.; Brown, Allison R.; Kopp, Jason P. – International Journal of Testing, 2010
Given the prevalence of low-stakes testing internationally (e.g., NAEP, TIMSS, PIRLS), it is crucial to try to better understand examinee motivation in these contexts. In the current study, mixture modeling results supported three different profiles of test-taking effort over the course of five tests. Classes 1 and 2 had varying levels of effort…
Descriptors: Testing, Comparative Analysis, Accountability, College Students
Evers, Arne; Sijtsma, Klaas; Lucassen, Wouter; Meijer, Rob R. – International Journal of Testing, 2010
This article describes the 2009 revision of the Dutch Rating System for Test Quality and presents the results of test ratings from almost 30 years. The rating system evaluates the quality of a test on seven criteria: theoretical basis, quality of the testing materials, comprehensiveness of the manual, norms, reliability, construct validity, and…
Descriptors: Rating Scales, Documentation, Educational Quality, Educational Testing
Schmitt, T. A.; Sass, D. A.; Sullivan, J. R.; Walker, C. M. – International Journal of Testing, 2010
Imposed time limits on computer adaptive tests (CATs) can result in examinees having difficulty completing all items, thus compromising the validity and reliability of ability estimates. In this study, the effects of speededness were explored in a simulated CAT environment by varying examinee response patterns to end-of-test items. Expectedly,…
Descriptors: Monte Carlo Methods, Simulation, Computer Assisted Testing, Adaptive Testing
In'nami, Yo; Koizumi, Rie – International Journal of Testing, 2010
Because structural equation models are widely used in testing and assessment, investigation into the accuracy of such models may help raise awareness of the value of reanalysis or replication. We focused on second language testing and learning studies and examined: (a) To what extent is information necessary for replication provided by authors?…
Descriptors: Structural Equation Models, Second Language Learning, Second Languages, Testing
Martin, Andrew J.; Hau, Kit-Tai – International Journal of Testing, 2010
The present study explored motivation and engagement among Chinese and Australian school students. Based on a sample of 528 Hong Kong Chinese 12-13 year olds and an archive sample of 6,366 Australian 12-13 year olds, achievement motivation was assessed using the Motivation and Engagement Scale-High School (MES-HS). Confirmatory factor analysis and…
Descriptors: Foreign Countries, Achievement Need, Student Motivation, Learner Engagement
Lee, Soon-Mook – International Journal of Testing, 2010
CEFA 3.02(Browne, Cudeck, Tateneni, & Mels, 2008) is a factor analysis computer program designed to perform exploratory factor analysis. It provides the main properties that are needed for exploratory factor analysis, namely a variety of factoring methods employing eight different discrepancy functions to be minimized to yield initial solutions, a…
Descriptors: Factor Structure, Computer Software, Factor Analysis, Research Methodology
Kim, Se-Kang – International Journal of Testing, 2010
The aim of the current study is to validate the invariance of major profile patterns derived from multidimensional scaling (MDS) by bootstrapping. Profile Analysis via Multidimensional Scaling (PAMS) was employed to obtain profiles and bootstrapping was used to construct the sampling distributions of the profile coordinates and the empirical…
Descriptors: Intervals, Multidimensional Scaling, Profiles, Evaluation
Bodkin-Andrews, Gawaian H.; Ha, My Trinh; Craven, Rhonda G.; Yeung, Alexander Seesing – International Journal of Testing, 2010
This investigation reports on the cross-cultural equivalence testing of the Self-Description Questionnaire II (short version; SDQII-S) for Indigenous and non-Indigenous Australian secondary student samples. A variety of statistical analysis techniques were employed to assess the psychometric properties of the SDQII-S for both the Indigenous and…
Descriptors: Indigenous Populations, Disadvantaged, Testing, Measures (Individuals)
Item Equivalence in English and Chinese Translation of a Cognitive Development Test for Preschoolers
He, Wei; Wolfe, Edward W. – International Journal of Testing, 2010
This article reports the results of a study of potential sources of item nonequivalence between English and Chinese language versions of a cognitive development test for preschool-aged children. Items were flagged for potential nonequivalence through statistical and judgment-based procedures, and the relationship between flag status and item…
Descriptors: Preschool Children, Mandarin Chinese, Cognitive Development, Item Analysis

Peer reviewed
Direct link
