Showing 1 to 15 of 25 results
Wei, Hua; Lin, Jie – International Journal of Testing, 2015
Out-of-level testing refers to the practice of assessing a student with a test that is intended for students at a higher or lower grade level. Although the appropriateness of out-of-level testing for accountability purposes has been questioned by educators and policymakers, incorporating out-of-level items in formative assessments for accurate…
Descriptors: Test Items, Computer Assisted Testing, Adaptive Testing, Instructional Program Divisions
Cui, Ying; Mousavi, Amin – International Journal of Testing, 2015
The current study applied the person-fit statistic, l_z, to data from a Canadian provincial achievement test to explore the usefulness of conducting person-fit analysis on large-scale assessments. Item parameter estimates were compared before and after the misfitting student responses, as identified by l_z, were removed. The…
Descriptors: Measurement, Achievement Tests, Comparative Analysis, Test Items
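For readers unfamiliar with the person-fit statistic named in this abstract, the following is a minimal sketch of the standard standardized log-likelihood definition of l_z for dichotomous items. It is not the authors' code: the function name `lz_person_fit` and the example probabilities are hypothetical, and the model-implied probabilities of a correct response are assumed to be supplied already (for example, from a fitted IRT model).

```python
import math


def lz_person_fit(responses, probs):
    """Standardized log-likelihood person-fit statistic (l_z).

    responses: 0/1 item scores for one examinee.
    probs: model-implied probabilities of a correct response to each item
           (assumed given; each must lie strictly between 0 and 1).
    """
    log_lik = 0.0   # observed log-likelihood of the response pattern
    expected = 0.0  # expected log-likelihood under the model
    variance = 0.0  # variance of the log-likelihood under the model
    for u, p in zip(responses, probs):
        q = 1.0 - p
        log_lik += u * math.log(p) + (1 - u) * math.log(q)
        expected += p * math.log(p) + q * math.log(q)
        variance += p * q * math.log(p / q) ** 2
    # Note: variance is zero if every probability equals 0.5,
    # in which case the statistic is undefined.
    return (log_lik - expected) / math.sqrt(variance)


# Hypothetical probabilities for five items, ordered easy to hard.
example_probs = [0.9, 0.8, 0.6, 0.4, 0.2]
consistent = [1, 1, 1, 0, 0]  # correct on easy items, incorrect on hard ones
aberrant = [0, 0, 0, 1, 1]    # the reverse pattern

lz_consistent = lz_person_fit(consistent, example_probs)
lz_aberrant = lz_person_fit(aberrant, example_probs)
```

Large negative values flag response patterns that are unlikely under the model; in this illustration the reversed pattern yields a strongly negative value, while the consistent pattern yields a positive one.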
Jurich, Daniel P.; Bradshaw, Laine P. – International Journal of Testing, 2014
The assessment of higher-education student learning outcomes is an important component in understanding the strengths and weaknesses of academic and general education programs. This study illustrates the application of diagnostic classification models, a burgeoning set of statistical models, in assessing student learning outcomes. To facilitate…
Descriptors: College Outcomes Assessment, Classification, Statistical Analysis, Models
Oliveri, Maria Elena; Ercikan, Kadriye; Zumbo, Bruno – International Journal of Testing, 2013
In this study, we investigated differential item functioning (DIF) and its sources using a latent class (LC) modeling approach. Potential sources of LC DIF related to instruction and teacher-related variables were investigated using substantive and three statistical approaches: descriptive discriminant function, multinomial logistic regression,…
Descriptors: Test Bias, Test Items, Multivariate Analysis, Discriminant Analysis
Sandilands, Debra; Oliveri, Maria Elena; Zumbo, Bruno D.; Ercikan, Kadriye – International Journal of Testing, 2013
International large-scale assessments of achievement often have a large degree of differential item functioning (DIF) between countries, which can threaten score equivalence and reduce the validity of inferences based on comparisons of group performances. It is important to understand potential sources of DIF to improve the validity of future…
Descriptors: Validity, Measures (Individuals), International Studies, Foreign Countries
Muniz, Jose; Fernandez-Hermida, Jose R.; Fonseca-Pedrero, Eduardo; Campillo-Alvarez, Angela; Pena-Suarez, Elsa – International Journal of Testing, 2012
The proper use of psychological tests requires that the measurement instruments have adequate psychometric properties, such as reliability and validity, and that the professionals who use the instruments have the necessary expertise. In this article, we present the first review of tests published in Spain, carried out with an assessment model…
Descriptors: Student Evaluation, Measurement, Foreign Countries, Psychometrics
Oliveri, Maria Elena; Olson, Brent F.; Ercikan, Kadriye; Zumbo, Bruno D. – International Journal of Testing, 2012
In this study, the Canadian English and French versions of the Problem-Solving Measure of the Programme for International Student Assessment 2003 were examined to investigate their degree of measurement comparability at the item- and test-levels. Three methods of differential item functioning (DIF) were compared: parametric and nonparametric item…
Descriptors: Foreign Students, Test Bias, Speech Communication, Effect Size
Duong, Minh Q.; von Davier, Alina A. – International Journal of Testing, 2012
Test equating is a statistical procedure for adjusting for test form differences in difficulty in a standardized assessment. Equating results are supposed to hold for a specified target population (Kolen & Brennan, 2004; von Davier, Holland, & Thayer, 2004) and to be (relatively) independent of the subpopulations from the target population (see…
Descriptors: Ability Grouping, Difficulty Level, Psychometrics, Statistical Analysis
Gattamorta, Karina A.; Penfield, Randall D.; Myers, Nicholas D. – International Journal of Testing, 2012
Measurement invariance is a common consideration in the evaluation of the validity and fairness of test scores when the tested population contains distinct groups of examinees, such as examinees receiving different forms of a translated test. Measurement invariance in polytomous items has traditionally been evaluated at the item-level,…
Descriptors: Foreign Countries, Psychometrics, Test Bias, Test Items
Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2012
Personnel selection shows an enduring need for short stand-alone tests consisting of, say, 5 to 15 items. Despite their efficiency, short tests are more vulnerable to measurement error than longer test versions. Consequently, the question arises to what extent reducing test length deteriorates decision quality due to increased impact of…
Descriptors: Measurement, Personnel Selection, Decision Making, Error of Measurement
D'Agostino, Jerome; Karpinski, Aryn; Welsh, Megan – International Journal of Testing, 2011
After a test is developed, most content validation analyses shift from ascertaining domain definition to studying domain representation and relevance because the domain is assumed to be set once a test exists. We present an approach that allows for the examination of alternative domain structures based on extant test items. In our example based on…
Descriptors: Expertise, Test Items, Mathematics Tests, Factor Analysis
Byrne, Barbara M.; van de Vijver, Fons J. R. – International Journal of Testing, 2010
A critical assumption in cross-cultural comparative research is that the instrument measures the same construct(s) in exactly the same way across all groups (i.e., the instrument is measurement and structurally equivalent). Structural equation modeling (SEM) procedures are commonly used in testing these assumptions of multigroup equivalence.…
Descriptors: Measures (Individuals), Cross Cultural Studies, Measurement, Comparative Analysis
Schechtman, Edna; Yitzhaki, Shlomo – International Journal of Testing, 2009
Huge technological improvements in data processing, together with globalization, have increased the demand for and the supply of indices that quantify the consequences of a policy. However, there are certain cases in which quantification may be misleading in the sense that it gives the impression of an accurate measurement while in reality it is not.…
Descriptors: Ability, Measurement, Classification, Students
Luo, Wenshu; Watkins, David – International Journal of Testing, 2008
Despite the importance of self-structural variables to understand self-processes, research in this area has been hampered by measurement problems. The current study seeks to clarify this situation by examining the interrelationships among six self-structural measures of trait-sorting data of 252 Chinese college students: the "H" statistic of…
Descriptors: Measurement, Measures (Individuals), Correlation, Personality Traits
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
