Publication Date
| In 2015 | 0 |
| Since 2014 | 0 |
| Since 2011 (last 5 years) | 11 |
| Since 2006 (last 10 years) | 25 |
| Since 1996 (last 20 years) | 33 |
Descriptor
| Validity | 33 |
| Foreign Countries | 14 |
| Measures (Individuals) | 10 |
| Factor Structure | 9 |
| Item Response Theory | 9 |
| Reliability | 9 |
| Scores | 9 |
| Factor Analysis | 6 |
| Measurement | 5 |
| Models | 5 |
| More ▼ | |
Source
| International Journal of… | 33 |
Author
| Buckendahl, Chad W. | 2 |
| Zumbo, Bruno D. | 2 |
| Balaguer, Isabel | 1 |
| Bartram, Dave | 1 |
| Beaudoin, Isabelle | 1 |
| Beltyukova, Svetlana | 1 |
| Breithaupt, Krista | 1 |
| Byrne, Barbara M. | 1 |
| Castillo, Isabel | 1 |
| Charnas, Jocelyn W. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 33 |
| Reports - Research | 17 |
| Reports - Evaluative | 10 |
| Reports - Descriptive | 4 |
| Information Analyses | 1 |
| Reports - General | 1 |
Education Level
| Grade 8 | 3 |
| Higher Education | 3 |
| Secondary Education | 3 |
| High Schools | 2 |
| Postsecondary Education | 2 |
| Elementary Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 4 | 1 |
| Grade 7 | 1 |
| Grade 9 | 1 |
| More ▼ | |
Audience
Showing 1 to 15 of 33 results
Sandilands, Debra; Oliveri, Maria Elena; Zumbo, Bruno D.; Ercikan, Kadriye – International Journal of Testing, 2013
International large-scale assessments of achievement often have a large degree of differential item functioning (DIF) between countries, which can threaten score equivalence and reduce the validity of inferences based on comparisons of group performances. It is important to understand potential sources of DIF to improve the validity of future…
Descriptors: Validity, Measures (Individuals), International Studies, Foreign Countries
Davis-Becker, Susan L.; Buckendahl, Chad W. – International Journal of Testing, 2013
A critical component of the standard setting process is collecting evidence to evaluate the recommended cut scores and their use for making decisions and classifying students based on test performance. Kane (1994, 2001) proposed a framework by which practitioners can identify and evaluate evidence of the results of the standard setting from (1)…
Descriptors: Standard Setting (Scoring), Evidence, Validity, Cutting Scores
Lim, Gad S.; Geranpayeh, Ardeshir; Khalifa, Hanan; Buckendahl, Chad W. – International Journal of Testing, 2013
Standard setting theory has largely developed with reference to a typical situation, determining a level or levels of performance for one exam for one context. However, standard setting is now being used with international reference frameworks, where some parameters and assumptions of classical standard setting do not hold. We consider the…
Descriptors: Standard Setting (Scoring), Validity, Models, Language Tests
Clauser, Brian E.; Mee, Janet; Margolis, Melissa J. – International Journal of Testing, 2013
This study investigated the extent to which the performance data format impacted data use in Angoff standard setting exercises. Judges from two standard settings (a total of five panels) were randomly assigned to one of two groups. The full-data group received two types of data: (1) the proportion of examinees selecting each option and (2) plots…
Descriptors: Standard Setting (Scoring), Cutting Scores, Validity, Reliability
Mucherah, Winnie; Finch, W. Holmes; Keaikitse, Setlhomo – International Journal of Testing, 2012
Understanding adolescent self-concept is of great concern for educators, mental health professionals, and parents, as research consistently demonstrates that low self-concept is related to a number of problem behaviors and poor outcomes. Thus, accurate measurements of self-concept are key, and the validity of such measurements, including the…
Descriptors: Test Bias, Mental Health Workers, Validity, Self Concept Measures
Duong, Minh Q.; von Davier, Alina A. – International Journal of Testing, 2012
Test equating is a statistical procedure for adjusting for test form differences in difficulty in a standardized assessment. Equating results are supposed to hold for a specified target population (Kolen & Brennan, 2004; von Davier, Holland, & Thayer, 2004) and to be (relatively) independent of the subpopulations from the target population (see…
Descriptors: Ability Grouping, Difficulty Level, Psychometrics, Statistical Analysis
Gattamorta, Karina A.; Penfield, Randall D.; Myers, Nicholas D. – International Journal of Testing, 2012
Measurement invariance is a common consideration in the evaluation of the validity and fairness of test scores when the tested population contains distinct groups of examinees, such as examinees receiving different forms of a translated test. Measurement invariance in polytomous items has traditionally been evaluated at the item-level,…
Descriptors: Foreign Countries, Psychometrics, Test Bias, Test Items
Hopfenbeck, Therese N.; Maul, Andrew – International Journal of Testing, 2011
The aim of this study was to investigate response-process based evidence for the validity of the Programme for International Student Assessment's (PISA) self-report questionnaire scales as measures of specific psychological constructs, with a focus on scales meant to measure inclination toward specific learning strategies. Cognitive interviews (N…
Descriptors: Student Reaction, Learning Strategies, Validity, Questionnaires
Viglione, Donald J.; Perry, William; Giromini, Luciano; Meyer, Gregory J. – International Journal of Testing, 2011
We used multiple regression to calculate a new Ego Impairment Index (EII-3). The aim was to incorporate changes in the component variables and distribution of the number of responses as found in the new Rorschach Performance Assessment System, while sustaining the validity and reliability of previous EIIs. The EII-3 formula was derived from a…
Descriptors: Test Items, Self Concept, Validity, Evaluation
D'Agostino, Jerome; Karpinski, Aryn; Welsh, Megan – International Journal of Testing, 2011
After a test is developed, most content validation analyses shift from ascertaining domain definition to studying domain representation and relevance because the domain is assumed to be set once a test exists. We present an approach that allows for the examination of alternative domain structures based on extant test items. In our example based on…
Descriptors: Expertise, Test Items, Mathematics Tests, Factor Analysis
Xie, Qin – International Journal of Testing, 2011
This study examined test takers' perception of assessment demand and its impact on the measurement of intended constructs. More than 800 test takers took a pre- and a posttest of College English Test Band 4 and filled in a perception questionnaire to report the skills they perceive as necessary for answering the test. The study found test takers…
Descriptors: College English, Reading Tests, Essay Tests, Academic Achievement
Crocetti, Elisabetta; Shokri, Omid – International Journal of Testing, 2010
The purpose of this study was to validate the Iranian version of the Identity Style Inventory (ISI). Participants were 376 (42% males) university students. Confirmatory factor analyses revealed a clear three-factor structure of identity style and a mono-factor structure of commitment in the overall sample as well as in gender subgroups. Convergent…
Descriptors: Validity, Self Concept Measures, College Students, Adjustment (to Environment)
Lee, John Chi-kin; Yin, Hongbiao; Zhang, Zhonghua – International Journal of Testing, 2010
This article reports the adaptation and analysis of Pintrich's Motivated Strategies for Learning Questionnaire (MSLQ) in Hong Kong. First, this study examined the psychometric qualities of the existing Chinese version of MSLQ (MSLQ-CV). Based on this examination, this study developed a revised Chinese version of MSLQ (MSLQ-RCV) for junior…
Descriptors: Foreign Countries, Questionnaires, Psychometrics, Secondary School Students
Byrne, Barbara M.; van de Vijver, Fons J. R. – International Journal of Testing, 2010
A critical assumption in cross-cultural comparative research is that the instrument measures the same construct(s) in exactly the same way across all groups (i.e., the instrument is measurement and structurally equivalent). Structural equation modeling (SEM) procedures are commonly used in testing these assumptions of multigroup equivalence.…
Descriptors: Measures (Individuals), Cross Cultural Studies, Measurement, Comparative Analysis
Moura, Octavio; dos Santos, Rute Andrade; Rocha, Magda; Matos, Paula Mena – International Journal of Testing, 2010
The Children's Perception of Interparental Conflict Scale (CPIC) is based on the cognitive-contextual framework for understanding interparental conflict. This study investigates the factor validity and the invariance of two factor models of CPIC within a sample of Portuguese adolescents and emerging adults (14 to 25 years old; N = 677). At the…
Descriptors: Conflict, Factor Structure, Adolescents, Measures (Individuals)

Peer reviewed
Direct link
