Publication Date
| In 2015 | 0 |
| Since 2014 | 0 |
| Since 2011 (last 5 years) | 8 |
| Since 2006 (last 10 years) | 21 |
| Since 1996 (last 20 years) | 29 |
Descriptor
| Reliability | 29 |
| Foreign Countries | 9 |
| Validity | 9 |
| Scores | 8 |
| Factor Analysis | 7 |
| Correlation | 6 |
| Psychometrics | 6 |
| Factor Structure | 5 |
| Item Response Theory | 5 |
| Measures (Individuals) | 5 |
| More ▼ | |
Source
| International Journal of… | 29 |
Author
| Zumbo, Bruno D. | 3 |
| Sijtsma, Klaas | 2 |
| Bodkin-Andrews, Gawaian H. | 1 |
| Breithaupt, Krista | 1 |
| Brennan, Robert L. | 1 |
| Breyer, F. Jay | 1 |
| Byrne, Barbara M. | 1 |
| Carlstedt, Berit | 1 |
| Chen, Yi-Hsin | 1 |
| Childs, Ruth A. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 29 |
| Reports - Research | 17 |
| Reports - Evaluative | 8 |
| Reports - Descriptive | 4 |
Education Level
| Grade 8 | 2 |
| Secondary Education | 2 |
| Grade 12 | 1 |
| Grade 7 | 1 |
| Grade 9 | 1 |
| High Schools | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
Audience
Showing 1 to 15 of 29 results
Kim, Sooyeon; Moses, Tim – International Journal of Testing, 2013
The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…
Descriptors: Scoring, Test Format, Licensing Examinations (Professions), Test Items
King, Ronnel B.; Watkins, David A. – International Journal of Testing, 2013
The aim of this study is to assess the cross-cultural applicability of the Chinese version of the Inventory of School Motivation (ISM; McInerney & Sinclair, 1991) in the Hong Kong context using both within-network and between-network approaches to construct validation. The ISM measures four types of achievement goals: mastery, performance, social,…
Descriptors: Factor Analysis, Reliability, Learning Motivation, Foreign Countries
Clauser, Brian E.; Mee, Janet; Margolis, Melissa J. – International Journal of Testing, 2013
This study investigated the extent to which the performance data format impacted data use in Angoff standard setting exercises. Judges from two standard settings (a total of five panels) were randomly assigned to one of two groups. The full-data group received two types of data: (1) the proportion of examinees selecting each option and (2) plots…
Descriptors: Standard Setting (Scoring), Cutting Scores, Validity, Reliability
Kolen, Michael J.; Wang, Tianyou; Lee, Won-Chan – International Journal of Testing, 2012
Composite scores are often formed from test scores on educational achievement test batteries to provide a single index of achievement over two or more content areas or two or more item types on that test. Composite scores are subject to measurement error, and as with scores on individual tests, the amount of error variability typically depends on…
Descriptors: Mathematics Tests, Achievement Tests, College Entrance Examinations, Error of Measurement
Oliveri, Maria Elena; Olson, Brent F.; Ercikan, Kadriye; Zumbo, Bruno D. – International Journal of Testing, 2012
In this study, the Canadian English and French versions of the Problem-Solving Measure of the Programme for International Student Assessment 2003 were examined to investigate their degree of measurement comparability at the item- and test-levels. Three methods of differential item functioning (DIF) were compared: parametric and nonparametric item…
Descriptors: Foreign Students, Test Bias, Speech Communication, Effect Size
Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2012
Personnel selection shows an enduring need for short stand-alone tests consisting of, say, 5 to 15 items. Despite their efficiency, short tests are more vulnerable to measurement error than longer test versions. Consequently, the question arises to what extent reducing test length deteriorates decision quality due to increased impact of…
Descriptors: Measurement, Personnel Selection, Decision Making, Error of Measurement
Zhang, Mo; Williamson, David M.; Breyer, F. Jay; Trapani, Catherine – International Journal of Testing, 2012
This article describes two separate, related studies that provide insight into the effectiveness of "e-rater" score calibration methods based on different distributional targets. In the first study, we developed and evaluated a new type of "e-rater" scoring model that was cost-effective and applicable under conditions of absent human rating and…
Descriptors: Automation, Scoring, Models, Essay Tests
Viglione, Donald J.; Perry, William; Giromini, Luciano; Meyer, Gregory J. – International Journal of Testing, 2011
We used multiple regression to calculate a new Ego Impairment Index (EII-3). The aim was to incorporate changes in the component variables and distribution of the number of responses as found in the new Rorschach Performance Assessment System, while sustaining the validity and reliability of previous EIIs. The EII-3 formula was derived from a…
Descriptors: Test Items, Self Concept, Validity, Evaluation
Lee, John Chi-kin; Yin, Hongbiao; Zhang, Zhonghua – International Journal of Testing, 2010
This article reports the adaptation and analysis of Pintrich's Motivated Strategies for Learning Questionnaire (MSLQ) in Hong Kong. First, this study examined the psychometric qualities of the existing Chinese version of MSLQ (MSLQ-CV). Based on this examination, this study developed a revised Chinese version of MSLQ (MSLQ-RCV) for junior…
Descriptors: Foreign Countries, Questionnaires, Psychometrics, Secondary School Students
Schmitt, T. A.; Sass, D. A.; Sullivan, J. R.; Walker, C. M. – International Journal of Testing, 2010
Imposed time limits on computer adaptive tests (CATs) can result in examinees having difficulty completing all items, thus compromising the validity and reliability of ability estimates. In this study, the effects of speededness were explored in a simulated CAT environment by varying examinee response patterns to end-of-test items. Expectedly,…
Descriptors: Monte Carlo Methods, Simulation, Computer Assisted Testing, Adaptive Testing
Martin, Andrew J.; Hau, Kit-Tai – International Journal of Testing, 2010
The present study explored motivation and engagement among Chinese and Australian school students. Based on a sample of 528 Hong Kong Chinese 12-13 year olds and an archive sample of 6,366 Australian 12-13 year olds, achievement motivation was assessed using the Motivation and Engagement Scale-High School (MES-HS). Confirmatory factor analysis and…
Descriptors: Foreign Countries, Achievement Need, Student Motivation, Learner Engagement
Bodkin-Andrews, Gawaian H.; Ha, My Trinh; Craven, Rhonda G.; Yeung, Alexander Seesing – International Journal of Testing, 2010
This investigation reports on the cross-cultural equivalence testing of the Self-Description Questionnaire II (short version; SDQII-S) for Indigenous and non-Indigenous Australian secondary student samples. A variety of statistical analysis techniques were employed to assess the psychometric properties of the SDQII-S for both the Indigenous and…
Descriptors: Indigenous Populations, Disadvantaged, Testing, Measures (Individuals)
Sijtsma, Klaas – International Journal of Testing, 2009
This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…
Descriptors: Construct Validity, Reliability, Classification, Test Theory
Schechtman, Edna; Yitzhaki, Shlomo – International Journal of Testing, 2009
The huge technological improvement in data processing and the globalization have increased the demand for and the supply of indices that quantify the consequences of a policy. However, there are certain cases in which quantification may be misleading in the sense that it gives the impression of an accurate measurement while in reality it is not.…
Descriptors: Ability, Measurement, Classification, Students
Paquet, Stephanie L.; Kline, Theresa J. B. – International Journal of Testing, 2009
Cross-cultural research in many psychology-related fields is becoming commonplace. To further the research in a methodologically rigorous fashion it is critical to be able to measure adequately the constructs under investigation. This study (N = 238) examined three measures used to assess individualist and collectivist orientations. The internal…
Descriptors: Psychometrics, Individualism, Attitude Measures, Self Concept Measures
Previous Page | Next Page ยป
Pages: 1 | 2
Peer reviewed
Direct link
