Publication Date
| In 2015 | 0 |
| Since 2014 | 1 |
| Since 2011 (last 5 years) | 2 |
| Since 2006 (last 10 years) | 5 |
| Since 1996 (last 20 years) | 6 |
Descriptor
| Language Tests | 5 |
| Second Language Learning | 5 |
| Scoring | 4 |
| English (Second Language) | 3 |
| Oral Language | 3 |
| Computer Assisted Testing | 2 |
| Correlation | 2 |
| Evaluators | 2 |
| Familiarity | 2 |
| Graphs | 2 |
| More ▼ | |
Source
| Language Testing | 6 |
Author
| Xi, Xiaoming | 6 |
| Higgins, Derrick | 1 |
| Ling, Guangming | 1 |
| Mollaun, Pamela | 1 |
| Williamson, David | 1 |
| Zechner, Klaus | 1 |
Publication Type
| Journal Articles | 6 |
| Reports - Evaluative | 3 |
| Reports - Research | 3 |
Education Level
| Higher Education | 1 |
Audience
Showing all 6 results
Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming – Language Testing, 2014
The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we…
Descriptors: Evaluators, Fatigue (Biology), Scoring, Performance
Xi, Xiaoming; Higgins, Derrick; Zechner, Klaus; Williamson, David – Language Testing, 2012
This paper compares two alternative scoring methods--multiple regression and classification trees--for an automated speech scoring system used in a practice environment. The two methods were evaluated on two criteria: construct representation and empirical performance in predicting human scores. The empirical performance of the two scoring models…
Descriptors: Scoring, Classification, Weighted Scores, Comparative Analysis
Xi, Xiaoming – Language Testing, 2010
Previous test fairness frameworks have greatly expanded the scope of fairness, but do not provide a means to fully integrate fairness investigations and set priorities. This article proposes an approach to guide practitioners on fairness research and practices. This approach treats fairness as an aspect of validity and conceptualizes it as…
Descriptors: Test Results, Language Tests, Test Validity, English (Second Language)
Xi, Xiaoming – Language Testing, 2010
Motivated by cognitive theories of graph comprehension, this study systematically manipulated characteristics of a line graph description task in a speaking test in ways to mitigate the influence of graph familiarity, a potential source of construct-irrelevant variance. It extends Xi (2005), which found that the differences in holistic scores on…
Descriptors: Familiarity, Graphs, Scoring, Task Analysis
Xi, Xiaoming – Language Testing, 2007
This study explores the utility of analytic scoring for TAST in providing useful and reliable diagnostic information for operational use in three aspects of candidates' performance: delivery, language use and topic development. One hundred and forty examinees' responses to six TAST tasks were scored analytically on these three aspects of speech. G…
Descriptors: Scoring, Profiles, Performance Based Assessment, Academic Discourse
Xi, Xiaoming – Language Testing, 2005
This study examines how task characteristics (the number of visual chunks and the amount of planning time) and test-taker characteristics (graph familiarity) influence the perceptual and cognitive processes involved in graph comprehension, the strategies used in describing graphs, and the scores obtained on the graph description task in a…
Descriptors: Program Effectiveness, Familiarity, Graphs, Cognitive Processes

Peer reviewed
Direct link
