Publication Date
| In 2015 | 6 |
| Since 2014 | 30 |
| Since 2011 (last 5 years) | 112 |
| Since 2006 (last 10 years) | 223 |
| Since 1996 (last 20 years) | 393 |
Descriptor
| Language Tests | 378 |
| Second Language Learning | 339 |
| English (Second Language) | 260 |
| Testing | 175 |
| Foreign Countries | 164 |
| Language Proficiency | 147 |
| Second Language Instruction | 95 |
| Test Validity | 87 |
| Oral Language | 77 |
| Scores | 77 |
| More ▼ | |
Source
| Language Testing | 488 |
Author
| Davies, Alan | 11 |
| Bachman, Lyle F. | 10 |
| Alderson, J. Charles | 8 |
| Fulcher, Glenn | 7 |
| Henning, Grant | 7 |
| Shohamy, Elana | 6 |
| Stansfield, Charles W. | 6 |
| Xi, Xiaoming | 6 |
| Chapelle, Carol A. | 5 |
| Cheng, Liying | 5 |
| More ▼ | |
Publication Type
Education Level
| Higher Education | 84 |
| Postsecondary Education | 22 |
| Secondary Education | 15 |
| Elementary Secondary Education | 11 |
| Elementary Education | 9 |
| High Schools | 7 |
| Grade 6 | 3 |
| Adult Education | 2 |
| Early Childhood Education | 2 |
| Grade 10 | 2 |
| More ▼ | |
Audience
| Researchers | 1 |
| Teachers | 1 |
Showing 136 to 150 of 488 results
Kane, Michael – Language Testing, 2010
This paper presents the author's critique on Xiaoming Xi's article, "How do we go about investigating test fairness?," which lays out a broad framework for studying fairness as comparable validity across groups within the population of interest. Xi proposes to develop a fairness argument that would identify and evaluate potential fairness-based…
Descriptors: Test Bias, Test Validity, Language Tests, Testing
Beglar, David – Language Testing, 2010
The primary purpose of this study was to provide preliminary validity evidence for a 140-item form of the Vocabulary Size Test, which is designed to measure written receptive knowledge of the first 14,000 words of English. Nineteen native speakers of English and 178 native speakers of Japanese participated in the study. Analyses based on the Rasch…
Descriptors: Test Items, Native Speakers, Test Validity, Vocabulary
Alderson, J. Charles – Language Testing, 2010
The Lancaster Language Testing Research Group was commissioned in 2006 by the European Organisation for the Safety of Air Navigation (Eurocontrol) to conduct a validation study of the development of a test called ELPAC (English Language Proficiency for Aeronautical Communication), intended to assess the language proficiency of air traffic…
Descriptors: Testing, Language Tests, Language Proficiency, Aviation Education
Xi, Xiaoming – Language Testing, 2010
Motivated by cognitive theories of graph comprehension, this study systematically manipulated characteristics of a line graph description task in a speaking test in ways to mitigate the influence of graph familiarity, a potential source of construct-irrelevant variance. It extends Xi (2005), which found that the differences in holistic scores on…
Descriptors: Familiarity, Graphs, Scoring, Task Analysis
Zhang, Bo – Language Testing, 2010
This article investigates how measurement models and statistical procedures can be applied to estimate the accuracy of proficiency classification in language testing. The paper starts with a concise introduction of four measurement models: the classical test theory (CTT) model, the dichotomous item response theory (IRT) model, the testlet response…
Descriptors: Language Tests, Classification, Item Response Theory, Statistical Analysis
Butler, Yuko Goto; Lee, Jiyoon – Language Testing, 2010
This study examined the effectiveness of self-assessment among 254 young learners of English as a foreign language. This study looked at 6th grade students in South Korea, who were asked to perform self-assessments on a regular basis for a semester during their English classes. The students improved their ability to self-assess their performance…
Descriptors: Second Language Learning, Program Effectiveness, Effect Size, Foreign Countries
Munoz, Ana P.; Alvarez, Marta E. – Language Testing, 2010
This article reports the results of a research study to determine the washback effect of an oral assessment system on some areas of the teaching and learning of English as a Foreign Language (EFL). The research combined quantitative and qualitative research methods within a comparative study between an experimental group and a comparison group.…
Descriptors: Experimental Groups, Qualitative Research, Student Surveys, Program Effectiveness
Sinharay, Sandip; Powers, Donald E.; Feng, Ying; Saldivia, Luis; Giunta, Anthony; Simpson, Annabelle; Weng, Vincent – Language Testing, 2009
In order to facilitate the interpretation of test scores from the TOEIC[R] "Bridge" as a measure of English language proficiency, one form of the test was administered to more than 6000 test takers in three South American countries--Colombia, Chile and Ecuador. The appropriateness of the TOEIC "Bridge" test as a measure of English language skill…
Descriptors: Factor Analysis, Foreign Countries, Language Skills, English (Second Language)
Plakans, Lia – Language Testing, 2009
As integrated tasks become more common in assessing writing for academic purposes, it is necessary to investigate how test takers approach these tasks. The present study explores the processes of test takers undertaking reading-to-write tasks developed for a university English placement exam. Think-aloud protocols and interviews of…
Descriptors: Writing Evaluation, Protocol Analysis, Writing Tests, Writing Processes
di Gennaro, Kristen – Language Testing, 2009
Practitioners working closely with second language (L2) writers in the US recognize at least two types of L2 students: international (IL2) and Generation 1.5 (G1.5) students. Some argue that specific differences in each group's writing performance are evident (cf. Harklau, 2003; Reid, 2006); however, investigations into observable and measurable…
Descriptors: English (Second Language), Second Language Learning, Student Placement, Writing (Composition)
Gebril, Atta – Language Testing, 2009
Generalizability of writing scores has always been a longstanding concern in L2 writing assessment. A number of studies have been conducted to investigate this topic during the last two decades. However, with the introduction of new test methods, such as reading-to-write tasks, generalizability studies need to focus on the score accuracy of…
Descriptors: Generalizability Theory, Writing Evaluation, Writing Tests, Scores
Alderson, J. Charles – Language Testing, 2009
In this article, the author reviews the TOEFL iBT which is the latest version of the TOEFL, whose history stretches back to 1961. The TOEFL iBT was introduced in the USA, Canada, France, Germany and Italy in late 2005. Currently the TOEFL test is offered in two testing formats: (1) Internet-based testing (iBT); and (2) paper-based testing (PBT).…
Descriptors: Oral Language, Writing Tests, Listening Comprehension Tests, Test Reviews
Johnson, Jeff S.; Lim, Gad S. – Language Testing, 2009
Language performance assessments typically require human raters, introducing possible error. In international examinations of English proficiency, rater language background is an especially salient factor that needs to be considered. The existence of rater language background-related bias in writing performance assessment is the object of this…
Descriptors: Performance Based Assessment, Performance Tests, Native Speakers, English (Second Language)
Ducasse, Ana Maria; Brown, Annie – Language Testing, 2009
Speaking tasks involving peer-to-peer candidate interaction are increasingly being incorporated into language proficiency assessments, in both large-scale international testing contexts, and in smaller-scale, for example course-related, ones. This growth in the popularity and use of paired and group orals has stimulated research, particularly into…
Descriptors: Oral Language, Interpersonal Communication, Second Language Learning, Language Tests
May, Lyn – Language Testing, 2009
The definition and operationalization of interactional competence in speaking tests that entail co-construction of discourse is an area of language testing requiring further research. This article explores the reactions of four trained raters to paired candidates who oriented to asymmetric patterns of interaction in a discussion task. Through an…
Descriptors: Oral Language, Language Proficiency, Evaluators, Language Tests

Peer reviewed
Direct link
