ERIC Number: EJ1078939
Record Type: Journal
Publication Date: 2014
Pages: 44
Abstractor: As Provided
ISBN: N/A
ISSN: ISSN-1814-9448
EISSN: N/A
Available Date: N/A
Constructing the Taiwanese Component of the Louvain International Database of Spoken English Interlanguage (LINDSEI)
Huang, Lan-fen
Taiwan Journal of TESOL, v11 n1 p31-74 2014
This paper reports the compilation of a corpus of Taiwanese students' spoken English, which is one of the sub-corpora of the Louvain International Database of Spoken English Interlanguage (LINDSEI) (Gilquin, De Cock, & Granger, 2010). LINDSEI is one of the largest corpora of learner speech. The compilation process follows the design criteria of LINDSEI so as to ensure comparability across the sub-corpora. The participants, procedures for data collection and process of transcription are all recorded. Fifty third- or fourth-year English majors in Taiwan were given recorded interviews in English. Each interview was accompanied by a profile containing information about such learner variables as age, gender, mother tongue, country, English learning context, knowledge of other foreign languages, and amount of time spent in English-speaking countries and such interviewer variables as gender, mother tongue, knowledge of foreign languages and degree of familiarity with the interviewees. Data on another variable, the learners' English proficiency level based on the results of international standardised tests, was collected; this is not available in other sub-corpora of LINDSEI. The participants' proficiency was similarly distributed across B1 to C1 levels in the Common European Framework of Reference. The structure of the Taiwanese sub-corpus is discussed in comparison with eleven other published sub-corpora. The preliminary investigation, using corpus-linguistic approaches, reveals overall statistical information about the Taiwanese component and Version 1 of LINDSEI. The lexical analyses of the top 50 words and chunks show the characteristics of spoken English in the Taiwanese sub-corpus. The contributions and research potential of this newly-developed learner corpus are discussed, followed by an example of Contrastive Interlanguage Analysis of the most common chunk, "I think," in the Taiwanese learners' speech. The release of this learner corpus is merely the first step. It is hoped that more corpus research will be done on Taiwanese learners, that corpora of other speech genres will be compiled and that research results will contribute to relevant areas in Applied Linguistics. [Note: Correction notice is here - http://www.tjtesol.org/index.php/current-issue/80-vol-15-2/430-correction-notice.html]
Descriptors: Interlanguage, Databases, English (Second Language), Second Language Learning, Interviews, Gender Differences, Standardized Tests, International Assessment, Oral Language, Foreign Countries, Computational Linguistics, Age Differences, Profiles, Native Language, Familiarity, Language Proficiency, Applied Linguistics, Contrastive Linguistics, Student Attitudes, Majors (Students), Undergraduate Students, Language Tests, Questionnaires
National Chengchi University Department of English. 64 Sec 2 ZhiNan Road, Wenshan District, Taipei 11605, Taiwan, R.O.C. Tel: 886-2-2938-7248; Fax: 886-2-2939-0510; e-mail: tjtesol@nccu.edu.tw; Web site: http://tjtesol.nccu.edu.tw
Publication Type: Journal Articles; Reports - Research
Education Level: Higher Education; Postsecondary Education
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Identifiers - Location: Taiwan
Identifiers - Assessments and Surveys: English Proficiency Test; International English Language Testing System; Test of English as a Foreign Language; Test of English for International Communication
Grant or Contract Numbers: N/A
Author Affiliations: N/A