Peer reviewed
ERIC Number: EJ1111357
Record Type: Journal
Publication Date: 2008-Oct
Pages: 43
Abstractor: As Provided
ISSN: EISSN-2330-8516
Applying Content Similarity Metrics to Corpus Data: Differences between Native and Non-Native Speaker Responses to a TOEFL® Integrated Writing Prompt. Research Report. ETS RR-08-51
Deane, Paul; Gurevich, Olga
ETS Research Report Series, Oct 2008
For many purposes, it is useful to collect a corpus of texts all produced in response to the same stimulus, whether to measure performance (as on a test) or to test hypotheses about population differences. This paper examines several methods for measuring similarities in phrasing and content and demonstrates that these methods can be used to identify population differences between native and non-native speakers of English in a writing task.
Educational Testing Service. Rosedale Road, MS19-R Princeton, NJ 08541. Tel: 609-921-9000; Fax: 609-734-5410
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Identifiers - Assessments and Surveys: Test of English as a Foreign Language
Grant or Contract Numbers: N/A