Peer reviewed
ERIC Number: EJ968180
Record Type: Journal
Publication Date: 2012-Jan
Pages: 10
Abstractor: As Provided
Reference Count: N/A
ISBN: N/A
ISSN: 1366-7289
Distributions of Cognates in Europe as Based on Levenshtein Distance
Schepens, Job; Dijkstra, Ton; Grootjen, Franc
Bilingualism: Language and Cognition, v15 spec iss n1 p157-166 Jan 2012
Researchers on bilingual processing can benefit from computational tools developed in artificial intelligence. We show that a normalized Levenshtein distance function can efficiently and reliably simulate bilingual orthographic similarity ratings. Orthographic similarity distributions of cognates and non-cognates were identified across pairs of six European languages: English, German, French, Spanish, Italian, and Dutch. Semantic equivalence was determined using the conceptual structure of a translation database. By using a similarity threshold, large numbers of cognates could be selected that nearly completely included the stimulus materials of experimental studies. The identified numbers of form-similar and identical cognates correlated highly with branch lengths of phylogenetic language family trees, supporting the usefulness of the new measure for cross-language comparison. The normalized Levenshtein distance function can be considered as a new formal model of cross-language orthographic similarity.
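The similarity measure described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact normalization or threshold, so the code below makes two labeled assumptions: the edit distance is divided by the length of the longer word, and the 0.5 cutoff in the hypothetical helper `is_form_similar` is illustrative only. It should not be read as the authors' implementation.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn `a` into `b`."""
    # Keep the shorter string in the inner loop to bound row size.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def normalized_similarity(a: str, b: str) -> float:
    """Similarity in [0, 1]; 1.0 means orthographically identical.

    Assumption: normalize by the length of the longer word. The
    abstract does not state which normalization the paper uses.
    """
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical
    return 1.0 - levenshtein(a, b) / longest


def is_form_similar(a: str, b: str, threshold: float = 0.5) -> bool:
    """Hypothetical helper: flag a translation pair as form-similar.

    The default threshold is illustrative, not taken from the paper.
    """
    return normalized_similarity(a, b) >= threshold


if __name__ == "__main__":
    # Example English-Dutch translation pairs.
    for english, dutch in [("hotel", "hotel"), ("tomato", "tomaat")]:
        print(english, dutch, round(normalized_similarity(english, dutch), 3))
```

In this sketch, semantic equivalence is taken as given: the function would be applied only to word pairs already known to be translations of each other, which the abstract says were identified through a translation database.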
Cambridge University Press. The Edinburgh Building, Shaftesbury Road, Cambridge, CB2 8RU, UK. Tel: 800-872-7423; Tel: 845-353-7500; Tel: +44-1223-326070; Fax: 845-353-4141; Fax: +44-1223-325150; e-mail: subscriptions_newyork@cambridge.org; Web site: http://www.cambridge.org
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A