ERIC Number: EJ968180
Record Type: Journal
Publication Date: 2012-Jan
Pages: 10
Abstractor: As Provided
Reference Count: 0
ISBN: N/A
ISSN: ISSN-1366-7289
Distributions of Cognates in Europe as Based on Levenshtein Distance
Schepens, Job; Dijkstra, Ton; Grootjen, Franc
Bilingualism: Language and Cognition, v15 spec iss n1 p157-166 Jan 2012
Researchers on bilingual processing can benefit from computational tools developed in artificial intelligence. We show that a normalized Levenshtein distance function can efficiently and reliably simulate bilingual orthographic similarity ratings. Orthographic similarity distributions of cognates and non-cognates were identified across pairs of six European languages: English, German, French, Spanish, Italian, and Dutch. Semantic equivalence was determined using the conceptual structure of a translation database. By using a similarity threshold, large numbers of cognates could be selected that nearly completely included the stimulus materials of experimental studies. The identified numbers of form-similar and identical cognates correlated highly with branch lengths of phylogenetic language family trees, supporting the usefulness of the new measure for cross-language comparison. The normalized Levenshtein distance function can be considered as a new formal model of cross-language orthographic similarity.
Descriptors: Semantics, Artificial Intelligence, Foreign Countries, Instructional Effectiveness, Bilingualism, Language Processing, Simulation, English, German, French, Spanish, Italian, Indo European Languages, Translation, Databases, Correlation, Contrastive Linguistics, Language Research, Computational Linguistics
Cambridge University Press. The Edinburgh Building, Shaftesbury Road, Cambridge, CB2 8RU, UK. Tel: 800-872-7423; Tel: 845-353-7500; Tel: +44-1223-326070; Fax: 845-353-4141; Fax: +44-1223-325150; e-mail: subscriptions_newyork@cambridge.org; Web site: http://www.cambridge.org
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Identifiers: Europe

Peer reviewed
Direct link
