ERIC Number: EJ1158161
Record Type: Journal
Publication Date: 2017
Pages: 21
Abstractor: As Provided
ISBN: N/A
ISSN: EISSN-1934-5275
EISSN: N/A
SCOPIC Design and Overview
Barth, Danielle; Evans, Nicholas
Language Documentation & Conservation, SP12 p1-21 2017
This paper provides an overview of the design and motivation for creating the Social Cognition Parallax Interview Corpus (SCOPIC), an open-ended, accessible corpus that balances the need for language-specific annotation with typologically-calibrated markup. SCOPIC provides richly annotated data, focusing on functional categories relevant to social cognition, the social and psychological facts that place people and others within an interconnected social context and allow people to interact with one another. By "parallax corpus" we mean "broadly comparable formulations resulting from a comparable task", to avoid the implications of "parallel corpus" that there will be exact semantic equivalence across languages. We describe the data structure of the corpus and the language functions being annotated, and provide an example of a typological analysis using recursive partitioning, a modern statistical technique. The current paper should be seen as the introductory chapter of an open-ended special issue of LDC whose goal is to make available both the original corpus, the evolving annotated versions, and analyses coming from them, so that any investigator can examine the corpus with their own questions in mind. A range of new papers, linked to the evolving corpus, will be added to this special issue over time.
Descriptors: Computational Linguistics, Documentation, Social Cognition, Data, Classification, Grammar, Semantics, Language Research, Statistical Analysis
National Foreign Language Resources Center at University of Hawaii. Department of Linguistics, UHM Moore Hall 569, 1890 East-West Road, Honolulu, HI 96822. Fax: 808-956-9166; e-mail: ldc@hawaii.edu; Web site: http://nflrc.hawaii.edu/ldc/
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A