NotesFAQContact Us
Search Tips
Peer reviewed Peer reviewed
PDF on ERIC Download full text
ERIC Number: EJ1072201
Record Type: Journal
Publication Date: 2008
Pages: 16
Abstractor: As Provided
ISSN: ISSN-1578-7044
Developing Software for Corpus Research
Mason, Oliver
International Journal of English Studies, v8 n1 p141-156 2008
Despite the central role of the computer in corpus research, programming is generally not seen as a core skill within corpus linguistics. As a consequence, limitations in software for text and corpus analysis slow down the progress of research while analysts often have to rely on third party software or even manual data analysis if no suitable software is available. Apart from software itself, data formats are also of great importance for text processing. But again, many practitioners are not very aware of the options available to them, and thus idiosyncratic text formats often make sharing of resources difficult if not impossible. This article discusses some issues relating to both data and processing which should aid researchers to become more aware of the choices available to them when it comes to using computers in linguistic research. It also describes an easy way towards automating some common text processing tasks that can easily be acquired without knowledge of actual computer programming.
University of Murcia. Department of English Philology Merced Campus, Calle Santo Cristo 1, Murcia 30071 Spain. Tel: +34-868-88-3406; Fax: +34-868-88-3409; e-mail:; Web site:
Publication Type: Journal Articles; Reports - Descriptive
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A