NotesFAQContact Us
Search Tips
Peer reviewed Peer reviewed
Direct linkDirect link
ERIC Number: EJ1092687
Record Type: Journal
Publication Date: 2016-Apr
Pages: 24
Abstractor: As Provided
Reference Count: 38
ISSN: ISSN-0013-1644
Automatic Coding of Short Text Responses via Clustering in Educational Assessment
Zehner, Fabian; Sälzer, Christine; Goldhammer, Frank
Educational and Psychological Measurement, v76 n2 p280-303 Apr 2016
Automatic coding of short text responses opens new doors in assessment. We implemented and integrated baseline methods of natural language processing and statistical modelling by means of software components that are available under open licenses. The accuracy of automatic text coding is demonstrated by using data collected in the "Programme for International Student Assessment" (PISA) 2012 in Germany. Free text responses of 10 items with n = 41,990 responses in total were analyzed. We further examined the effect of different methods, parameter values, and sample sizes on performance of the implemented system. The system reached fair to good up to excellent agreement with human codings (0.458 = ? = 0.959). Especially items that are solved by naming specific semantic concepts appeared properly coded. The system performed equally well with n = 1 , 661 and somewhat poorer but still acceptable down to n = 249 . Based on our findings, we discuss potential innovations for assessment that are enabled by automatic coding of short text responses.
SAGE Publications. 2455 Teller Road, Thousand Oaks, CA 91320. Tel: 800-818-7243; Tel: 805-499-9774; Fax: 800-583-2665; e-mail:; Web site:
Publication Type: Journal Articles; Reports - Research
Education Level: Secondary Education
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Identifiers - Location: Germany
Identifiers - Assessments and Surveys: Program for International Student Assessment