GE FRST Evaluation Report: How Well Does a Statistically-Based Natural Language Processing System Score Natural Language Constructed-Responses?.

Burstein, Jill C.; Kaplan, Randy M.

Notes FAQ Contact Us

Download full text

ERIC Number: ED393937

Record Type: Non-Journal

Publication Date: 1995-Sep

Pages: 30

Abstractor: N/A

ISBN: N/A

ISSN: N/A

EISSN: N/A

GE FRST Evaluation Report: How Well Does a Statistically-Based Natural Language Processing System Score Natural Language Constructed-Responses?

Burstein, Jill C.; Kaplan, Randy M.

There is a considerable interest at Educational Testing Service (ETS) to include performance-based, natural language constructed-response items on standardized tests. Such items can be developed, but the projected time and costs required to have these items scored by human graders would be prohibitive. In order for ETS to include these types of items on standardized tests, automated scoring systems need to be developed and evaluated. Automated scoring systems could decrease the time and costs required for human graders to score these items. This report details the evaluation of a statistically-based scoring system, the General Electric Free-Response Scoring Tool (GE FRST). GE FRST was designed to score short-answer, constructed-responses of up to 17 words. The report describes how the system performs for responses on three different item types: (1) the formulating-hypotheses item; (2) a paraphrase language proficiency item; and (3) a reading comprehension item. For the sake of efficiency, it is important to evaluate systems on a number of item types to see if the system's scoring method can generalize to a number of item types. An appendix shows learning information abut responses recognized by GE FRST. (Contains 7 figures, 13 tables, and 3 references.) (Author/SLD)

Descriptors: Computer Assisted Testing, Constructed Response, Cost Effectiveness, Hypothesis Testing, Natural Language Processing, Performance Based Assessment, Reading Comprehension, Scoring, Standardized Tests, Test Construction, Test Items

Publication Type: Reports - Evaluative

Education Level: N/A

Audience: N/A

Language: English

Sponsor: N/A

Authoring Institution: Educational Testing Service, Princeton, NJ.

Grant or Contract Numbers: N/A

Privacy | Copyright | Contact Us | Selection Policy | API