NotesFAQContact Us
Collection
Advanced
Search Tips
ERIC Number: ED516563
Record Type: Non-Journal
Publication Date: 2010
Pages: 148
Abstractor: As Provided
Reference Count: 0
ISBN: ISBN-978-1-1097-7972-1
ISSN: N/A
Finding Meaning: Sense Inventories for Improved Word Sense Disambiguation
Brown, Susan Windisch
ProQuest LLC, Ph.D. Dissertation, University of Colorado at Boulder
The deep semantic understanding necessary for complex natural language processing tasks, such as automatic question-answering or text summarization, would benefit from highly accurate word sense disambiguation (WSD). This dissertation investigates what makes an appropriate and effective sense inventory for WSD. Drawing on theories and methodologies from cognitive linguistics, psycholinguistics, and computational linguistics, we address underlying assumptions about sense inventories, investigate factors influencing successful sense annotation, and experiment with the appropriateness of using a lexicon enriched with syntactic and semantic information as a sense inventory for WSD. A psycholinguistic experiment is presented that investigates how people process polysemous words. The significant facilitation found in processing related senses as compared to unrelated senses supports a model of semantic representation in which related senses are not discrete entities. The results suggest that fine-grained sense inventories may not be necessary for WSD, as little information may be lost from using more general senses. It also suggests that flexible sense inventories, in which systems can move between more or less nuanced senses, may improve the portability of WSD systems. A series of annotation experiments investigates the factors that affect the reliability of word sense annotation, an important question given the strong correlation between WSD system accuracy and the reliability of the annotation used for training. We found that sense granularity is a significant factor, but that the number of senses used to label a word is not. Finally, we demonstrate that supervised machine learning can be successful with a sense inventory enriched with syntactic and semantic information, such as VerbNet, thereby consolidating the steps needed to create a complex knowledge representation and reasoning system. We trained a classifier to disambiguate a set of verbs with multiple VerbNet class memberships with 90% accuracy, which represents a 61% error reduction over the baseline. [The dissertation citations contained here are published with the permission of ProQuest LLC. Further reproduction is prohibited without permission. Copies of dissertations may be obtained by Telephone (800) 1-800-521-0600. Web page: http://www.proquest.com/en-US/products/dissertations/individuals.shtml.]
ProQuest LLC. 789 East Eisenhower Parkway, P.O. Box 1346, Ann Arbor, MI 48106. Tel: 800-521-0600; Web site: http://www.proquest.com/en-US/products/dissertations/individuals.shtml
Publication Type: Dissertations/Theses - Doctoral Dissertations
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A