ERIC Number: ED552211
Record Type: Non-Journal
Publication Date: 2013
Pages: 138
Abstractor: As Provided
ISBN: 978-1-2679-4640-9
ISSN: N/A
EISSN: N/A
Applications of Location Similarity Measures and Conceptual Spaces to Event Coreference and Classification
McConky, Katie Theresa
ProQuest LLC, Ph.D. Dissertation, State University of New York at Buffalo
This work covers topics in event coreference and event classification from spoken conversation. Event coreference is the process of identifying descriptions of the same event across sentences, documents, or structured databases. Existing event coreference work focuses on sentence similarity models or feature based similarity models requiring slot filling. This work shows the effectiveness of using a hybrid approach where the similarity of two events is determined by a combination of the similarity of the two event descriptions, in addition to the similarity of the event context features of location and time. A dynamic weighting approach is taken to combine the three similarity scores together. The described approach provides several benefits including improving event resolution and requiring less reliance on sophisticated natural language processing. A spatial and hierarchical based location similarity measure is developed to support the dynamic weighting event coreference approach. A complete end-to-end system is developed starting with geo-referencing toponyms from natural language text and structured documents. An experiment is conducted that shows a significant improvement in precision when using the developed spatial-hierarchical location similarity measure for an event coreference process over using location string matching techniques or Euclidean distance measures between location centroids as part of the event coreference process. Additionally, the developed spatial-hierarchical similarity measure provides robust predictable performance regardless of the parameter settings and the quality of the preceding location resolution processes. In parallel, a sentence similarity measure is developed for the event coreferencing domain based on the evaluation of location and time coupled with existing sentence similarity measures. Experiments show an increase of 30 to 64% in F-Measure performance in event coreferencing experiments when location and date are included as part of the sentence similarity measure. Seven sentence similarity measures are evaluated including semantic, word overlap and term frequency- inverse document frequency (TFIDF) based measures. TFIDF based measures are shown to have superior performance in two event coreference experiments. Finally, a novel application and implementation of conceptual spaces is applied to topic detection in military conversations to aid in battlefield event reporting. Extensions are made to existing conceptual spaces models in order to take into account contradictory information present in observations to mitigate the errors present in the noisy speech recognition data. Comparison of the conceptual spaces algorithm to a traditional approach using support vector machines indicates significant promise for this new application of conceptual spaces. [The dissertation citations contained here are published with the permission of ProQuest LLC. Further reproduction is prohibited without permission. Copies of dissertations may be obtained by Telephone (800) 1-800-521-0600. Web page: http://www.proquest.com/en-US/products/dissertations/individuals.shtml.]
Descriptors: Classification, Sentences, Interpersonal Communication, Verbal Communication, Documentation, Databases, Models, Natural Language Processing, Measurement Techniques, Scores, Semantics, Incidence, Experiments, Reports, Concept Formation, Proximity
ProQuest LLC. 789 East Eisenhower Parkway, P.O. Box 1346, Ann Arbor, MI 48106. Tel: 800-521-0600; Web site: http://www.proquest.com/en-US/products/dissertations/individuals.shtml
Publication Type: Dissertations/Theses - Doctoral Dissertations
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A