ERIC Number: ED517683
Record Type: Non-Journal
Publication Date: 2009
Pages: 195
Abstractor: As Provided
ISBN: 978-1-1240-3561-1
ISSN: N/A
EISSN: N/A
BioFrameNet: A FrameNet Extension to the Domain of Molecular Biology
Dolbey, Andrew Eric
ProQuest LLC, Ph.D. Dissertation, University of California, Berkeley
In this study I introduce BioFrameNet, an extension of the Berkeley FrameNet lexical database to the domain of molecular biology. I examine the syntactic and semantic combinatorial possibilities exhibited in the lexical items used in this domain in order to get a better understanding of the grammatical properties of the language used in scientific writings on molecular biology. The particular data considered is a collection of Gene References in Function (GRIF) texts that describe various types of intracellular protein transport events, a collection that had previously been annotated for an ontologically grounded knowledge base. GRIF texts use long, complex noun phrases, with the omission of many items, resulting in a dense, telegraphic style of writing. This introduces an additional level of complexity to language used in scientific writings of this domain. In providing a frame semantic analysis and cataloging of the grammatical structures used in the scientific language of molecular biology, we see how well a FrameNet approach can handle language of this domain. Extending FrameNet to this domain serves as a testing ground for some of FrameNet's principles and claims, as it becomes evident how well a FrameNet approach handles language in a significantly different field than has been previously examined. I show how domain ontologies and knowledge bases, sources of definitions and classifications of biological phenomena based entirely on their biological properties, can be used in conjunction with lexical resources. At the same time, I also illustrate the overlap of grammatical properties across separate domain ontology classes, demonstrating that although the biology defined and classified in these classes is different, language used to describe and discuss them is not. Finally, I also explore the possibility that BioFrameNet can be used with tools that carry out Natural Language Processing tasks such as automatic semantic role labeling. 
Therefore, this work is at the intersection of theoretical frame semantics and practical applications and will potentially provide benefit to linguists, BioNLP engineers, and biologists. [The dissertation citations contained here are published with the permission of ProQuest LLC. Further reproduction is prohibited without permission. Copies of dissertations may be obtained by telephone: 1-800-521-0600. Web page: http://www.proquest.com/en-US/products/dissertations/individuals.shtml.]
ProQuest LLC. 789 East Eisenhower Parkway, P.O. Box 1346, Ann Arbor, MI 48106. Tel: 800-521-0600; Web site: http://www.proquest.com/en-US/products/dissertations/individuals.shtml
Publication Type: Dissertations/Theses - Doctoral Dissertations
Education Level: Higher Education
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A