ERIC Number: ED526411
Record Type: Non-Journal
Publication Date: 2011
Pages: 239
Abstractor: As Provided
ISBN: ISBN-978-1-1245-3209-7
Dimensions of Drug Information
Sharp, Mark E.
ProQuest LLC, Ph.D. Dissertation, Rutgers The State University of New Jersey - New Brunswick
The high number, heterogeneity, and inadequate integration of drug information resources constitute barriers to many drug information usage scenarios. In the biomedical domain there is a rich legacy of knowledge representation in ontology-like structures that allows us to connect this problem both to the very mature field of library and information science classification research and the very new field of ontology matching/merging (OM). We argue for a broad view of OM that makes room not only for the "pre-formal" phase/type of multi-ontology integration exemplified by RxNorm and the UMLS Metathesaurus, but also for an even earlier phase/type when "What is there?" in a domain has to deal with implicit and poorly structured "ontologies" that barely qualify as such. Such is the case in the drug domain. We introduce "dimensions of drug information" as an approach to early, pre-formal OM in the drug domain that draws inspiration and incorporates principles from facet analysis, domain analysis, and Semantic Web research on linked data and mashups. By surveying 23 publicly available drug information resources, we identified 39 dimensions relevant to four drug (sub)domains--pharmacy, chemistry, biology, and clinical medicine--and mapped them to the resources. An arbitrary four-domain, monohierarchical classification of the dimensions produced, by extension, a reasonable four-domain resource classification. Correspondence analysis and hierarchical cluster analysis also produced evidence of its partial validity. Detailed analysis of information on nine parent drug compounds from 15 resources refined this high-level dimensional mapping and identified hundreds of subdimensions which could be expressed as a six-level hierarchy.
Based on these dimensions, we integrated this information in an experimental database and showed that it was useful (1) as a training set for automating the normalization of additional raw data from the same 15 sources, bringing the important goal of building an integrated, comprehensive (all drugs) database within reach, and (2) for satisfying a variety of use cases, some quite complex, derived from published literature representing the user types corresponding to our domain focus. [The dissertation citations contained here are published with the permission of ProQuest LLC. Further reproduction is prohibited without permission. Copies of dissertations may be obtained by telephone at 1-800-521-0600. Web page:]
ProQuest LLC. 789 East Eisenhower Parkway, P.O. Box 1346, Ann Arbor, MI 48106. Tel: 800-521-0600; Web site:
Publication Type: Dissertations/Theses - Doctoral Dissertations
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A