NotesFAQContact Us
Search Tips
ERIC Number: ED530670
Record Type: Non-Journal
Publication Date: 2011
Pages: 142
Abstractor: As Provided
Reference Count: 0
ISBN: ISBN-978-1-1247-1058-7
A Semantic Analysis of XML Schema Matching for B2B Systems Integration
Kim, Jaewook
ProQuest LLC, Ph.D. Dissertation, University of Maryland, Baltimore County
One of the most critical steps to integrating heterogeneous e-Business applications using different XML schemas is schema matching, which is known to be costly and error-prone. Many automatic schema matching approaches have been proposed, but the challenge is still daunting because of the complexity of schemas and immaturity of technologies in semantic representation, measuring, and reasoning. The dissertation focuses on three challenging problems in schema matching. First, the existing approaches have often failed to sufficiently investigate and utilize semantic information imbedded in the hierarchical structure of the XML schemas. Secondly, due to synonyms and polysemies found in natural languages, the meaning of a data node in the schema cannot be determined solely by the words in its label. Thirdly, it is difficult to correctly identify the best set of matching pairs for all data nodes between two schemas. To overcome these problems, we propose new innovative approaches for XML schema matching, particularly applicable to XML schema integration and data transformation between heterogeneous e-Business systems. Our research supports two different tasks: integration task between two different component schemas; and transformation task between two business documents which confirm to different document schemas. For the integration task, we propose an approximate approach that produces the best matching candidates between global type components of two schemas, using their layer specific semantic similarities. For the transformation task, we propose another approximate approach that produces the best sets of matching pairs for all atomic nodes between two schemas, based on their linguistic and structural semantic similarities. We evaluate our approaches with the state of the art evaluation metrics and sample schema sets obtained from several e-Business standard organizations and e-Business system vendors. A variety of computer experiments have been conducted with encouraging results that show the proposed approaches are valuable for addressing difficulties in XML schema matching. [The dissertation citations contained here are published with the permission of ProQuest LLC. Further reproduction is prohibited without permission. Copies of dissertations may be obtained by Telephone (800) 1-800-521-0600. Web page:]
ProQuest LLC. 789 East Eisenhower Parkway, P.O. Box 1346, Ann Arbor, MI 48106. Tel: 800-521-0600; Web site:
Publication Type: Dissertations/Theses - Doctoral Dissertations
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A