The Study Of XML Storage Technique In Relational Database

Posted on:2006-10-21

Degree:Master

Type:Thesis

Country:China

Candidate:J Z Luo

Full Text:PDF

GTID:2168360155972245

Subject:Computer system architecture

Abstract/Summary:

PDF Full Text Request

XML has already become the standard format for data present and data exchange onthe Internet. With the sharp increasing of XML data, many kinds of DBMS come out inuse to manage XML data, such as XML native database, object-oriented database,relational database and so on. Among them, use relational database to storage andmanage XML data is the dominant approach up to now or even for a long period of timeaccording to mature technique and the wide apply of relational database.Owing to different structure between XML schema and relational schema, weshould do schema mapping at first. There are two kinds of schema mapping methods.The first method is based on a set of fixed mapping rules,and by these rules we canproduce a relational schema for the XML documents. Another method is based on querycost. A XML schema can be mapped to many relational schemas, the cost-based schemamapping method is to search the best relational schema whose query cost is theminimum.This paper studies the storage of XML document in relational database based onquery cost. The paper introduces various storage approaches of XML document andpoints out advantage and disadvantage of these approaches at first. At the same time, thepaper puts forward an improved approach, and it has many characteristics as follows.First, the paper introduces an extended schema graph based on XML Schema. Theextend schema graph can straightforward and complete express the structure of XMLdocuments. Second, add '|'operate to handle choice elements or choice group elementsfor traditional operates can't to do under this situation. Third, when obtain neighboringschemas, map a simple type node to a single relation is limited, thus schema searchspace is smaller then before, which make search algorithm find the best schema inshorter time. Forth, a new query cost model based on the characteristics of hash joinalgorithm is put forward, this cost model is more accurate than others. Fifth, a greedyalgorithm is used as the search algorithm of schema space, and optimize the algorithmwhen compare the cost of two schemas. The optimized algorithm speed up the processof cost compare, and proved the performance of algorithm as a result.At last, I do the imitated experiment and obtained the outcome expected.

Keywords/Search Tags:

XML, schema transfer, cost model, search algorithm

PDF Full Text Request

Related items

1	The Research Of Schema Evolution Strategy With Minimum Query Cost
2	A Study On A Schema-based ANT Algorithm And Its Application On QAP
3	Method And Tool For Transforming XML Schema To Conceptual Model
4	Research And Application On Technology Of Deep Web Schema Acquisition
5	Study On Schema Recognition Oriented To Response Page Of Deep Web
6	Research On Publishing XML Documents From Enterprise Database
7	The Research On Online Transfer Based On Cost-Sensitive And Kernel For Classification
8	Semantics-based Relational Schema To Xml Schema Conversion Methods Research
9	The Research Of Schema Manifestations Based On The Information Transfer
10	Research And Implementation On Schema Exchanging Between XML Schema And Relation Schema