Font Size: a A A

A Index Technology And Query Method For XML Document Based On Textnode

Posted on:2006-12-21Degree:MasterType:Thesis
Country:ChinaCandidate:Q WangFull Text:PDF
GTID:2168360155456973Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
XML, as a extensible markup language with particular characteristics and advantages, makes it possible to carry out deep application on Internet. XML will become the new standard format of data express and data transportation on Internet instead of HTML step by step in the future. Nowadays, more and more data are stored in XML file, and there has already a large number of data collections based on XML documents on Internet. The database technology how to store and query XML data is an important subject for research. Because data in XML documents has characteristics of nested structure and incomplete information mode, which makes its structure very complex, it is so difficult to deal with XML query. Traditional traverse technologies based on tree structure can not meet the demand to deal with complex XML documents. So how to query the data in XML documents quickly and accurately has already become a exigent problem to be solved imminently.In this paper, we research the query algorithms for XML documents, and construct a special index based on textnode and a query strategy for semistructured document like XML document. We construct the corresponding index structure and realize to query data in XML documents high-efficiently. This method has the following advantages. Its index file is small, can query XML documents quickly, is easy to realize, and is suitable for all XML documents including XML documents without DTD files.The main work of this paper is as follows:1) XML basic conceptions and characteristics are expatiated, and bases of XML query is explained.2) Domestic and international technologies for XML query are researched, the disadvantages and advantages of existing query technology are analyzed, the factors which affect the query efficiency are found, and the...
Keywords/Search Tags:XML, document tree, textnode-index, cumulate word frequency, inverted file, query-engine
PDF Full Text Request
Related items