Research Of Semistructured Data Index Technology Based On XML

Posted on: 2005-01-26
Degree: Master
Type: Thesis
Country: China
Candidate: W Sun
GTID: 2168360125971046
Subject: Computer software and theory
Abstract/Summary:
In recent years there has been growing interest in managing data that does not conform to traditional data models such as the relational or object-oriented models. Semistructured data management and integration have therefore become an important research topic in databases. In particular, the Extensible Markup Language (XML) has emerged as a simple, practical standard for modeling and exchanging semistructured data over the World Wide Web, without the rigid constraints of traditional database systems. Migrating semistructured data on the Web to XML is thus an important way to manage and integrate it, and data management and integration for XML-based semistructured data has become an active research topic in the international database community.

Indexing is one aspect of semistructured data management. Indexes for semistructured data share much with the index techniques of traditional databases, but also differ from them. Some mature techniques from traditional databases can be carried over to semistructured data easily, but semistructured data has inherent characteristics that differ from those of traditional databases, so indexing it raises new questions.

Because XML is strong at data representation and exchange on the World Wide Web, it is much more than a bridge between the Web and databases. In this thesis, the study of indexes for semistructured data is based on the XML graph model. There is already a large body of research on indexing semistructured data, including many mature index models. Based on the characteristics of semistructured data, we examine and classify indexes along seven aspects: data representation, index interface, path templates, navigation, node identification, index update, and storage. The author discusses most of the indexes for semistructured data that have been proposed.
In particular, the BUS index is described in detail, along with its shortcoming that dynamic changes are difficult to handle. The author therefore builds a new prototype model of a semistructured data index that supports frequently changing documents. The new index model is combined with the relational database model, which makes it easy to insert and delete nodes. Finally, the prototype is implemented, and experimental results are used to validate the efficiency of the index when the content and structure of documents change frequently.
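The abstract does not give the schema of the prototype, so the following is only a minimal sketch of the general idea of keeping XML nodes in a relational table so that inserting or deleting a node touches only a few rows. The table `node`, its columns, and the helper names `build_index`, `find_by_path`, `insert_node`, and `delete_subtree` are all hypothetical and are not taken from the thesis.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical schema: one row per XML element, linked to its parent,
# with the root-to-node label path stored for indexed path lookups.
SCHEMA = """
CREATE TABLE node (
    id     INTEGER PRIMARY KEY,
    parent INTEGER,
    tag    TEXT NOT NULL,
    path   TEXT NOT NULL,
    text   TEXT
);
CREATE INDEX node_path ON node(path);
CREATE INDEX node_parent ON node(parent);
"""

def build_index(xml_text):
    """Shred an XML document into the node table (in-memory database)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)

    def walk(elem, parent_id, parent_path):
        path = parent_path + "/" + elem.tag
        cur = conn.execute(
            "INSERT INTO node (parent, tag, path, text) VALUES (?, ?, ?, ?)",
            (parent_id, elem.tag, path, (elem.text or "").strip()),
        )
        node_id = cur.lastrowid
        for child in elem:
            walk(child, node_id, path)

    walk(ET.fromstring(xml_text), None, "")
    return conn

def find_by_path(conn, path):
    """Return the text of every node reached by an exact label path."""
    rows = conn.execute("SELECT text FROM node WHERE path = ? ORDER BY id", (path,))
    return [r[0] for r in rows]

def insert_node(conn, parent_id, tag, text=""):
    """Insert one node under parent_id; no existing row is rewritten."""
    (parent_path,) = conn.execute(
        "SELECT path FROM node WHERE id = ?", (parent_id,)
    ).fetchone()
    cur = conn.execute(
        "INSERT INTO node (parent, tag, path, text) VALUES (?, ?, ?, ?)",
        (parent_id, tag, parent_path + "/" + tag, text),
    )
    return cur.lastrowid

def delete_subtree(conn, node_id):
    """Delete a node and all of its descendants via a recursive query."""
    conn.execute(
        """
        WITH RECURSIVE sub(id) AS (
            SELECT ?
            UNION ALL
            SELECT n.id FROM node n JOIN sub s ON n.parent = s.id
        )
        DELETE FROM node WHERE id IN (SELECT id FROM sub)
        """,
        (node_id,),
    )
```

In this sketch an update is local because each row refers only to its parent, so no sibling or ancestor entries need renumbering. Whether the thesis prototype uses a comparable layout cannot be determined from the abstract.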
Keywords/Search Tags: semistructured data, XML, index, data model