Font Size: a A A

Research On Full-Text Retrieval Technology For XML Documents Based On Inverted Index

Posted on:2008-01-04Degree:MasterType:Thesis
Country:ChinaCandidate:B T QingFull Text:PDF
GTID:2178360242964580Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
With the continuous development of information technology, the XML standard has been widely accepted and applied. How to inquiring information from XML documents more efficiently has become the hot spot of academic and industry research. In several kinds of XML documents inquiring technology, the index based full-text retrieval technology has the higher technical background and broad application prospects at present.On the basis of research and analysis for inverted index based full-text retrieval technology, a group of storage models and algorithms are designed which could support inverted indexing and full-text retrieving XML documents. These structures and algorithms have been applied with prototype system during investigation, and were compared with two kind of XML query language: XPath and XQuery.Considering that the full-text retrieval is one kind of I/O intensive technology, and especially need to visit disk equipment frequently during search by large scale documents set, an idea of using crossed cache list to buffering inverted index file is proposed, In addition, in order to support the renewal needs of documents set, an inverted index file structure and related algorithms based on extensible Bitmap were also introduced.
Keywords/Search Tags:XML, Full-text Retrieval, Inverted Index, Cache, Bitmap
PDF Full Text Request
Related items