XML Document Information Retrieval Techniques and Realization

Posted on: 2003-03-18
Degree: Master
Type: Thesis
Country: China
Candidate: D F Sun
Full Text: PDF
GTID: 2208360092499086
Subject: Systems Engineering
Abstract/Summary:
As more and more data is described, stored, exchanged, and represented in XML, information retrieval over XML documents becomes increasingly important. Because XML is structured, retrieval techniques for XML documents must satisfy requirements on both the content and the structure of the information. First, this thesis studies classical document information retrieval, covering its theory, mathematical description, and retrieval models, and designs and implements an experimental content retrieval system based on the vector space model. Next, it studies the XML data model, document structure, query requirements, and indexing schemes, and proposes a numbering method for representing the structural information of XML documents. Building on this method, we design and partially implement a prototype information retrieval system for XML documents. In this prototype, content retrieval is handled by the experimental content retrieval system based on the vector space model, while structural information is indexed in relational tables through the proposed numbering. We have implemented hybrid retrieval over the content, structure, and attributes of XML documents. Finally, building on this work, and in particular on an extended application of the vector space model, the thesis proposes and builds a complete solution for the collection, processing, and serving of network information.
Keywords/Search Tags: XML, XML Document, Information Retrieval, Structure Retrieval, Vector Space Model, Indexing
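
The abstract does not spell out the numbering method, so the following is only a minimal sketch of one common way to number XML elements so that structural relationships can be stored and tested in a relational table. It assumes a depth-first interval numbering (each element gets a start and an end number), which may differ from the scheme actually proposed in the thesis. The function names, the row fields, and the sample document are all hypothetical.

```python
import xml.etree.ElementTree as ET


def number_elements(xml_text):
    """Return one row per element with (start, end, level, parent) numbers.

    Assumption: interval numbering by depth-first traversal. This is an
    illustrative scheme, not necessarily the one used in the thesis.
    """
    root = ET.fromstring(xml_text)
    rows = []
    counter = 0

    def visit(elem, level, parent_start):
        nonlocal counter
        counter += 1                      # number assigned on entering the element
        row = {
            "tag": elem.tag,
            "start": counter,
            "end": None,
            "level": level,
            "parent": parent_start,
            "text": (elem.text or "").strip(),
        }
        rows.append(row)
        for child in elem:
            visit(child, level + 1, row["start"])
        counter += 1                      # number assigned on leaving the element
        row["end"] = counter

    visit(root, 0, None)
    return rows


def is_ancestor(a, d):
    """True if row a strictly contains row d, i.e. a is an ancestor of d."""
    return a["start"] < d["start"] and d["end"] < a["end"]


def find_descendants(rows, ancestor_tag, descendant_tag):
    """Structure query: elements named descendant_tag under any ancestor_tag."""
    ancestors = [r for r in rows if r["tag"] == ancestor_tag]
    return [
        d for d in rows
        if d["tag"] == descendant_tag and any(is_ancestor(a, d) for a in ancestors)
    ]


# Hypothetical sample document.
doc = (
    "<lib>"
    "<book><title>XML retrieval</title></book>"
    "<paper><title>Vector space</title></paper>"
    "</lib>"
)
rows = number_elements(doc)
book_titles = find_descendants(rows, "book", "title")
```

With rows of this shape, an ancestor/descendant test reduces to two integer comparisons, which is why such numbers are convenient to index in a relational table alongside a separate content index.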