
Research And Application Of Information Retrieval Based On XML Document

Posted on: 2014-12-06
Degree: Master
Type: Thesis
Country: China
Candidate: N N Hu
GTID: 2268330425462264
Subject: Software engineering
Abstract/Summary:
Information retrieval for XML documents holds an important position in the field of information retrieval. In today's Internet era, the amount of data on the network keeps growing. Without a common standard for disseminating and exchanging these data, sharing information across the network is hampered and acquiring information becomes very difficult.

HTML, a hypertext markup language, has served as the standard language for transmitting information over the network since the birth of the Internet, but HTML focuses on the external presentation of data rather than on the meaning of the data itself. The emergence of XML effectively addresses the complexity of the information search process, because XML is concerned with how data is organized, and it offers the advantages of being self-descriptive, extensible, and platform independent. XML is expected to become the standard for data representation and information exchange on the Web, replacing HTML as the main format for Web data exchange and information preservation.

Compared with traditional information retrieval, the main difference of information retrieval for XML documents lies in the indexing strategy: retrieval must handle not only the index terms in an XML document but also the structure in which the information is organized within the document.

To improve retrieval efficiency, this thesis designs an indexing strategy for XML documents. The index is built mainly from the index terms of the XML document, and the concepts of semantic level and index term expansion are proposed so that retrieval can operate on both content and structure. Based on an analysis of HowNet and its computational methods, the thesis puts forward a method for computing quantitative relations between terms, and applies these relations, drawn from a concept library, to constructing the index of XML documents.
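
The idea of indexing both content and structure can be illustrated with a minimal sketch. It is not the thesis's actual index design: it simply maps each term to the element paths in which it occurs, so that a query can be restricted by structure. The function names `build_index` and `search` and the sample document are assumptions made for illustration only.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_index(xml_text):
    """Map each lowercase term to the set of element paths whose own text contains it.

    Simplification: only the text directly inside an element is indexed;
    tail text and attributes are ignored.
    """
    root = ET.fromstring(xml_text)
    index = defaultdict(set)

    def walk(elem, path):
        here = f"{path}/{elem.tag}"
        for term in re.findall(r"\w+", (elem.text or "").lower()):
            index[term].add(here)
        for child in elem:
            walk(child, here)

    walk(root, "")
    return index


def search(index, term, path_suffix=""):
    """Return the sorted paths containing `term`, optionally filtered by a path suffix."""
    return sorted(p for p in index.get(term.lower(), ()) if p.endswith(path_suffix))


# Hypothetical sample document, not taken from the thesis.
SAMPLE = (
    "<thesis><title>XML retrieval</title>"
    "<abstract>Index terms for XML documents</abstract></thesis>"
)
index = build_index(SAMPLE)
print(search(index, "xml"))            # content-only query
print(search(index, "xml", "/title"))  # content plus structure constraint
```

A content-only query returns every path where the term occurs, while the second query keeps only matches inside `title` elements, which is the kind of combined content and structure retrieval the abstract describes.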
Keywords/Search Tags: XML, Index Mechanism, Concept, Information Retrieval