Font Size: a A A

Research And Implementation On The Government Information Resource Retrieval Technology Of Government Information Resource Catalog Services System

Posted on:2012-01-29Degree:MasterType:Thesis
Country:ChinaCandidate:X ChenFull Text:PDF
GTID:2178330332985818Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
With the development of the construction of E-government, various Government departments have established the independent E-government systems. Because these systems lack the unified standards, different departments gatherd and collated the government information resources base on different rules. In order to reduce rebuilding and gain a high efficiency method to discover and share the information resources, it is necessary to integrate the information resources. In accordance with specific standards, government information resource catalog system classified the resources and made an interchanging service resource catalog that dynamically integrated the resources of the government. The catalog system will break the isolation of the different resource standards from different government departments, and will at last implement the information sharing and exchanging.This article summarized the situation on the using and sharing of government information resources in china, proposed the architecture of government information resources catalog service system, which included cataloguing system, catalog registering system and catalog management system., and gave a respective detailed analysis on a catalog service retrieval model. This retrieval model was builded based on a mixed search pattern (catalog search pattern, advanced search pattern and keywords search pattern), which provided the highly effective convenient retrieval service for the users.The information retrieval plays an importmant role in information resources catalog service system. Comparing with Xquery language, the main advantage of the keyword search is that the customers doesn't need to study complicated search language, nor need to have understanding of the structure of the XML text file, they only thing they need to do is inputing the keywords related to his interested contents. XML data keyword search takes element as the grain grade, and only return fragment that including the keywords, which raised the searching speed. In general, most of the existing XML keyword query methods only considered the structural relationships among XML nodes and return the fragment including keyword matching nodes as the query results. The semantic relevance was not fully used, that is the main reason which leaded to the irrelevance of the keywords query result. Therefore it's nessary to have more effective query algorithm when design mass data search engines.Based on the analysis of core metadata standards stipulated in national standards of government information resource catalog system, with the concepts of metadata search model, metadata TF*IDF and the semantic relevance, this article proposed a keywords search algorithm RF-MT, which used the XML TF*IDF ranking strategy of government information resource metadata and the keywords dependence to rank the individual matches by semantic relevance, and an improved keywords inverted index was proposed to improve the query efficiency. The experimental results showed that this algorithm can greatly improve the rank accuracy of search results as well as the time efficiency, which can effectively improve the data-sharing ability of government information resource.Finally this article introduced the designation and implemention of the three kinds of retrieval patterns separately. And with the utiliztion of the RF-MT algorithm in the mix retrieval pattern, this retrieval system realized the application of the Government information resource catalog Services retrieval system based on the metadata semantic relevance Oriented Ranking.
Keywords/Search Tags:Government information resource, Metadata, Keyword search, Semantic relevance, XML
PDF Full Text Request
Related items