Font Size: a A A

Research On Web Resources Management Based On Semantic Web

Posted on:2012-09-10Degree:MasterType:Thesis
Country:ChinaCandidate:D LiuFull Text:PDF
GTID:2178330335977752Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
With the rapid development of Internet, there are many Web resources, they have been one of the most important net resources. The main purpose of resource management is to allow people to find interested information from huge Web resources accurately, extract data from a lot of disordered information and represent them with a structured way so that we can take further action on them.This paper researched semantic web and the identification, extraction and structured expression of Web resources. ontology is about concepts and relations between them; RDF is the general framework which was used to describe resources. In this way, resources have semantic information which can improve the efficiency of resource management; at present there were multiple languages to inqury RDF, SPARQL language is one of them which supports various platforms and languages and is suitable for local or remote use. The main content and the results of this paper are as follows:Firstly, Web resources achievement and analyzing. We get pages by HTTP protocol; then use Microsoft open source library to convert HTML format into XML one which can be extracted. Then convert the XML document into DOM tree by traversing all XML nodes and use a relevant control to read them out from memory;Secondly, Web resources extraction. We need traversal DOM tree with extracting rules which were generated by combining with the users'needs and ontology files, ontology can solve problems such as synonyms or polysemous words and make the extraction results contain more information which people interested;Thirdly, Web resources description with RDF. Developing the Redland libraries secondary to express results using RDF/XML syntax and store them in XML/RDF file. RDF makes extraction results with more semantic information and it is easier to be handled by machine;Finally, Query Web resources. Take further action on the extraction results by researching SPARQL language grammar, semantics and its use.On the basis of above research work, this paper designed and implemented a Web resource management system, it includes four parts:Web resources achievement, Web resources extraction, Web resources expression and how to query Web resources. This system is easy to use, it is not necessary for users to possess professional knowledge so that they can find interested information. Experimental results show that with the semantic web, the Web resource management system can get high recall and precision.
Keywords/Search Tags:Web resources management, RDF, DOM, ontology, SPARQL
PDF Full Text Request
Related items