Semantic Web-based Information Processing System Design

Posted on: 2007-04-18
Degree: Master
Type: Thesis
Country: China
Candidate: Q Yin
GTID: 2208360185956375
Subject: Computer application technology
Abstract/Summary:
The Internet's massive scale, heterogeneity, and dynamic nature set Web information extraction apart from traditional information extraction and pose new challenges. First, the Web information space is growing geometrically, so processing it automatically and efficiently is difficult. Second, because Web pages are heterogeneous, identifying the required information across widely varying pages is a major difficulty. Third, because the Web is updated dynamically, keeping an extraction system adaptable becomes a problem.

This thesis introduces information extraction technology together with its background and history. It analyzes the system architecture, the taxonomy, the key technologies, and the evaluation measures of information extraction, and introduces the basic concepts of ontology. On this basis, it presents a new approach to extracting information from ordinary documents, based on an application ontology that describes a domain of interest. The approach combines information extraction with ontology: the concepts, relations, and keywords of the domain ontology are first used to generate extraction rules automatically; the document is then parsed grammatically; next, the parsing result and the extraction rules are used together to extract information from the document; finally, the result is output as a list of records.

Following this approach and the practical engineering constraints, a semantics-based information extraction system was designed and implemented. The thesis describes the main framework and the design of the main modules in detail. Because extraction is rule-driven, the discussion focuses on the new semantics-based method for extracting information from Web sources. The implementation is also described, including data structures and flow charts.

Finally, the thesis shows the user interface of the extraction system and the results obtained by running it on some test documents, and analyzes those results.
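
The pipeline described in the abstract (ontology to extraction rules, parsing, rule application, list of records) can be illustrated with a minimal Python sketch. This is not the thesis's implementation: the toy ontology, the concept names, the regular-expression rule format, and the function names `generate_rules`, `split_sentences`, and `extract_records` are all assumptions made for illustration, and naive sentence splitting stands in for the grammar parsing step, whose details the abstract does not give.

```python
import re

# Hypothetical toy ontology (not from the thesis): each concept lists
# trigger keywords and a pattern for the values it can take.
ONTOLOGY = {
    "price": {"keywords": ["price", "cost"], "value": r"\$\d+(?:\.\d{2})?"},
    "year": {"keywords": ["year", "model year"], "value": r"\b(?:19|20)\d{2}\b"},
}


def generate_rules(ontology):
    """Build one regex per concept: a keyword, up to three filler words, a value."""
    rules = {}
    for concept, spec in ontology.items():
        # Try longer keywords first so "model year" is preferred over "year".
        keywords = sorted(spec["keywords"], key=len, reverse=True)
        alternation = "|".join(re.escape(k) for k in keywords)
        value = spec["value"]
        pattern = rf"\b(?:{alternation})\b\W+(?:\w+\W+){{0,3}}?({value})"
        rules[concept] = re.compile(pattern, re.IGNORECASE)
    return rules


def split_sentences(text):
    """Assumed stand-in for grammar parsing: split after ., ! or ? plus whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def extract_records(text, rules):
    """Apply every rule to every sentence; emit one record per matching sentence."""
    records = []
    for sentence in split_sentences(text):
        record = {}
        for concept, rule in rules.items():
            match = rule.search(sentence)
            if match:
                record[concept] = match.group(1)
        if record:
            records.append(record)
    return records
```

Calling `extract_records(text, generate_rules(ONTOLOGY))` on a short document returns a list of dictionaries, one per sentence in which at least one rule fired. Deriving the rules from the ontology rather than writing them by hand means that adding a concept or keyword changes the extractor without touching the extraction code, which is the property the abstract attributes to the ontology-based approach.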
Keywords/Search Tags: Semantic Web, information extraction, ontology, metadata, XML