
Deep Web Interface Integration Based on Domain Knowledge

Posted on: 2012-03-31; Degree: Master; Type: Thesis
Country: China; Candidate: H Wang; Full Text: PDF
GTID: 2208330335956059; Subject: Computer software and theory
Abstract/Summary:
With the rapid development of WWW technology, the number of Web applications backed by a Web database (WDB) has grown exponentially, and with it the information sources available on the Internet have changed dramatically. These changes pose a major challenge to the traditional, search-engine-based way of querying and obtaining information. The contents of a Web database are generated dynamically and can be retrieved only by submitting data through an HTML query form, so traditional search engines cannot reach them. Research on how to make use of this data, which is becoming the largest source of information on the Web, is therefore necessary.
Since Dr. Jill Ellsworth proposed the concept of the Deep Web in 1994, research on it has been under way abroad. A Deep Web integration framework has been proposed that consists of three parts: interface integration, which covers the discovery, classification, and schema extraction of Deep Web interfaces; query processing, which covers the mapping of user queries; and result processing, which covers result extraction, data transformation, and consolidation. The ultimate goal of Deep Web research is to obtain the data in the WDB that is hidden behind the Web application. A WDB provides only an HTML form-based query interface, and its query results are likewise returned as HTML. Moreover, HTML syntax is very flexible and contains no semantic information. Analyzing the HTML in order to extract, identify, and classify WDB query interfaces, and to extract the query results, is therefore quite difficult.
The Deep Web information integration framework aims to establish a fully automated system in which every part of the framework runs without human intervention. Given the huge number of WDBs, however, it is hard to find a unified approach. Most studies have therefore either carried out part of the work manually or restricted data integration to a small, specific domain.
In view of this, we narrow the application down to a particular domain, such as book information queries or train information queries. Once a domain is specified, using domain knowledge to guide the integration can reduce its difficulty and improve its efficiency. This thesis studies how to automatically identify WDB query interfaces and how to integrate the WDB interfaces within a specified domain.
Keywords/Search Tags: Deep Web, Interface Integration, Interface Schema Extraction