Font Size: a A A

The Research And Implementation Of Nutch-Based Website Harvest And Service System In Digital Library

Posted on:2011-02-28Degree:MasterType:Thesis
Country:ChinaCandidate:Z R ChangFull Text:PDF
GTID:2178360308961368Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
This paper introduced the research and implementation of Nutch-based Website Harvest and Service system in Special field (N-WHSS) under the framework of digital library systems integration application. Based on the practical requirements of application system integration in the Digital Library, it focus on the text parsing filters, plug-in development and technology applications of the level-automatic clustering of the search results. Finally, achieved resources-level integration with other subsystem in digital library through the webservice interface, and provided users with comprehensive and professional services.On the basic structure crawler, indexer, searcher models of Search Engine, N-WHSS system designed and developed GUI information module, information filtering module, Dictionary-based Chinese analyzer module, Topic-knowledge based information processing module and the webservice-based search service modules, making improvement in system function and performance, as well as practicality.
Keywords/Search Tags:Nutch, Web harvest, Resource in special field, Digital library, Integration service
PDF Full Text Request
Related items