Font Size: a A A

News Information Prearrangement Service System

Posted on:2008-10-31Degree:MasterType:Thesis
Country:ChinaCandidate:L HanFull Text:PDF
GTID:2178360212994107Subject:Software engineering
Abstract/Summary:PDF Full Text Request
The now society is the information society. The information to people's production lives the pivotal function, this news information is especially important. The news information gradually appears to the enterprise and individual development importance. The effective use information technology and the information knowledge resources, carries on the Omni-directional transformation to enterprise's each kind of production management, the full development, inside and outside the use enterprise's person, the wealth, the commodity and the enterprise the information resource carries on production management, reduces the production and the management cost, enhances the economic efficiency, causes the enterprise the production, the design, the management, the management, the purchase, to store in a storehouse and so on the comprehensive realization automation, the intellectualization. The enterprise superintendent policy-maker requests promptly to grasp each kind with this profession related most recent, the most accurate news information, in order to is accurate fast makes the corresponding decision-making. Enterprise's each department also must promptly obtain respective need the news information, in order to understand the market tendency, promptly carries on the adjustment. Therefore, fast accurately grasps the news information is one of the modern enterprise and individual development essential important conditions. But this system is precisely for satisfy the user this demand to be born.Information on the Internet grows explosively every day. Search engine provides all the surfers on it with an entrance, from which they can reach every comer of the web. Therefore, search engine becomes the most popular network service second to email.With Web information continuing to explode in all directions, traditional Search Engine can not keep up with the more and more rigorous and prolific search requirements from different users. Recently, topic-driven search engine is presented to provide a new search service, which is better classified, containing more profound and focused data, and being updated in time.Based on our in-depth research in the search strategy in topic-driven search engine and the topic relativity judging algorithms, this article presents a structure design model of the topic-oriented web spider and then analyzes it in detail.This article is compiles a news information service system, uses topic-driven search engine for the user to provide the news information the collection work. This system main work is compiles a topic-driven search engine. The topic-oriented web spider is the topic-driven search engine foundation and the core. How therefore compiles the main question which a highly effective Web spider is we must solve. As the key component of search strategy in topic-oriented web spider, the topic relativity judging algorithms ensure the focused web crawling process of the spider. In the process of relativity judging between URL and topic, a novel URL pruning algorithm is presented based on the analysis on anchor text and other properties. The popular vector space model is used to classify HTML page from different topics.After this system completes, we will have a topic-driven search engine that used to collecting news information. According to the user input the subject ,this service system will get the news information that the user want to. When searches finished, transmits automatically the obtained content in the email address which assigns to the user.
Keywords/Search Tags:Search Engine, Web Spider, Search Strategy, Topic Distillation, Index Page
PDF Full Text Request
Related items