Research And Implementation On Topic Search Oriented To Enterprise Competitive Intelligence

Posted on:2011-02-11

Degree:Master

Type:Thesis

Country:China

Candidate:C W Zhang

Full Text:PDF

GTID:2178330332988253

Subject:Computer application technology

Abstract/Summary:

The rapidly growing body of Web resources has become an important source of information for enterprises. These resources are semi-structured, scattered, real-time, and heterogeneous. How to extract information on a particular topic from Web resources and deliver it promptly to companies as valuable intelligence has become a significant research area.

This thesis addresses Web-based topic search oriented to enterprise competitive intelligence (CI). It focuses on the design and implementation of the topic Web crawler, which is the core module of topic search. The main work is as follows:

Topic Web Crawler: After a comprehensive analysis of existing search algorithms, a genetic algorithm based on a non-greedy strategy is adopted to improve the global convergence of information collection.

Web Document Analysis: Each Web document is converted into a corresponding document tree, and relevant information is accessed quickly and efficiently by traversing this tree. After content refinement and text extraction, a text feature vector is built using an improved method for calculating the weights of feature items.

Topic Degree Evaluation: Based on the topic degree evaluation of the Web document text, the topic degree of each Web link is computed by combining its anchor text, its URL string, and the surrounding context.

Finally, the overall design of the CI system and the implementation of topic search are described in detail.
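The topic degree evaluation of links described above can be illustrated with a minimal Python sketch. The abstract does not give the thesis's actual formulas, so everything here is an assumption made for illustration: raw term frequency stands in for the improved feature-item weighting, cosine similarity stands in for the topic degree measure, and the function names and the four combination weights are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def term_vector(text):
    """Build a term-frequency vector (assumed stand-in for the thesis's weighting)."""
    return Counter(tokenize(text))


def cosine(a, b):
    """Cosine similarity between two sparse term vectors; 0.0 if either is empty."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def link_topic_degree(topic, page_score, anchor_text, url, context,
                      weights=(0.4, 0.3, 0.1, 0.2)):
    """Combine page, anchor-text, URL, and context evidence into one score.

    The weights are hypothetical and sum to 1, so the result stays in [0, 1]
    when page_score is itself in [0, 1].
    """
    topic_vec = term_vector(topic)
    w_page, w_anchor, w_url, w_context = weights
    return (w_page * page_score
            + w_anchor * cosine(topic_vec, term_vector(anchor_text))
            + w_url * cosine(topic_vec, term_vector(url))
            + w_context * cosine(topic_vec, term_vector(context)))


# Example: score two links found on the same page.
topic = "competitor pricing strategy"
page_text = "Report on competitor pricing and market strategy"
page_score = cosine(term_vector(topic), term_vector(page_text))

relevant = link_topic_degree(
    topic, page_score,
    anchor_text="competitor pricing report",
    url="http://example.com/competitor-pricing",
    context="see the latest pricing strategy analysis",
)
irrelevant = link_topic_degree(
    topic, page_score,
    anchor_text="contact us",
    url="http://example.com/contact",
    context="office address and phone number",
)
```

In a crawler along these lines, links with a higher score would be placed earlier in the crawl frontier. A purely greedy crawler would always follow the best-scoring link first; the non-greedy genetic strategy mentioned in the abstract would presumably also keep some lower-scoring links in play, though the abstract does not say how.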

Keywords/Search Tags:

Topic Search, Web Mining, Web Crawler, CI

Related items

1	The Research And Implementation Of Topical Web Crawler Based On Improved Shark-Search Algorithm
2	Research And Implementation On Topic Search Oriented To Enterprise Competitive Intelligence
3	Utility-driven Topic Web Mining Algorithm Research
4	Research On The Topic Crawler Algorithm Based On Vector Space Model
5	The Design Of Specific Topic Web Crawler And Its Transmission Group
6	Research And Implementation Of Topic Web Crawler Oriented To Web Mining
7	The Design And Implementation Of Topic Web Crawler About Mining Equipment
8	Topic-Driven Web Information Mining And The Design And Implementation Of TWIMS
9	Design And Implementation For Topic Specific Meta Search Engine Based On Web Data Mining
10	News Topic Mining Based On Web Crawler And LDA