Font Size: a A A

Research And Design Of A Special Field-based Text Information Obtaining System On Web

Posted on:2006-11-19Degree:MasterType:Thesis
Country:ChinaCandidate:S Z ZhaoFull Text:PDF
GTID:2168360155974258Subject:Computer applications
Abstract/Summary:PDF Full Text Request
With the development of Internet application, Web has become a main information resources for everyone . Although many popular search engines are advantageous tools that people use, they lack the technique and the strategy of understanding users in depth. Additional, because of Internet's opening , developments and heterogeneity, it is much difficult for users(especially for a special domain user) to quickly and exactly obtain the needed information from WWW. How to find the useful and wholesome information, and avoid the useless and harmful information, are the issue deserved to study for us.This dissertation discusses the growth and characteristics of Web and the misadvantage of existing search engines firstly. And then from the application requirement of specific-field users, weattempt to design a frame structure based on specific-field content information obtaining System on Web for them to obtain the needed web text quickly and intelligently. We also analyze the basic developing principle related to the system and the main characteristics and functions of each module composing the system from the implement technology. At the same time, the key technologies to realize the system is also discussed in detail, such as Robot technology, Analysis of Web page content, the Hyperlink structure analysis and Chinese text classification, which includes Chinese Words Segmentation, feature extraction, feature match and Wight value calculating technology etc.At last, summary of this thesis is given. The application prospect and realistic meaning of this system to acquire useful information for specific-field users from Web is pointed out, and further research direction is also put forward .
Keywords/Search Tags:special domain, search engine, Web text Information, Chinese Words segmentation, classification system
PDF Full Text Request
Related items