Font Size: a A A

Research Of Intranet Information Supervision System Based On Net Crawler And Full-text Search Engine

Posted on:2010-07-30Degree:MasterType:Thesis
Country:ChinaCandidate:X H FuFull Text:PDF
GTID:2178360275497450Subject:Biomedical engineering
Abstract/Summary:PDF Full Text Request
With the development of network technology and application information-based, the level of application based on network and information has improved increasingly. The major platform of network development and construction has transferred from Internet to Intranet.Generally,most organizations and departments have built all kinds of internet application systems based on Intranet.New technology such as dynamic server-side script and database has widely been used in web application development.As a result,the information based on Intranet grows rapidly with transferring the main point of building and application of new technologies."How to efficiently control the information based on network,especially on Intranet?" has become a challenge,which makes the network administrations have to face.The information on Intranet is different from the one on Internet,which plays an important role in society,and has more significant influence on organizations and departments.Therefore,it is the efficiently supervision that should be paid attention on by internet management.However,there are some technology difficulties to manage this kind of information for the characteristic of information based on Intranet. As to the issues,a software method is introduced,which is based on data collection and full-text search engine to develop an Intranet information supervision system.With the help of this system,network administrator can catch the ability of information collection and data filter fast,thus helping the administrator to supervise the web information on Intranet.In the process of development,some popular software technologies are adopted,such as web crawler,which is widely used in web search engine,full-text search engine based on RDBMS.On the other hand, considering the characteristic of information on Intranet,researchers take some additional technical measures to ensure the system work more efficiently,such as "site by site search mode","restriction search rules".In addition,after the search task on Intranet completed by data collection module,researchers use full-text search engine based on RDBMS to manipulate the data,such as merge and filter,and extract valuable information.Also,it is useful to implement a web module in system, which combines web with full-text search engine and RDBMS,and it provides an easy-to-use user interface based on browser,which offers a convenient way for users to get access to the system.Furthermore,people can use this system to search any keyword they are interested in.Meanwhile,through analyzing keyword log which record all keywords user has utilize,it is helpful to find what users most interested in. As a result,network administrator can further improve supervising ability of the system.At first,the paper makes an analysis of the difficulty and embarrassing situation, which network administrator confront.Then,the writer summarizes the characteristic of the information on Intranet.After that,there is a presentation of a software solution to supervise information on Intranet,as well as a description of the software architecture and implementation of system.At last,the paper makes a conclusion of the system's goal achieved,the shortage and the improvements.
Keywords/Search Tags:Intranet, Web, Search engine, Data collection, Web crawler/Web spider, XML, Full-text search engine, RDBMS
PDF Full Text Request
Related items