The Realization Of WEB Alumni Information Retrieval System Based On Text Classification Technology

Posted on: 2018-09-14
Degree: Master
Type: Thesis
Country: China
Candidate: Y G Tian
GTID: 2348330542962897
Subject: Software engineering
Abstract/Summary:
With the rapid development of information technology, and especially the spread of the Web, online information is growing very quickly and now forms a huge resource. Finding relevant content quickly and accurately within it is a meaningful problem. For a university alumni office, collecting alumni information is the foundation of its work and matters for how that work develops. A great deal of alumni information, such as resumes and activity reports, is already on the Internet. A general search engine can surface it, but the result set is very large and includes many unorganized and unverified pages, so checking each result by hand is a heavy burden. A search system that finds alumni information on the Internet automatically, comprehensively, and accurately is therefore of practical value.

This thesis studies alumni information retrieval based on text classification technology, analyzes the characteristics and difficulties of the task in depth, and designs an Internet-based alumni information retrieval system built on a double classification method. The main work is as follows:
1. It describes the distribution characteristics of alumni information, and the problems that result from them: poor generalization ability in learning methods and the curse of dimensionality.
2. It adopts a double classification method to classify alumni information, which improves the coverage and accuracy of the results.
3. It uses heuristic rules to identify the names of alumni in the search results.
4. Based on the methods above, it implements an Internet-based alumni information retrieval system in Java. The system searches for alumni information on the Internet in real time using search engine technology and a web crawler.
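The abstract does not say how the two classification passes are built, so the following is only a minimal sketch of the general idea of classifying a page twice: a coarse pass that favors coverage, followed by a stricter pass that favors accuracy. The class name, the cue phrases, and the threshold parameter are all hypothetical, and simple keyword matching stands in for whatever trained text classifiers the thesis actually uses.

```java
import java.util.Locale;

// Illustrative two-pass ("double") classification of a fetched page.
// Everything here is an assumption made for the sketch, not the thesis's method.
final class DoubleClassificationSketch {

    // Hypothetical cue phrases that suggest a page is about an alumnus.
    private static final String[] ALUMNI_CUES = {
        "alumni", "alumnus", "alumna", "graduated from", "class of", "resume"
    };

    // Pass 1 (coarse, coverage-oriented): keep any page that mentions the university.
    static boolean passesCoarseFilter(String text, String university) {
        return text.toLowerCase(Locale.ROOT)
                   .contains(university.toLowerCase(Locale.ROOT));
    }

    // Pass 2 (strict, accuracy-oriented): count how many distinct cue phrases occur.
    static int cueScore(String text) {
        String lower = text.toLowerCase(Locale.ROOT);
        int score = 0;
        for (String cue : ALUMNI_CUES) {
            if (lower.contains(cue)) {
                score++;
            }
        }
        return score;
    }

    // A page is accepted only if it survives both passes.
    static boolean isAlumniPage(String text, String university, int minCues) {
        return passesCoarseFilter(text, university) && cueScore(text) >= minCues;
    }
}
```

Under these assumptions, a call such as `DoubleClassificationSketch.isAlumniPage(pageText, "Example University", 2)` would reject pages that never mention the university, and also pages that mention it but contain fewer than two cue phrases.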
Keywords/Search Tags: Search engine, Double classification, Web crawler, Alumni information retrieval