Font Size: a A A

Design And Implementation Of Job Vertical Search Engine

Posted on:2014-01-21Degree:MasterType:Thesis
Country:ChinaCandidate:M LuoFull Text:PDF
GTID:2248330398960657Subject:Computer technology
Abstract/Summary:PDF Full Text Request
The rapid development of the Internet in recent decades, the explosive growth of the amount of information on the network, and how these massive amounts of information quickly and accurately extract valuable information have become the focus of attention. A lot of information on the Internet by finishing a platform for users to use general search engine, greatly improves the browsing and efficiency, but there are pages failure as well as knowledge overload problem. Vertical search engine for specific areas, the specific needs of a specific group of people to a certain value of information and related services, characterized by "specialized, refined, deep", can be said to be the industry division of labor in the field of search engine. Employment problem in recent years has been known as an important issue to be solved through convenient access to timely access to the recruitment employment information, and is bound to a certain extent, may increase the employment of graduates.In this paper, this employment situation and employment vertical search engine is designed and implemented based on the concept of vertical search engines. In this thesis a theoretical analysis of the search engines and vertical search engines, introduced its principle as well as the main classification of the main drawback of the search engine as well as the characteristics of the vertical search engine, and specific vertical search engine in the design key technologies involved in the process, Lucene framework and the implementation mechanism, as well as vertical search engine page design. Information collection module, designed both the list page reptiles and information page reptiles focused web crawler. Information extraction module to achieve extraction of body denoising algorithm based on the label’s website and design templates and dictionary-based structured information extraction algorithm. The system design goals are:focus on employment, to provide timely and effective post information for job seekers.Finally, based on the exploration of the key technologies of the vertical search engine, designed the employment vertical search engine, and gives the prototype system. Detailed information collection, information extraction and information the index retrieval module specific implementation process.
Keywords/Search Tags:Job, Vertical Search Engine, Information Collection, Information Extraction, Lucene
PDF Full Text Request
Related items