Font Size: a A A

On-line Employment Information Extraction Based On Patterns Discovery

Posted on:2007-07-18Degree:MasterType:Thesis
Country:ChinaCandidate:J H ChenFull Text:PDF
GTID:2178360212466302Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
With the rapid growth of the WWW, Web has become an important employment information resource. Extracting and reasonable storing the information is very important for further analyzing employment information, understanding employment situation and making employment plan.Most of web employment information is represented with HTML language. Since it cannot be used directly, the information is unavailable for the application. Hence, at first we must use the technique of web information extraction and storage to get employment information.According to display characteristics of web employment information, this thesis proposes a design about extraction tool based on pattern discovery. This tool applies PAT-array to find display patterns, and creates extraction rules to realize extraction of web employment information.Employment information is first stored with the form of XML file, and then this tool uses object-relational mapping to create the mapping rule between XML and database, and completes integration and storage of employment information.At last, using XML technique and Delphi, we have developed a demo system. It has a good result in one-line employment information extraction.
Keywords/Search Tags:Employment information, Web extraction, PAT-array, XML, Object-relational mapping
PDF Full Text Request
Related items