Font Size: a A A

Web Chinese Information Extraction Technology And Its Applications In The Recruitment Of Information Systems

Posted on:2008-01-20Degree:MasterType:Thesis
Country:ChinaCandidate:W T MengFull Text:PDF
GTID:2208360215964723Subject:Computer system architecture
Abstract/Summary:PDF Full Text Request
With the quickly development of WWW, it has become the important platform of transmitting and sharing information all over the world. It's out of question that the Internet has become the primary source for people to get the information they needed. But the fact is that the difficulty of getting useful information is growing rapidly while the explosion of the data appears on the WWW. Ideally, people can query the information on the WWW just like a database.For satisfy such the needs, web information extraction appeared and become abundant, but they cann't get high score at each aspect such as accuracy, extensibility, adaptability and so on. My research subject sloves the drawbacks on processing the half-structure text by Natural Language Understanding method and improves the existing language model .Based on this, the author design and development a web recruitment information extraction system called JobHunter.The extraction processes are as follows. Firstly, construct a Spider to snatch the web pages from some employ sites. And then extract the employment information and saved to the database by information extraction model. Lastly, display the information extracted to the job hunters at the interface.The system has a good extensibility and adaptability because it based on the natural language understanding method, and precision and recall can reach above 70%.
Keywords/Search Tags:Web Information Extraction, Natural Language Understanding, Spider, Name Entity Recognition
PDF Full Text Request
Related items