Research On The Information Extraction System In Sports Domain

Posted on: 2011-05-23
Degree: Master
Type: Thesis
Country: China
Candidate: G Y Gao
GTID: 2178360305487504
Subject: Communication and Information System
Abstract/Summary:
Information extraction, an automated information processing technology, attracts many researchers in natural language processing. This thesis first studies named entity recognition and relation extraction, the key technologies of information extraction. It proposes a new approach to entity recognition based on conditional random fields that fuses multiple knowledge sources, and a new approach, also based on conditional random fields, to extracting entity relations from sports news. Second, an information extraction system for sports game news is designed and implemented; it extracts information mainly through a combination of statistical methods and rules. The experimental corpus comes from www.sina.com and www.sohu.com. Experimental results show that the system achieves a precision of 95.70%, a recall of 93.00%, and an F-measure of 94.33%, which supports the validity of the approach.
Keywords/Search Tags: information extraction, named entity recognition, entity relation extraction, conditional random fields
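
The three figures reported in the abstract can be cross-checked against each other. The abstract does not say which F-measure variant was used; the sketch below assumes the balanced F1 score (the harmonic mean of precision and recall), and the function name `f_measure` and its `beta` parameter are illustrative choices, not taken from the thesis.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of precision and recall (F1 when beta == 1)."""
    if precision == 0.0 and recall == 0.0:
        return 0.0  # avoid division by zero when nothing was extracted correctly
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Figures reported in the abstract, in percent.
precision = 95.70
recall = 93.00

# Assumption: the reported F-measure is the balanced F1 score.
f1 = f_measure(precision, recall)
print(f"F-measure: {f1:.2f}%")  # prints "F-measure: 94.33%"
```

Under that assumption, the computed value matches the 94.33% stated in the abstract, so the reported precision, recall, and F-measure are mutually consistent.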