
Design And Implementation Of The Text Structuring System Of Insurance Clause

Posted on: 2020-08-26
Degree: Master
Type: Thesis
Country: China
Candidate: Z H Zeng
GTID: 2428330623451855
Subject: Computer technology
Abstract/Summary:
In recent years, the continuing development of natural language processing has led to text structuring systems being widely used across knowledge domains. Text structuring research in fields such as medicine and journalism has made major advances, but no mature text structuring system yet exists for the insurance field. Because domains differ in their professional knowledge and in how their texts are written, no universal text structuring system currently meets the information extraction needs of every field, and the distinctive language style of insurance texts poses new challenges for building one. An insurance clause sets out the rights and obligations agreed between the policyholder and the insurance company. Its core content is the insurance liability text, which describes the scope of the insurer's liability and the compensation the insurer must pay when the insured suffers an insured event. Structuring this unstructured text helps users read the insurance liability quickly and understand it effectively. This paper proposes a text structuring method for insurance clauses and uses it as the basis for a prototype insurance clause text structuring system, which simulates how the proposed algorithm would be applied when querying insurance liability conditions in practice. The main contents of this paper are as follows: (1) The original PDF files of insurance clauses from various insurance companies are collected with web crawlers, and different document parsing strategies are designed for PDF files of different text formats to obtain the insurance liability text in each clause. (2) A structuring scheme for unstructured insurance liability text is proposed: multi-stage text processing consisting of semantic segmentation of long insurance liability texts, classification of the resulting short texts, entity information extraction, and fusion of the short-text structuring results, which together structure 98,524 insurance liability texts. (3) The insurance clause text structuring system is designed and implemented based on the proposed method. Test results show that the multi-stage text processing method effectively structures the text in insurance clauses and meets the system's expected design requirements.
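The four processing stages named in the abstract (segmentation, short-text classification, entity extraction, fusion) can be pictured with a minimal Python sketch. This is not the thesis's implementation: the category labels, keyword lists, and regular expressions below are hypothetical stand-ins, and where the keywords suggest a Conditional Random Field for entity recognition, the sketch substitutes plain regular expressions so that it runs with the standard library alone.

```python
import re
from dataclasses import dataclass, field

# Hypothetical categories and keywords; the abstract does not give the real label set.
CATEGORY_KEYWORDS = {
    "death_benefit": ("death", "dies"),
    "medical_expense": ("hospital", "medical"),
}

# Regex stand-ins for the entity extractor (assumption: the thesis uses a trained model).
ENTITY_PATTERNS = {
    "PERCENT": re.compile(r"\d+(?:\.\d+)?%"),
    "DURATION": re.compile(r"\d+\s+days?"),
}


@dataclass
class StructuredItem:
    """One short liability text with its category and extracted entities."""
    text: str
    category: str
    entities: dict = field(default_factory=dict)


def split_long_text(text):
    """Stage 1: split a long liability text at semicolons and sentence-final periods."""
    parts = re.split(r"(?:;|\.(?=\s|$))\s*", text)
    return [p.strip() for p in parts if p.strip()]


def classify_short_text(text):
    """Stage 2: assign the first category whose keyword occurs in the text."""
    lowered = text.lower()
    for category, keywords in CATEGORY_KEYWORDS.items():
        if any(k in lowered for k in keywords):
            return category
    return "other"


def extract_entities(text):
    """Stage 3: collect every pattern match, keyed by entity type."""
    found = {}
    for name, pattern in ENTITY_PATTERNS.items():
        matches = pattern.findall(text)
        if matches:
            found[name] = matches
    return found


def structure_liability(text):
    """Stage 4: fuse the per-clause results into one structured list."""
    return [
        StructuredItem(s, classify_short_text(s), extract_entities(s))
        for s in split_long_text(text)
    ]
```

Calling `structure_liability` on a liability paragraph returns one `StructuredItem` per short clause, which is the shape a query interface could then filter by category or entity type. In a fuller system each stage would be replaced by a trained model without changing this overall flow.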
Keywords/Search Tags:Insurance Clause, Insurance Liability, Text Structuring, Named Entity Recognition, Conditional Random Field