Identification Of The Semi-Structured Text

Posted on:2010-04-25

Degree:Master

Type:Thesis

Country:China

Candidate:Z X Jiang

Full Text:PDF

GTID:2178360278965689

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

In daily lives, Resume is a very important text, which includes the author's information such as basic information and experience. The application of Resume is very extensive in today's society. Therefore, fast and efficient extraction of information in the resume has become an urgent demand. This article will study how to extract the resume information fast and effectively. First, text analysis more dependent on computers rather than artificial because of the Huge quantity of the Semi-structured text; second, we are able to get the accurate result according to the feature of Semi-structured text and lots of skills about text analysis, such as regulation match, relation analysis, statistics and so on.The main task of this paper is: having a deep research on effective algorithms of information extraction for Chinese Resume. The main research results are as follows: First, through research, the paper gives the characteristics of the Chinese Resume; Secondly, the paper gives effective algorithms of information extraction for various parts of the Chinese Resume; the third, giving the Chinese resume information extraction model; fourth, the paper gives the experimental results based on 1500 Chinese Resumes.From the structure of the paper's contents, the first chapter introduces the background and significance of the subject; in the second chapter, there are the introduction of semi-structured text, as well as definitions of key words; in the third chapter, automatic text classification techniques are introduced; Chapter four, give the characteristics of resume text and the model of information extraction; the fifth chapter, give the experimental results and analysis of the results; Chapter six, a summary of the work and problems.

Keywords/Search Tags:

semi-structured text, elements, items, categories, collections, regular matches, statistics, segmentation

PDF Full Text Request

Related items

1	Information Extraction For Semi-structured Chinese Resume
2	Curriculum Vitae Recognition System Base On Identification Of Semi-Structured Text
3	Research On Semi-structured Text Push Technology And Application
4	Research Of Information Retrieval Based Semi-Structured Data
5	Design And Implementation Of The Core Information Extraction System Of Semi-structured Financial Contract
6	Research And Application Of Semi-structured Data Extraction
7	Research On The Storing And Querying Of Semi-Structured Data On The Web
8	Research On Feature Extraction Method Of Semi-structured Document
9	Self-organising text collections with adaptive resonance theory neural networks
10	Semantic Retrieval Of Semi-Structured Text Based On Ontology Concept