The Study On Knowledge Extraction From Text Resources

Posted on:2011-09-29

Degree:Master

Type:Thesis

Country:China

Candidate:S Kong

Full Text:PDF

GTID:2178330332960839

Subject:Information management and e-government

Abstract/Summary:

PDF Full Text Request

With the widely development of information technology and internet, information resources is growing very quickly. And 80% of the information resource is stored in the form of natural language text. How to get the knowledge from the text data and how to solve the contradiction between flood of information and lack of knowledge is the goal of knowledge extraction. And natural language processing is the key technology to solve this problem.First, this paper gives out the background and research status of the topic on knowledge extraction from text resources. Therefore, we can know that the research object is unstructured tree text and the study goal is to extract knowledge, involving natural language processing, text mining and other related fields. After analyzing and summarizing the related knowledge extraction system at home and abroad, we present the history and development trends of this field. Second, we summarized the related key technology to provide the theoretical basis to this paper, including natural language processing, Chinese word segmentation, the semantic similarity algorithm and commonly used dictionary. Third, we proposed text knowledge extraction models, including the definition of the concept of text knowledge, analysis of the text structure, transformation of web html to plain text, implement of key word extraction and the topic sentence extraction. Finally, we design and implementation a knowledge extraction from text resources system to validate the text knowledge models.Overall, there is not much work of knowledge extraction in China. But the relevant research has developed well, such as information extraction, knowledge discovery, ontology and so on. Different from the traditional information extraction based on rule and learning mechanism, this paper aims to develop a knowledge extraction system that tries using NLP to extract knowledge for scientific literature of the discourse after word segmentation, POS tagging, syntactic analysis, and semantic analysis process. This study can be a kind of technology solutions to solve the problem of the contradiction between flood of information and lack of knowledge.

Keywords/Search Tags:

Natural Language Processing, Knowledge Extraction, Text Resources

PDF Full Text Request

Related items

1	Research On Discovering Knowledge Of Problem Solving Of Program Design Resources Based On Natural Language Processing
2	Design And Implementation Of Knowledge Extraction Algorithm Based On Natural Language Processing
3	The Application Of Natural Language Processing In Mining The Characteristics Of Concept Convey
4	Narrative Information Extraction with Non-Linear Natural Language Processing Pipeline
5	The Methodology And Implementation Of Chinese Natural Language Query In Databases
6	Text Classification Based On Natural Language Processing, Analysis And Research
7	Natural Language Processing Based On Scenarized Knowledge Representation And Its Application In Automatic Text Correction
8	Design And Implementation Of Knowledge Extraction System For Overlapping Relations In Complex Semantic Context
9	Research And Application Of Information Extraction And Knowledge Discovery Based On Professional Literature
10	Research On Text Classification Based On Natural Language Processing And Machine Learning