Text mining for semantic relations

Posted on:2003-10-02

Degree:Ph.D.

Type:Thesis

University:The University of Texas at Dallas

Candidate:Girju, Corina Roxana

GTID:2468390011483111

Subject:Computer Science

Abstract/Summary:

Text Mining is a rapidly emerging field concerned with the extraction of concepts, relations, and implicit knowledge from texts. The current state of the art in Text Mining is based on shallow representations of text documents coupled with statistical data mining techniques. This approach is limited by the highly ambiguous nature of natural language.

This thesis proposes a new approach to Text Mining that emphasizes the use of rich syntactic and semantic features to discover useful and implicit relations from text, and that is based on the acquisition of some of the most frequently used semantic relations. Using a general algorithm, the system automatically discovers lexico-syntactic patterns for each semantic relation considered. The patterns are evaluated and then accepted or rejected based on semantic constraints specifically tailored to each semantic relation. These semantic constraints are rooted in the WordNet lexical database.

We have focused on two widely used semantic relations: CAUSALITY and PART-WHOLE. A text knowledge acquisition (KAT) system was developed to extract lexico-syntactic patterns that refer to these semantic relations.

The knowledge discovered (concepts and semantic relations) is organized into hierarchies for the purpose of developing ontologies. These ontologies are built using a knowledge classification approach based on subsumption.

In this thesis we also demonstrate the usefulness of Text Mining for advanced Natural Language applications, such as Question Answering. On-line ontology development helps in understanding complex questions and provides the means for Answer Fusion.
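The general idea of matching lexico-syntactic patterns and then filtering the matches with semantic constraints can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the regular-expression patterns, the `SEMANTIC_CLASS` table (a toy stand-in for WordNet semantic classes), the `ALLOWED` constraint sets, and the function name `extract_relations` are hypothetical and are not taken from the thesis, whose actual patterns and constraints are not given in this abstract.

```python
import re

# Hypothetical hand-written patterns, one list per relation.
# The thesis describes discovering such patterns automatically.
PATTERNS = {
    "PART-WHOLE": [
        re.compile(r"\b(?P<part>\w+) of the (?P<whole>\w+)\b"),
        re.compile(r"\bthe (?P<whole>\w+) has an? (?P<part>\w+)\b"),
    ],
    "CAUSALITY": [
        re.compile(r"\b(?P<cause>\w+) causes? (?P<effect>\w+)\b"),
    ],
}

# Toy stand-in for WordNet: maps a word to a coarse semantic class.
SEMANTIC_CLASS = {
    "engine": "artifact",
    "car": "artifact",
    "book": "artifact",
    "idea": "cognition",
    "smoking": "act",
    "cancer": "state",
}

# Assumed constraints: which (class, class) pairs each relation accepts.
ALLOWED = {
    "PART-WHOLE": {("artifact", "artifact")},
    "CAUSALITY": {("act", "state")},
}

# Names of the two argument groups for each relation, in order.
ARGUMENTS = {
    "PART-WHOLE": ("part", "whole"),
    "CAUSALITY": ("cause", "effect"),
}


def extract_relations(sentence):
    """Return (relation, arg1, arg2) triples that pass the constraints."""
    text = sentence.lower()
    results = []
    for relation, patterns in PATTERNS.items():
        first, second = ARGUMENTS[relation]
        for pattern in patterns:
            for match in pattern.finditer(text):
                args = (match.group(first), match.group(second))
                classes = tuple(SEMANTIC_CLASS.get(a) for a in args)
                # Accept the match only if its semantic classes are allowed.
                if classes in ALLOWED[relation]:
                    results.append((relation, args[0], args[1]))
    return results


if __name__ == "__main__":
    print(extract_relations("The engine of the car failed."))
    print(extract_relations("Smoking causes cancer."))
    print(extract_relations("The idea of the book is simple."))
```

In this sketch the third sentence matches the surface pattern "X of the Y" but is rejected by the constraint check, which is the role the abstract assigns to the WordNet-based semantic constraints.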

Keywords/Search Tags:

Text mining, Relations

Related items

1	Study On Method To Automatically Analyze The Text Structure Based On The Relevancy Computing Of Text Content
2	Personae Entity Relations Extraction And Analysis In Chinese Microblog Text
3	Research And Implement Of Gene-Gene Relations Mining System Based On Biomedical Literature
4	The Research Of Fusion Method For Relational Domain Knowledge Oriented To Data Mining
5	Research On Web Text Mining
6	Key Techniques Of Text Mining On Criminal Cases
7	The Analysis On The Basic Techniques For Preprocess Of Text Mining And The Study On The Application Of Text Mining
8	Research On Orientation Relations And Integrative Reasoning With Topological Relations And Orientation Relations In Dynamic Settings
9	The Research Of Text Preprocessing Based On Web Mining And Its Application
10	The Research & Realization On The Key Techniques Of Text Mining