Uyghur Comaprative Sentences And Relations Identification

Posted on:2015-06-06

Degree:Master

Type:Thesis

Country:China

Candidate:H Y Wang

Full Text:PDF

GTID:2298330431491893

Subject:Computer application technology

Abstract/Summary:

Comparison is one of the most common expressions, which is widely used by people toexpress their emotional tendency on different things. With the rapid development of computertechnology, there are a large number of Uyghur social and business application platformappear in the Internet, and the Uyghur texts of the platform contain many comparativesentences. As one of the most important parts of fine-grained opinion mining, identifyingcomparative sentences and extracting comparative relations have important practicalsignificance. Comparative opinion mining is related to several technologies, such as textclassification, information retrieval and information extraction. The information mined by uswill offer valuable reference for some potential buyers or investors. However, there is noresearch on Uyghur comparative sentences, and for the existing methods on miningcomparative opinions, they have quite low recall rate.This paper sets comparative texts as the object and studies the comparative opinionmining method, and it includes the following two aspects:(1) Identification of comparative sentences. We put forward a two-level model to identifysentences. The first level is to identify sentences based on our comparative words set; and thesecond level uses Bidirectional CSR Mining algorithm (Bi-CSR) that we propose in this paperto mine comparative patterns, and take them as feature for Support Vector Machine (SVM) toclassify into either comparative or not.(2) Extraction of comparative type and relation. For the type identification, we settype-indicating words in an order, and classify sentences based on these ordered words.Relation extraction is to get the comparative subject (Subject), comparative object (Object),feature (Feature) and comparative result (Result) from a comparative sentence. In order to getthese elements, we propose different methods for different kind of elements. Firstly, we treatSubject and Object as the topic of an opinion, and identify them through Latent Dirichlet Allocation (LDA) model. Then we use similarity calculation and a mapping table of Featureand Result to classify topics we get from LDA into Subject, Object and Feature. Lastly, wetake full use of the features of Uyghur comparative sentences to generate rules for Resultextraction. After we get all the above elements, we express comparative relation in the form of5-tuple <Subject, Object, Feature, Result, Type>.We do experiment according to the above idea, and through the experiment results weget the conclusion that our method can solve the problem of Uyghur comparative sentenceidentification and relation extraction. Our method gets good performance in both precisionand recall rates.

Keywords/Search Tags:

Uyghur, Comparative sentences recognition, Comparative type identification, Comparative relation extraction

Related items

1	Research On Comparative Sentences Identification And Relation Extraction Based On LSTM
2	Research On Comparative Identification And Comparative Opinion Element Extraction
3	Research On The Identification Approach Of Opinion Element Orientation For Chinese Comparative Sentences
4	Research On The Identification Of Comparative Sentences And Relations And Its Application
5	Extraction Of Chinese Comparative Elements And Its Application
6	Extracting Product Review Comparison Factor
7	Study On Comparative Recognition And Extraction Of Comparative Elements Based On Deep Learning
8	Identifying Comparative Chinese Cellphone Online Reviews Based On CR Emotional Lexicon Method
9	Research On Comparative Opinion Mining Based On Web
10	Research On Comparative Sentences Ellipsis Identification And Element Extraction