Font Size: a A A

Extraction Collocation Relations Of Product Opinion Target And Sentiment Words

Posted on:2014-01-02Degree:MasterType:Thesis
Country:ChinaCandidate:L B XuFull Text:PDF
GTID:2248330398471582Subject:Computer technology
Abstract/Summary:PDF Full Text Request
Product reviews mining is to use natural language processing techno-logy from network in the comment text extract the user product evaluation information, which help users to purchase products or services from the decision, provide valuable information to business or producers to improve products and improve service quality for decision-making. Product evaluation targets and opinion word collocation extraction is a very important research topic in the mining of product reviews, is a very important part of the basis of sentiment analysis tasks, but also emotional information extraction. This paper makes use of the natural language processing, machine learning, statistical machine translation technology, in view of the network Chinese review information, extract the evaluation target and evaluation word collocation relations. We carried out the study and comparison of the two methods, according to the special features of the evaluation collocation relationship in the Chinese text and based on basis of previous studies.1. This article will evaluate collocation extraction problem is transformed into a sequence labeled problem, constructing the word itself, part of speech, dependence relationship, word location and other characteristics, based on conditional random field model extract comment text opinion target and emotional word with relationships. This method is suitable for comparison rules, length short text sentences, and the need to manually label part of the training corpus. 2. The other way is make opinion target and emotional words with relationship problems transformed into word alignment problem, the use of machine learning methods based on the statistics of the source and channel corresponding relationship between the words and the words extracted from the parallel corpus, alignment probability is greater than the threshold value in the two words is considered to be a candidate evaluation collocation. After get the candidate evaluation collocation relationship, using the principle of random walk after several times of iteration calculation for each candidate words convergence probability, set up separately evaluation words and the evaluated object’s threshold, filtering the words which value is less than the threshold value. The rest of the words between candidate evaluation collocations, that they are the final evaluation with the relationship. This method is suitable for large-scale evaluation of corpus, without manual label the corpus.Finally, in this paper, using two different corpora respectively on the two methods of experiment, the experimental results show that the proposed two methods in the evaluation of product collocation extraction is feasible, and obtain better accuracy rate and recall rate. Research of these methods to other tasks of sentiment analysis has an important reference value.
Keywords/Search Tags:Evaluation of collocation extraction, Conditional RandomFields, Word Alignment, Random Walk
PDF Full Text Request
Related items