Font Size: a A A

Research On Sentiment Orientation Analysis Of Blog Article Based On Blog Search

Posted on:2011-09-18Degree:MasterType:Thesis
Country:ChinaCandidate:Y C FuFull Text:PDF
GTID:2218330368499759Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
With the popularity of the Internet and its rapid development worldwide, the information from online blog is exploding. The utilization rate of blog is as much as 57.7%, and the recognition and popularity level of blogs among Internet users are getting increased everyday. Blog makes that the authors express their opinions easily, and the readers can quickly browse and comment on the blog articles. The form of blog on sharing ideas becomes more and more popular. Therefore blog has become an important platform for sentiment expression and communication, and makes the emergence and spread of public opinion become the primary venue.However, in the over-expansion information ages, Internet users are more concerned about the concise, sentiment orientation information from celebrities. In order to quickly get blog article sentiment information for supporting or opposing from the Blog field on-demand, people have the urgent need for an appropriate sentiment search tools, which can easily organize and search the massive resources of blog. At this time, the best option is blog sentiment orientation retrieval.Based on the analysis on the sentiment factors implied in the Chinese blog article, combined with natural language processing technique, this thesis proposed the SPOA (Sentiment-dictionary and Parsing based Orientation Analysis) method of blog sentiment orientation analysis. In the blog article's pre-processing stage, a foundation sentiment dictionary and a polysemy dictionary were constructed for the identification of sentiment words in blog articles. The relationship of the group as the smallest unit of sentiment analysis, and combining the proposed VCCA algorithm (VOB and CMP Convert to ADV) on sentiment ectopic, the proposed algorithm makes the calculation of the modification degree of context-sensitive more accurate and reasonable.Then the experiment results show that the SPOA method based on dependency syntax is better than windows modified algorithm in emotions analysis on the Chinese blog articles. The syntax distance and dependency modification make the performance of sentiment orientation analysis improved significantly. There are no sharp difference between blog article's full-text analysis and network abstract analysis. However, the key emotional sentences processing for structural characteristics of blog articles makes significant advantages of overall performance. This shows that the structural features of the blog articles have impact on sentiment analysis obviously.Finally, a prototype system for blog article sentiment retrieval based on the SPOA algorithm is implemented, which sorted the search results by user preference requirements.
Keywords/Search Tags:blog search, opinion mining, sentiment analysis, syntactical analysis, text mining
PDF Full Text Request
Related items