Sentiment analysis of Twitter data

Posted on:2017-01-26

Degree:M.S

Type:Thesis

University:Rensselaer Polytechnic Institute

Candidate:Yuan, Bo

Full Text:PDF

GTID:2468390011998760

Subject:Computer Science

Abstract/Summary:

Sentiment Analysis and Opinion Mining has become a research hot-spot with the rapid development of social network websites.Twitter is a typical social network application with millions of users expressing their sentiment every day. In this work, we explored comprehensively the methodologies applied in sentiment classification over Twitter data: lexicon-based, rule-based and machine learning-based methods. Our data-set is crawled and manually cleaned with the principle of Naturally Annotated Big Data. The data-set contains 20, 000 tweets ranging over ten popular topics.;For lexicon-based methods, we experimented with the Simple Word Count approach and Feature Scoring approach using most popular sentiment lexicons and semantic resources, namely MPQA subjectivity lexicon, SentiWordNet, Vader Sentiment Lexicon, Bing Liu's lexicon and General Inquirer. We built customized sentiment lexicons, designed featuring scores and compared ten classifiers on real-world Twitter data. Further, we designed Lingusitic Inference Rules(LIR) to improve lexicon-based classifiers. LIR aims to handle negation, valence shift and contrast conjunctions in natural language. For machine learning-based methods, we used state-of-the-art supervised learning models: Naive Bayes, Maximum Entropy and Support Vector Machines. Two sets of features are compared. The first set of features is Bag-of-Words with N-Gram. The second set of features is Part-of-Speech linguistic annotation.

Keywords/Search Tags:

Sentiment, Twitter, Data

Related items

1	Research On Twitter Text Sentiment Analysis Based On Progressive Transductive SVM
2	Sentiment analysis of big social data with Apache Hadoop
3	Empirical Study Of Sentiment-Aware Modeling And Analyses In Social Networking
4	Framework for Crawling and Local Event Detection Using Twitter Data
5	Research On Sentiment Recognition Of Housing Price Twitter
6	Discovering Twitter Users' Off-line Community
7	Sentiment Analysis And Related Issues For Twitter
8	A Case Study on Determining the Big Data Veracity: A Method to Compute the Relevance of Twitter Dat
9	The Design And Implementation Of A Local Event Detection System Using Geo-tagged Twitter Data
10	Are There Perks to Being a Twitter Wallflower? Peripheral Participants in a Twitter-Enabled Learning Space in Public Relations and Higher Educatio