Difference between revisions of "OConnor et. al., ICWSM 2010"

From Cohen Courses
Jump to navigationJump to search
Line 16: Line 16:
 
== Discussion ==
 
== Discussion ==
  
This paper shows that simple methods that are subject to noise perform relatively well when used. Identifying related tweets is performed simply by searching tweets that contain a set of pre-selected keywords, such as Obama and Mccain for tweets about presidential elections. The general sentiment of a tweet is identified by looking for words that have an either positive or negative polarity, and the sentiment of a tweet is classified as positive if it contains a positive polarity word and negative if it contains a negative polarity word or both if it contains both positive and negative polarity words.
+
This paper shows that simple methods that are subject to noise perform relatively well when used to estimate aggregate sentiment.  
 +
 
 +
Identifying related tweets is performed simply by searching tweets that contain a set of pre-selected keywords, such as Obama and Mccain for tweets about presidential elections. This means that many tweets that are not oriented to this topic will also be selected.
 +
 
 +
The general sentiment of a tweet is identified by looking for words that have an either positive or negative polarity, and the sentiment of a tweet is classified as positive if it contains a positive polarity word and negative if it contains a negative polarity word or both if it contains positive and negative polarity words. The aggregate sentiment is calculated as the ratio between the number of positive tweets and the number of negative ones.
 +
 
  
 
== Related papers ==
 
== Related papers ==
  
 
== Study plan ==
 
== Study plan ==

Revision as of 10:56, 26 September 2012

Citation

Brendan O’Connor, Ramnath Balasubramanyan, Bryan R. Routledge, and Noah A. Smith. 2010a. From tweets to polls: Linking text sentiment to public opinion time series. In Proc. of ICWSM.

Online version

From tweets to polls: Linking text sentiment to public opinion time series

Summary

This Paper attempts correlate the results of several surveys related consumer confidance and political opinions, with the sentiment words frequencies found in Twitter. The main motivation is that mining opinions in Twitter can be used as an alternative method to conducting surveys, which can be time consuming and comparatively expensive.

This task can be divided into two steps. First, collect relevant tweets from the Twitter corpora and then determine whether the tweets express positive or negative opinion.

Evaluation

Discussion

This paper shows that simple methods that are subject to noise perform relatively well when used to estimate aggregate sentiment.

Identifying related tweets is performed simply by searching tweets that contain a set of pre-selected keywords, such as Obama and Mccain for tweets about presidential elections. This means that many tweets that are not oriented to this topic will also be selected.

The general sentiment of a tweet is identified by looking for words that have an either positive or negative polarity, and the sentiment of a tweet is classified as positive if it contains a positive polarity word and negative if it contains a negative polarity word or both if it contains positive and negative polarity words. The aggregate sentiment is calculated as the ratio between the number of positive tweets and the number of negative ones.


Related papers

Study plan