Difference between revisions of "Yano et al ICWSM 2010. What’s Worthy of Comment? Content and Comment Volume in Political Blogs"
(Created page with '== Citation == Tae Yano and Noah A. Smith. What’s Worthy of Comment? Content and Comment Volume in Political Blogs. In Proc of ICWSM 2010. == Online Version == [http://www.cs…') |
|||
Line 1: | Line 1: | ||
− | == Citation == | + | == Citation == |
Tae Yano and Noah A. Smith. What’s Worthy of Comment? Content and Comment Volume in Political Blogs. In Proc of ICWSM 2010. | Tae Yano and Noah A. Smith. What’s Worthy of Comment? Content and Comment Volume in Political Blogs. In Proc of ICWSM 2010. | ||
Line 7: | Line 7: | ||
== Summary == | == Summary == | ||
− | This [[Category::Paper]] describes a [[UsesMethod::topic model]] based approach in ... | + | This [[Category::Paper]] describes a [[UsesMethod::topic model]] based approach in modeling the relationship between the text content of a political blog post and the comment volume (i.e. the total amount of response) that a post will receive. |
== Brief description of the method == | == Brief description of the method == | ||
+ | |||
+ | The author's propose a generative model, called the Topic-Poisson model, which proceeds as follows. The number of topic <math>K</math> is fixed in advance. | ||
+ | |||
+ | [[File:yano-icwsm-tp.png]] | ||
== Experimental Result == | == Experimental Result == | ||
+ | |||
+ | '''Task''': Predict whether a blog post will have higher volume than the average seen in training data | ||
+ | (Note that they are NOT predicting the absolute number of words in the comments) | ||
+ | |||
+ | The authors use a subset of the [[UsesDataset::Yano & Smith blog dataset|Yano & Smith blog dataset]]; data from 2 blogs, [http://matthewyglesias.theatlantic.com Matthew Yglesias] (denoted MY) and [http://www.redstate.com/ Red State] (denoted RS) were used. | ||
+ | |||
+ | The compared models were: | ||
+ | * Naive Bayes | ||
+ | * Regression: linear regression with elastic net regularization | ||
+ | * Topic Poisson | ||
+ | * Topic Negative Binomial | ||
+ | * CommentLDA | ||
== Discussion == | == Discussion == | ||
− | == Related Papers == | + | == Related Papers == |
== Study Plan == | == Study Plan == |
Revision as of 13:43, 30 September 2012
Contents
Citation
Tae Yano and Noah A. Smith. What’s Worthy of Comment? Content and Comment Volume in Political Blogs. In Proc of ICWSM 2010.
Online Version
What’s Worthy of Comment? Content and Comment Volume in Political Blogs.
Summary
This Paper describes a topic model based approach in modeling the relationship between the text content of a political blog post and the comment volume (i.e. the total amount of response) that a post will receive.
Brief description of the method
The author's propose a generative model, called the Topic-Poisson model, which proceeds as follows. The number of topic is fixed in advance.
Experimental Result
Task: Predict whether a blog post will have higher volume than the average seen in training data (Note that they are NOT predicting the absolute number of words in the comments)
The authors use a subset of the Yano & Smith blog dataset; data from 2 blogs, Matthew Yglesias (denoted MY) and Red State (denoted RS) were used.
The compared models were:
- Naive Bayes
- Regression: linear regression with elastic net regularization
- Topic Poisson
- Topic Negative Binomial
- CommentLDA