Difference between revisions of "Compare Yano et al NAACL 2009 Link PLSA LDA"

From Cohen Courses
Jump to navigationJump to search
Line 8: Line 8:
  
 
* method
 
* method
* dataset used - Yano et al. uses a corpus of blog posts from 40 different blog sites focusing on American politics during from November 2007 to October 2008 (right up to a presidential election). Diversity in political leanings was emphasized for the final selection. Five blogs were chosen for the final selection
+
* dataset used - Yano et al. uses a corpus of blog posts from 40 different blog sites focusing on American politics during from November 2007 to October 2008 (right up to a presidential election). Diversity in political leanings was emphasized for the final selection. Five blogs were chosen for the final selection. Nallapati and Cohen also use a corpus of blogs, but these were collected from July 2004 - July 2005. Initially, it was a noisy dataset with lots of broken links and useless information. The authors constrained blogs used to have a minimum of 2 ingoing or 2 outgoing links.
 
* problem
 
* problem
 
* big idea
 
* big idea

Revision as of 18:09, 30 November 2012

Papers

The papers are:

Comparison

  • method
  • dataset used - Yano et al. uses a corpus of blog posts from 40 different blog sites focusing on American politics during from November 2007 to October 2008 (right up to a presidential election). Diversity in political leanings was emphasized for the final selection. Five blogs were chosen for the final selection. Nallapati and Cohen also use a corpus of blogs, but these were collected from July 2004 - July 2005. Initially, it was a noisy dataset with lots of broken links and useless information. The authors constrained blogs used to have a minimum of 2 ingoing or 2 outgoing links.
  • problem
  • big idea
  • other

Questions

  1. How much time did you spend reading the (new, non-wikified) paper you summarized? About 2 hours
  2. How much time did you spend reading the old wikified paper? About 2 hours
  3. How much time did you spend reading the summary of the old paper? About 15 min
  4. How much time did you spend reading background material? N/A My final project for the class is on this area so I've read a lot of background papers
  5. Was there a study plan for the old paper? Yes
    1. if so, did you read any of the items suggested by the study plan? and how much time did you spend with reading them? I had actually read the papers before as it is directly related to my research with my advisor. I do a lot of Gibbs Sampling on graphical models (in particular topic-model derivatives) and that fits into the study plan
  6. Give us any additional feedback you might have about this assignment. I like this comparison. It was a nice way to view the papers in a different light and really made it stick in my memory. In general, I like the wikifying and used it extensively for the project (and probably will use this for my research in the future after the class is over).