Comparison Rosen-Zvi et al and Cohn et al


Papers Compared

1. The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity by David Cohn and Thomas Hofmann

2. The Author-Topic Model for Authors and Documents by Rosen-Zvi et al

Comparison

The two papers share one big idea: clustering similar documents using more than just the terms in each document. Both use metadata in their topic models: hyperlinks in the first paper and authors in the second. The methods differ. The first paper takes LSA as its baseline but fits a joint model of PLSA and PHITS, whereas the second paper builds its topic model on LDA alone. The datasets were completely different. Overall, the two papers try to solve the same problem using different information and techniques.
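
To make the difference in method concrete, here is a rough sketch of the two objectives as I understand them. The notation is my own assumption and may not match either paper exactly: N_{ij} is the count of term t_i in document d_j, A_{lj} is the count of links from d_j to document c_l, z_k is a latent factor, alpha is a weight between content and links, a_d is the set of authors of document d, phi is the topic-word distribution and theta is the author-topic distribution.

```latex
% Sketch only; symbols are assumptions stated in the paragraph above.
\begin{align}
% Paper 1: joint PLSA + PHITS log-likelihood, terms and links share P(z_k | d_j)
\mathcal{L} &= \sum_j \Bigg[ \alpha \sum_i \frac{N_{ij}}{\sum_{i'} N_{i'j}}
      \log \sum_k P(t_i \mid z_k)\, P(z_k \mid d_j) \notag \\
  &\qquad + (1-\alpha) \sum_l \frac{A_{lj}}{\sum_{l'} A_{l'j}}
      \log \sum_k P(c_l \mid z_k)\, P(z_k \mid d_j) \Bigg] \\
% Paper 2: author-topic word probability, averaging over the document's authors
P(w \mid a_d) &= \frac{1}{|a_d|} \sum_{x \in a_d} \sum_{t} \phi_{wt}\, \theta_{tx}
\end{align}
```

In the first objective, setting alpha to 1 leaves only the term (PLSA) part and setting it to 0 leaves only the link (PHITS) part. In the second, the metadata enters differently: each word is attributed to one of the document's authors, and topics are mixed per author rather than per document.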

Questions and Answers

1. How much time did you spend reading the (new, non-wikified) paper you summarized?

Approximately 1.5 hrs.

2. How much time did you spend reading the old wikified paper?

30 mins.

3. How much time did you spend reading the summary of the old paper?

10 mins.

4. How much time did you spend reading background material?

20 mins.

5. Was there a study plan for the old paper?

No

6. If so, did you read any of the items suggested by the study plan, and how much time did you spend reading them?

-NA-

7. Give us any additional feedback you might have about this assignment.

This assignment was fun. I got to read two related papers and actually take time to think about how similar or different they were. This helps us understand both papers better. Since the summaries are posted on the wiki, everyone should be allowed to edit summaries written by other people.