Difference between revisions of "Topic Model Approach to Authority Identification"
Line 29: | Line 29: | ||
Number of "helpful" user votes for each review were considered as a proxy for ranking reviews authoritativeness. | Number of "helpful" user votes for each review were considered as a proxy for ranking reviews authoritativeness. | ||
− | |||
− | |||
− | |||
== Features Used == | == Features Used == | ||
== Task Description and Evaluation == | == Task Description and Evaluation == | ||
− | |||
== Findings == | == Findings == |
Revision as of 19:47, 1 October 2012
This a Paper reviewed for Social Media Analysis 10-802 in Fall 2012.
Contents
Citation
author = {Alexandre Passos and Jacques Wainer and Aria Haghighi}, title = {What do you know? A topic-model approach to authority identification}, journal = {NIPS 2010 Workshop on Computational Social Science and the Wisdom of the Crowds}, year = {2010}
Online version
What do you know? A topic-model approach to authority identification
Summary
In this paper the authors present a preliminary study of basic approaches to the problem of identifying authoritative documents in a given domain using textual content and report their best performing approach using Hierarchical Topic Models [Blei et al, 2004]. Authoritative documents are ones which exhibit novel and relevant information relative to a document collection while demonstrating domain knowledge. Authors define authoritativeness identification task as a ranking problem and focus on product (book GoodReads and restaurant Yelp) reviews utilizing user votes as proxy for helpfulness and authoritativeness.
Dataset Description
The authors have reported results on two datasets.
- Book Reviews GoodReads dataset
* First 326 books in the "Best Books Ever" Category * First 60 odd reviews from each book.
- Restaurant Reviews Yelp dataset
* 283 Most reviewed restaurants in the Boston/Cambridge area
Number of "helpful" user votes for each review were considered as a proxy for ranking reviews authoritativeness.