Difference between revisions of "Lin et al KDD 2010"
Line 15: | Line 15: | ||
== Method == | == Method == | ||
The authors address the event tracking by first defining a term - [[UsesMethod::Popular Event Tracking]] (PET) in online communities which includes the popularity of events over time, the burstiness of user interest, information diffusion through the network structure and the evolution of topics. | The authors address the event tracking by first defining a term - [[UsesMethod::Popular Event Tracking]] (PET) in online communities which includes the popularity of events over time, the burstiness of user interest, information diffusion through the network structure and the evolution of topics. | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
== Dataset == | == Dataset == |
Revision as of 13:50, 4 February 2011
This is a Paper I read in 10802 social media analysis.
Citation
PET:A Statistical Model for Popular Events Tracking in Social Communities
Authors
Cindy Xide Lin, Bo Zhao, Qiaozhu Mei, Jiawei Han
Problem
In this paper, the authors address a method to observe and track the popular events or topics that evolve over time in the communities. Existing methods separate topics and netword structures apart. In this paper, textual topics and network are combined together which makes more sense.
Method
The authors address the event tracking by first defining a term - Popular Event Tracking (PET) in online communities which includes the popularity of events over time, the burstiness of user interest, information diffusion through the network structure and the evolution of topics.
Dataset
In this paper, the authors select twitter as their source of data. They choose 5,000 users with follower-followee relationships and crawling down 1,438,826 tweets displayed by these users during the period from Oct.2009 to early Jan.2010. Each day is regarded as a time point. Document is obtained by simply concatenating all tweets displayed by the user in certain day. The connection is defined as the number of tweets displayed by user by following another user during the period of 30 days.
Comparison with baseline models
Here in this paper, the authors compare PET with some other baseline models such as JonK, Cont, BOM and GInt. The authors apply these models to analyze both the Popularity Trend and Network diffusion. The conclusion is that PET generates the most consistent trends and the smoothest diffusion because PET estimates the popularity by comprehensively considering historic, textual and structured information in a unified way.
The popularity trend is shown in Fig.1. Network diffusion is shown in Fig.2.