Difference between revisions of "Lin et al KDD 2011"

From Cohen Courses
Jump to navigationJump to search
(Created page with '== PET:A Statistical Model for Popular Events Tracking in Social Communities == '''Problem:''' In this paper, the authors address a method to observe and track the popular even…')
 
Line 9: Line 9:
  
 
Definitions:
 
Definitions:
 +
 
1. Network Stream: <math>G=\{G_1,G_2,...,G_T\}</math> is a stream of network structures. Each element <math>G_k</math> in the set is a snapshot of the network at time <math>t_k</math>. <math>G_k = \{V_k, E_k\}</math>.  
 
1. Network Stream: <math>G=\{G_1,G_2,...,G_T\}</math> is a stream of network structures. Each element <math>G_k</math> in the set is a snapshot of the network at time <math>t_k</math>. <math>G_k = \{V_k, E_k\}</math>.  
  
2. Document stream: <math>D=\{D_1,D_2,...,D_T\}</math> is a stream of document collections. <math>D_k</math> is a the set of documents published between time <math>t_(k-1)</math> and <math>t_k</math>. <math>D_k=\{d_(k,1),d_(k,2),...,d_(k,N)\}</math>. <math>d_(k,i)</math> is the text document associated with user i in time <math>t_k</math>.
+
2. Document stream: <math>D=\{D_1,D_2,...,D_T\}</math> is a stream of document collections. <math>D_k</math> is a the set of documents published between time <math>t_{k-1}</math> and <math>t_k</math>. <math>D_k=\{d_{k,1},d_{k,2},...,d_{k,N}\}</math>. <math>d_(k,i)</math> is the text document associated with user i in time <math>t_k</math>.
 +
 
 +
3. Topic: Semantically coherent topic <math>\theta</math> is a multinomial distribution of words <math>\{p(w|\theta)\}</math>.
 +
 
 +
4. Event: <math>\theta^E=\{\theta_0^E,\theta_1^E,...,\theta_T^E\}</math> is a stream of topics. Among these, <math>\theta_0^E</math> is either specified by users or be discovered by an event detection algorithm.
  
3. topic: Semantically coherent topic
+
5. Interest: for each event, at each time point, each user has a certain level of interest in the event which is expressed as <math>h_k(i)</math>.
  
 
   
 
   

Revision as of 21:55, 3 February 2011

PET:A Statistical Model for Popular Events Tracking in Social Communities

Problem: In this paper, the authors address a method to observe and track the popular events or topics that evolve over time in the communities. Existing methods separate topics and netword structures apart. In this paper, textual topics and network are combined together which makes more sense.

Method: The authors address the event tracking by first defining a term - Popular Event Tracking (PET) in online communities which includes the popularity of events over time, the burstiness of user interest, information diffusion through the network structure and the evolution of topics.

PET leverages a Gibbs Random Field to model the interest of users, depending on their historical status as well as the influence form their social connections. The intuition here is that my current interest will be strongly related with my previous interest. Also my interest will be influenced by my friends which are my connections in social media.

Definitions:

1. Network Stream: is a stream of network structures. Each element in the set is a snapshot of the network at time . .

2. Document stream: is a stream of document collections. is a the set of documents published between time and . . is the text document associated with user i in time .

3. Topic: Semantically coherent topic is a multinomial distribution of words .

4. Event: is a stream of topics. Among these, is either specified by users or be discovered by an event detection algorithm.

5. Interest: for each event, at each time point, each user has a certain level of interest in the event which is expressed as .


The theories in this paper relies on the following three important observations: 1. user i's current interest is influenced by i's connections and a stronger tie brings a larger impact. 2. interest values are generally consistent over time. 3. a higher interest of user i in event should result in a higher proportion of the event covered in by user i.