Difference between revisions of "Latent Social Network Influence"

From Cohen Courses
Jump to navigationJump to search
Line 17: Line 17:
 
A ''cascade'' is defined is based on transmission model <math>w(t)</math> and recovery model <math>r(t)</math>. Transmission model defines how much time it takes for an infection to spread from one node to another and recovery model defines how long a node remains infected before it recovers. Further time when each node <math>i</math> is infected is given by <math>\tau_i</math>. With a <math>D</math> cascades given and a ''cascade'' we try to get MLE of the entries of the matrix ''A''.  
 
A ''cascade'' is defined is based on transmission model <math>w(t)</math> and recovery model <math>r(t)</math>. Transmission model defines how much time it takes for an infection to spread from one node to another and recovery model defines how long a node remains infected before it recovers. Further time when each node <math>i</math> is infected is given by <math>\tau_i</math>. With a <math>D</math> cascades given and a ''cascade'' we try to get MLE of the entries of the matrix ''A''.  
  
The following is maximum likelihood estimation problem.  
+
The following is maximum likelihood estimation problem. Here <math>X_c(t)</math> is set of nodes that got infected at the time ''t'' and <math>\tau_i^c = \inf </math> if the node ''i'' is never infected in the cascade ''c''.
 +
 
 
<math>
 
<math>
L(A;D) = \prod_{c \in D} \lbrack \lbrace \prod_{i; \tau_i^c < \inf} P(i infected at \tau_i^c | X_c(\tau_i^c))\rbrace \lbrace \prod_{i;\tau_i^c = \inf P(i never infected |X_c(t) \forall t) \rbrace\rbrack
+
L(A;D) = \prod_{c \in D} \left \lbrack \left \lbrace \prod_{i; \tau_i^c < \inf} P(i\mbox{ infected at }\tau_i^c | X_c(\tau_i^c)) \right \rbrace \left \lbrace \prod_{i;\tau_i^c = \inf} P(i\mbox{ never infected }|X_c(t) \forall t) \right \rbrace \right \rbrack
 +
</math>
 +
 
 +
<math>
 +
L(A;D) = \prod_{c \in D} \left \lbrack \left \lbrace \prod_{i; \tau_i^c < \inf} \left( 1 - \prod_{j;\tau_j \le \tau_i} (1- w(\tau_i^c -\tau_j^c)A_{ij} \right) \right \rbrace \left \lbrace \prod_{i;\tau_i^c = \inf} \prod_{j;\tau_j^c < \inf} (1-A_{ij}) \right \rbrace \right \rbrack
 
</math>
 
</math>
 
== Datasets used ==
 
== Datasets used ==

Revision as of 20:16, 4 February 2011

Citation

S. A. Myers & J. Leskovec "On the Convexity of Latent Social Network Inference" In Neural Information Processing Systems (NIPS), 2010.

Online version

Link to paper

Summary

This is an interesting paper of learning the graph structure which is not a typical Bayesian graph learning. The main problem that the authors try to solve is given a set of nodes and network diffusion, learn the graph over which the contagion spreads. The assumption is information diffusion often happens over an unknown network. Typically we observe the nodes getting infected and the time they get infected, however we cannot know who infected them. The infection can here can be blogs getting posted with a similar news articles, or people buying a new product or it can be a typical disease that is spreading on geographic or people network. This problem is converted to a maximum likelihood based convex optimization problem and further they introduce an penalty to penalize the graph with larger number of nodes.

Description of the method

Assume that there are nodes in the graph and we present a matrix A where

A cascade is defined is based on transmission model and recovery model . Transmission model defines how much time it takes for an infection to spread from one node to another and recovery model defines how long a node remains infected before it recovers. Further time when each node is infected is given by . With a cascades given and a cascade we try to get MLE of the entries of the matrix A.

The following is maximum likelihood estimation problem. Here is set of nodes that got infected at the time t and if the node i is never infected in the cascade c.

Datasets used

Experimental Results

Related Papers