Difference between revisions of "M. Kim and J. Leskovec. ICML'12"

From Cohen Courses
Jump to navigationJump to search
 
(16 intermediate revisions by the same user not shown)
Line 9: Line 9:
 
This is a paper on block based network analysis and prediction. This work furthers the work by Airoldi. et. al (see Related papers) to the
 
This is a paper on block based network analysis and prediction. This work furthers the work by Airoldi. et. al (see Related papers) to the
 
extent that each node/ user can actually belong to more than one block and node features are modeled in addition to link existence. The generative process is given as below (see the figure)</p>
 
extent that each node/ user can actually belong to more than one block and node features are modeled in addition to link existence. The generative process is given as below (see the figure)</p>
<p>
+
 
 
[[File:KimLeskovec.png]]
 
[[File:KimLeskovec.png]]
 +
 +
 +
<p>On the figure above, <math>N</math>, <math>L</math>, <math>K</math> is the number of users, number of user feature categories, and number of blocks respectively. The block membership probability of node <math>i</math> at bloc <math>k</math> is <math>\phi_{ik}\in [0, 1]</math>. Note that this model does not require <math>\sum_{k}\phi_{ik} = 1</math>
 +
*For each node <math>i</math>, and each block <math>k</math>, <math>\phi_{ik}</math> is generated from [http://en.wikipedia.org/wiki/Beta_distribution Beta distribution]: <math> \phi_{ik}\sim Beta(\alpha_{k1},\alpha_{k2})</math>
 +
*Then
 +
**The truly block membership of node <math>i</math> at block <math>k</math> is a binary indicator which is generated from [http://en.wikipedia.org/wiki/Bernoulli_distribution Bernoulli distribution] <math>z_{ik}\sim Bernoulli(\phi_{ik})</math>
 +
**The features of node <math>i</math> at feature category <math>l</math> is a is generated from [http://en.wikipedia.org/wiki/Bernoulli_distribution Bernoulli distribution] with the parameter is computed from block membership probabilities using [http://en.wikipedia.org/wiki/Logistic_function Logistic function]
 +
***<math>F_{il} \sim Bernoulli(y_{il})</math>
 +
***where <math>y_{il} = \frac{1}{1+exp(w_l^T\phi_i)}</math>
 +
**The link between node <math>i</math> and node <math>j</math> is generated from [http://en.wikipedia.org/wiki/Bernoulli_distribution Bernoulli distribution] with the parameter is the product of block-wise linking probabilities
 +
***<math>A_{ij} \sim Bernoulli(p_{il})</math>
 +
***where <math>p_{ij} = \prod_k\theta_k(z_{ik}, z_{jk})</math>
 
</p>
 
</p>
<p>On the figure above, <math>N</math>, <math>L</math>, <math>K</math> is the number of users, number of user feature categories, and number of blocks respectively. The block membership probability of node <math>i</math> at bloc <math>k</math> is <math>\phi_{ik}\in [0, 1]</math>. Note that this model does not require <center><math>\sum_{k}\phi_{ik}= 1</math></center>.
+
<p> Given the hyperparameter <math>\alpha</math>, the other parameters are estimated by maximum likelihood function using a variational inference algorithm (see the paper for the details).
*For each user <math>i</math>, and each block <math>k</math>, <math>\phi_{ik}</math> is generated from [http://en.wikipedia.org/wiki/Beta_distribution Beta distribution]: <center><math> \phi_{ik}\sim Beta(\alpha_{k1},\alpha_{k2})</math></center>
 
*Then, the truly block membership of user <math>i</math> at block <math>k</math> is a binary indicator which is generated from [http://en.wikipedia.org/wiki/Bernoulli_distribution Bernoulli distribution] <math>z_{ik}\sim Bernoulli(\phi_{ik})</math>
 
</p>
 
  
== Dicussion ==
+
== Discussion ==
 +
This work consider the case when a node actually belongs to more than one block and multiple type of links can be existed among nodes. Furthermore, node features are used to regularize the block membership values of nodes. However, the notation of binary block indicator <math>z_{ik}</math> is not well explained. <math>z_{ik}</math> is also a hidden variable and does not a play role that is clearly distinctive from <math>\phi_{ik}</math>'s.
  
 
== Related papers ==
 
== Related papers ==
 
*Probabilistic graph clustering: Airoldi. et. al. [http://jmlr.csail.mit.edu/papers/volume9/airoldi08a/airoldi08a.pdf Mixed Membership Stochastic Blockmodels]. Journal of Machine Learning Research 9 (2008) 1981-2014
 
*Probabilistic graph clustering: Airoldi. et. al. [http://jmlr.csail.mit.edu/papers/volume9/airoldi08a/airoldi08a.pdf Mixed Membership Stochastic Blockmodels]. Journal of Machine Learning Research 9 (2008) 1981-2014
 
*The paper by Hoff on [http://www.csss.washington.edu/Papers/wp54.pdf Multiplicative latent factor models for description and prediction of social networks]
 
*The paper by Hoff on [http://www.csss.washington.edu/Papers/wp54.pdf Multiplicative latent factor models for description and prediction of social networks]

Latest revision as of 14:30, 2 October 2012

This is a scientific paper authored by M. Kim and J. Leskovec, and appeared in ICML'12. Below is the paper summary written by Tuan Anh.

Citation

Online Version

Latent Multi-group Membership Graph Model.

Summary

This is a paper on block based network analysis and prediction. This work furthers the work by Airoldi. et. al (see Related papers) to the extent that each node/ user can actually belong to more than one block and node features are modeled in addition to link existence. The generative process is given as below (see the figure)

KimLeskovec.png


On the figure above, , , is the number of users, number of user feature categories, and number of blocks respectively. The block membership probability of node at bloc is . Note that this model does not require

  • For each node , and each block , is generated from Beta distribution:
  • Then
    • The truly block membership of node at block is a binary indicator which is generated from Bernoulli distribution
    • The features of node at feature category is a is generated from Bernoulli distribution with the parameter is computed from block membership probabilities using Logistic function
      • where
    • The link between node and node is generated from Bernoulli distribution with the parameter is the product of block-wise linking probabilities
      • where

Given the hyperparameter , the other parameters are estimated by maximum likelihood function using a variational inference algorithm (see the paper for the details).

Discussion

This work consider the case when a node actually belongs to more than one block and multiple type of links can be existed among nodes. Furthermore, node features are used to regularize the block membership values of nodes. However, the notation of binary block indicator is not well explained. is also a hidden variable and does not a play role that is clearly distinctive from 's.

Related papers