Dynamic Social Network Analysis using Latent Space Models
This is one of the paper written in course Social Media Analysis 10-802 in Spring 2011
Contents
Citation
Purnamrita Sarkar, Andrew W. Moore "Dynamic Social Network Analysis using Latent Space Models"
Online Version
Summary
This paper address the problem of Social Network Analysis using the method of Latent Space Models. More specifically, it addresses the problem of network evolution in social network analysis, by modelling the way in which the friendships drift over time. It efficiently learns this even when n is large, by assuming that nodes represented by points in the latent space do not make large movements over time. Hence it is a latent space model developed for dynamic analysis of social networks to predict the future link structure of the graph.
This paper is generalization of static modelling in Latent Space Models to Dynamic Social Networks by allowing latent coordinates to change smoothly over time, that is, between any two discretized time steps large movements of points are improbable.
This technique is applied to NIPS data to analysis the dynamics of network evolution.
Overview of the Method
Suppose that each observed link is associated with a discrete time step, then each time step produces its own graph. Further, with the Markov assumption the latent locations at the next time step are conditionally independent of locations in all other time steps given locations in current time step. This is very similar to method to HMM models.
Let be the graph at the time step t with n nodes. Let each node at the time step t be represented in p-dimensional latent space, and let be a matrix where each row represents the co-ordinates of a node. The conditional independence assumption shown in the figure below and hence we have the following
Hence we need to model two probability distributions and . The intuition is that is to estimate a graph such that links between pairs of entities which are far away in the latent Euclidian space are less probable and other distribution models the smoothness and assigns large movements of points in latent space less probable.
Model Description
Let us denote the distance between two nodes in the euclidean latent space as . Also, a radius parameter is introduced for every node , which is learned from the data. Further, is defined as greater of the radius of nodes and . Then the probability that there is a link between nodes $i$ and $j$ is given by
Therefore the probability that we observe a graph given coordinates is given by
Further, they show that it is possible to eliminate quadratic computation of the model over all pairs of links by introducing bi-quadratic kernel, hence it simplifies that two nodes have high probability of a link if their latent coordinates are within the radius of of one another.
The transition probability is simply taken as Guassian in the following way. The parameter controls the smoothness of transition
Algorithm
Now that the probability distributions are estimable we need to do the optimization problem to find the right which maximizes the product of these probability distributions. First, the latent coordinates are initialized by a time-dependent variation of the classical Multidimensional Scaling