SAGE Weibo
For this project, we are looking at using generative, graphical models to predict retweets, comments, tweets-at, follower networks of users of Tencent Weibo. In particular, we would like to examine how the influence of different demographic factors impacts the distribution of words in the social network and the structure of the network. We plan on looking at basic graphical models such as LDA and then extend this graphical model to account for the other structure in the data. In addition to looking at collapsed samplers to infer hidden variables and structures in our model, we will evaluate how SAGE can be used to increase robustness of our models on held-out data and reduce computational complexity.
Dataset
We will be using the data from the KDD Cup 2012 Track 1 (Tencent Weibo