Difference between revisions of "Mrinmaya et. al. WWW'12"
Line 42: | Line 42: | ||
== Dicussion == | == Dicussion == | ||
+ | This is yet another paper on topic modeling based network clustering. The underlying assumption is that when a user writes to other users, the topic is decided by the community of the sender and the sender herself; and the recipient and the type of message (reply, forward, etc) are decided by the community of the sender only. This make the transform from user-topic similarity to user-community similarity is not straightforward. | ||
== Related papers == | == Related papers == |
Revision as of 12:10, 2 October 2012
This is a scientific paper authored by Mrinmaya Sachan, and appeared in WWW'12. Below is the paper summary written by Tuan Anh.
Citation
@inproceedings{Sachan:2012:UCI:2187836.2187882,
author = {Sachan, Mrinmaya and Contractor, Danish and Faruquie, Tanveer A. and Subramaniam, L. Venkata}, title = {Using content and interactions for discovering communities in social networks}, booktitle = {Proceedings of the 21st international conference on World Wide Web}, series = {WWW '12}, year = {2012}, isbn = {978-1-4503-1229-5}, location = {Lyon, France}, pages = {331--340}, numpages = {10}, url = {http://doi.acm.org/10.1145/2187836.2187882}, doi = {10.1145/2187836.2187882}, acmid = {2187882}, publisher = {ACM}, address = {New York, NY, USA}, keywords = {community detection, probabilistic methods, social networks},
}
Online Version
Using Content and Interactions for Discovering Communities in Social Networks.
Summary
In this paper, the authors study the problem of communities detection in social networks. They employ the probabilistic approach and propose a generative model that describes how users' exchanged messages and interactions are generated from the hidden membership of each user. The general model, or ``full model" as called by the author, has the generative process as follows.
- For each of the topics, , sample topic as a dimensional multinomial distribution over words
- For each of the communities, sample social type interaction as a dimensional multinomial distribution over type of interactions
- For each of the communities, sample social type interaction recipient as a dimensional multinomial distribution over set of users
- For the each user ,
- Sample a dimensional multinomial, representing the community proportions for that sender.
- For each community , sample a dimensional multinomial, , representing the topic proportions for community and sender.
- For each post generated by the sender having words:
- Choose a community assignment for all for the post.
- For each recipient slot , of the post : Choose a recipient for all for the post.
- Choose a social interaction type for all for the post.
- For each word slot in :
- Choose a topic assignment for all
- Choose a word
In the model presented above, and is the number of topics and the number of communities respectively. They are user defined parameters and should be given before hand. are hyper-parameters and should also given before hand. The other parameters are estimated using Gibbs sampling method.
Dicussion
This is yet another paper on topic modeling based network clustering. The underlying assumption is that when a user writes to other users, the topic is decided by the community of the sender and the sender herself; and the recipient and the type of message (reply, forward, etc) are decided by the community of the sender only. This make the transform from user-topic similarity to user-community similarity is not straightforward.
Related papers
- Airoldi. et. al. Mixed Membership Stochastic Blockmodels. Journal of Machine Learning Research 9 (2008) 1981-2014
- Blei. et. al. Latent Dirichlet Allocation. Journal of Machine Learning Research 3 (2003) 993-1022
- Gregor Heinrich's note on Parameter estimation for text analysis