Cascading Behavior in Large Blog Graphs
This is a scientific paper authored by Leskovec et. al. and appeared in SDM 2007. Below is the paper summary written by Tuan Anh.
Citation
@proceedings{DBLP:conf/sdm/2007,
title = {Proceedings of the Seventh SIAM International Conference on Data Mining, April 26-28, 2007, Minneapolis, Minnesota, USA}, booktitle = {SDM}, publisher = {SIAM}, year = {2007}, bibsource = {DBLP, http://dblp.uni-trier.de}
}
Online Version
Cascading Behavior in Large Blog Graphs
Summary
This paper reports empirical studies on how information diffuses in blog-sphere and propose a model for modeling the diffusion process. By representing the diffusion step of information from one post to other post, or from one blog to other blog, by citing link from the later to the former, the authors track the diffusion chain by sets of citing subgraphs. The authors then conduct some statistics on the set of citing subgraphs. Their key findings includes:
- The number of blog posts per day has a weekly periodicity.
- The popularity of posts, which is measures by number of citing posts, drops with a power law of number of days after the post
- The blog network follows power law distribution, and so does the post network
- Most of citing subgraphs in post network, or "cascades" as they are called in the paper, have a tree-like shape with about 97% of cascades is trivial (isolated posts)
- The size of cascade follows Zipf distribution
- The number of cascades that a node (post) taking part in follows power law
Finally, the authors propose a model for modeling the cascades based on diffusion models in epidemiology, i.e., SIS model and its variations. They validate their models by comparing the above observed statistics and the ones obtained from simulating datasets which are generated by the models.
Dicussion
In this paper, the author study aggregated behaviors of information cascades without a deep analysis of which type of information exert which behavior. A deeper study from the content of blogs and post should give more insights of how information diffuses within blog-sphere
Related papers
There are some similar works on tracking diffusion on social media, e.g.,
- E. Adar and Lada. Adamic Tracking Information Epidemics in Blogspace
- Daniel M. Romero et. al. Differences in the Mechanics of Information Diffusion Across Topics: Idioms, Political Hashtags, and Complex Contagion on Twitter
- J. Leskovec et. al http://www.cs.cornell.edu/home/kleinber/kdd09-quotes.pdf