Difference between revisions of "Chang and Blei, AOAS2010"
(3 intermediate revisions by the same user not shown) | |||
Line 6: | Line 6: | ||
== Motivation == | == Motivation == | ||
− | + | For Network data, such as social networks of friends, citation networks of documents or hyperlinked networks of web pages, people want to point social network members toward new friends, scientific papers toward relevant citations or web pages toward other related pages. They also want to uncover the hidden community structure. This [[Category::paper]] developed a hierarchical model of both network structure and node attributes, based on [[UsesMethod::Latent Dirichlet Allocation]]. | |
− | |||
− | |||
− | |||
− | + | == Methodology == | |
− | + | [[File:RTM.jpg | 400px]] | |
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
1. For each document <math>d</math>: | 1. For each document <math>d</math>: | ||
Line 37: | Line 25: | ||
== Inference, Estimation and Prediction == | == Inference, Estimation and Prediction == | ||
− | + | Prediction | |
− | + | * Link prediction from words | |
− | * | ||
− | |||
<math>p(y_{d,d'}|w_d, w_{d'}) \approx E_q [p(y_{d,d'}|\bar{z_d},\bar{z_{d'}})]</math> | <math>p(y_{d,d'}|w_d, w_{d'}) \approx E_q [p(y_{d,d'}|\bar{z_d},\bar{z_{d'}})]</math> | ||
− | + | * Words prediction from link | |
<math>p(w_{d,i}|y_d) \approx E_q [p(w_{d,i}|z_{d,i})]</math> | <math>p(w_{d,i}|y_d) \approx E_q [p(w_{d,i}|z_{d,i})]</math> | ||
Line 52: | Line 38: | ||
== Results == | == Results == | ||
+ | * Evaluating the predictive distribution | ||
+ | [[File:result1.jpg | 500px]] | ||
+ | |||
+ | * Automatic link suggestion | ||
+ | [[File:result2-1.jpg | 500px]] | ||
+ | |||
+ | [[File:result2-2.jpg | 500px]] | ||
+ | |||
+ | * Modeling spatial data | ||
+ | [[File:result3.jpg | 400px]][[File:result4.jpg | 400px]] | ||
+ | |||
+ | == Related papers == | ||
+ | [[RelatedPaper::Airoldi et al, ML2008]] AIROLDI, E., BLEI, D., FIENBERG, S. and XING, E. (2008). Mixed membership stochastic blockmodels. J. Mach. Learn. Res. 9 1981–2014. | ||
+ | |||
+ | [[RelatedPaper::Blei et al, NIPS2007]] BLEI, D. M. and MCAULIFFE, J. D. (2007). Supervised topic models. In Neural Information Processsing Systems. Vancouver. | ||
+ | |||
+ | [[RelatedPaper::Dietz et al, ICML2007]] DIETZ, L., BICKEL, S. and SCHEFFER, T. (2007). Unsupervised prediction of citation influences. In Proc. ICML. Available at http://portal.acm.org/citation.cfm?id=1273526. | ||
+ | |||
+ | [[RelatedPaper::Nallapati et al, ACM2008]] NALLAPATI, R., AHMED, A., XING, E. P. and COHEN, W. W. (2008). Joint latent topic models for text and citations. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 542–550. ACM Press, New York. |
Latest revision as of 14:40, 31 March 2011
Contents
Citation
J. Chang and D. Blei. Hierarchical relational models for document networks. Annals of Applied Statistics, 4(1):124–150, 2010
Online version
Motivation
For Network data, such as social networks of friends, citation networks of documents or hyperlinked networks of web pages, people want to point social network members toward new friends, scientific papers toward relevant citations or web pages toward other related pages. They also want to uncover the hidden community structure. This paper developed a hierarchical model of both network structure and node attributes, based on Latent Dirichlet Allocation.
Methodology
1. For each document :
(a) Draw topic proportions (b) For each word : i. Draw assignment . ii. Draw word .
2. For each pair of documents ,:
(a) Draw binary link indicator where
Inference, Estimation and Prediction
Prediction
- Link prediction from words
- Words prediction from link
Data
- Cora: abstracts + citation link
- WebKB: web pages + hyperlinks
- PNAS: abstracts + intra-PNAS citation
- LocalNews: local news of each state in U.S + geographical adjacency
Results
- Evaluating the predictive distribution
- Automatic link suggestion
- Modeling spatial data
Related papers
Airoldi et al, ML2008 AIROLDI, E., BLEI, D., FIENBERG, S. and XING, E. (2008). Mixed membership stochastic blockmodels. J. Mach. Learn. Res. 9 1981–2014.
Blei et al, NIPS2007 BLEI, D. M. and MCAULIFFE, J. D. (2007). Supervised topic models. In Neural Information Processsing Systems. Vancouver.
Dietz et al, ICML2007 DIETZ, L., BICKEL, S. and SCHEFFER, T. (2007). Unsupervised prediction of citation influences. In Proc. ICML. Available at http://portal.acm.org/citation.cfm?id=1273526.
Nallapati et al, ACM2008 NALLAPATI, R., AHMED, A., XING, E. P. and COHEN, W. W. (2008). Joint latent topic models for text and citations. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 542–550. ACM Press, New York.