Topic Modeling: Beyond Bag-of-Words
From Cohen Courses
Jump to navigationJump to search
This a Paper discussed in Social Media Analysis 10-802 in Spring 2011.
Contents
Citation
Hanna M. Wallach: Topic Modeling: Beyond Bag-of-Words. ICML 2006
Online version
Summary
In text analysis community, methods are basically 2-folded: employ n-gram statistics like language-modeling, or recently emerged topic models which using 'bag-of-words', assuming word order doesn't matter. This work tries to incorporate both methods by proposing a hierarchical generative probabilistic model.