Segmented Topic Model

From Cohen Courses
Revision as of 15:50, 29 September 2012 by Ymiao (talk | contribs) (Created page with 'Segmented Topic Model is a new form of topic model which can take into account the inner structures in documents. The basic ideas are: * As in LDA, one document ''d'' has a mult…')
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

Segmented Topic Model is a new form of topic model which can take into account the inner structures in documents. The basic ideas are:

  • As in LDA, one document d has a multinomial distribution v(d) over latent topics
  • In this document, each segment d,s (sentence or paragraph) also has a multinomial distribution over topics. This distribution is generated from a two-parameter Poisson-Dirichlet process r(d,s)~ Poisson-Dirichlet(v(d),a,b)