Pointwise mutual information

From Cohen Courses
Revision as of 11:42, 3 September 2010 by WikiAdmin (talk | contribs) (1 revision)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

This is a method discussed in Social Media Analysis 10-802 in Spring 2010.

If X and Y are random variables, the pointwise mutual information between two possible outcomes X=x and Y=y is

This quantity is zero if x and y are independent, positive if they are positively correlated, and negative if they are negatively correlated.

In Turney, ACL 2002 this was used as a way of assessing the semantic orientation of words or phrases. Specifically the semantic orientation of x was defined as

In more detail, Turney interpreted "X=x and Y=y" as an event where two words x and y occur nearby in the same document, and "X=x" as an event where word x occurs in a document. After some simplification, SO(x) can then be written as

This means that SO(x) can be computed quickly - with just two queries to a search engine.


Relevant Papers

 AddressesProblemUsesDataset
Turney, ACL 2002Review classification