A Clustering Approach for the Nearly Unsupervised Recognition of Nonliteral Language, EACL-2006

Citation

Birke, J. and A. Sarkar. 2006. A clustering approach for the nearly unsupervised recognition of nonliteral language. In Proceedings of EACL-06, pages 329–336.

Online Version

pdf link to the paper

Method Summary

TroFi (TropeFinder) System

Task: Classifying literal and nonliteral usages of verbs
Method: Use nearly unsupervised word-sense disambiguation and clustering techniques

Processing Steps

KE Algorithm: Similarity-based word-sense disambiguation algorithm
- Similarities are calculated between:
  1. Sentences containing the word we wish to disambiguate (the target word)
  2. Collections of seed sentences (feedback sets)
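The attraction idea above can be sketched in a few lines. This is a simplified illustration, not the KE algorithm itself: the original iteratively updates word-to-word and sentence-to-sentence similarities, whereas this sketch swaps in a plain Jaccard overlap between bags of words. The function names (`attraction`, `classify`) and the seed sentences are assumptions made for illustration.

```python
# Simplified stand-in for similarity-based attraction (NOT the full KE algorithm):
# Jaccard word overlap replaces KE's iterative similarity computation.

def tokenize(sentence):
    """Lowercase a sentence and return its set of whitespace-separated tokens."""
    return set(sentence.lower().split())

def jaccard(a, b):
    """Jaccard similarity between two token sets; 0.0 if either is empty."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)

def attraction(target, feedback_set):
    """Highest similarity between the target sentence and any seed sentence."""
    t = tokenize(target)
    return max((jaccard(t, tokenize(s)) for s in feedback_set), default=0.0)

def classify(target, literal_set, nonliteral_set):
    """Assign the target sentence to the feedback set that attracts it more."""
    lit = attraction(target, literal_set)
    non = attraction(target, nonliteral_set)
    if lit == non:
        return "undecided"
    return "literal" if lit > non else "nonliteral"

# Hypothetical feedback sets for the target verb "grasped".
LITERAL_SEEDS = ["she grasped the rope with both hands"]
NONLITERAL_SEEDS = ["he grasped the idea quickly"]

label = classify("the student grasped the idea", LITERAL_SEEDS, NONLITERAL_SEEDS)
```

Here the target sentence shares three words with the nonliteral seed and only two with the literal seed, so it is attracted to the nonliteral set.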
Clean the Feedback Sets
- Purpose: remove feedback-set content that causes false attraction
- Four principles of scrubbing:
  1. Human annotations (in DoKMIE) are reliable
  2. Phrasal and expression verbs are often indicative of nonliteral uses
  3. Content words appearing in both feedback sets should be avoided
  4. Learning & voting: Use four learners (A, B, C, D) that vote on the best scrubbing action
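Principles 3 and 4 can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the tiny stopword list, the choice to keep the target word even though it appears in both sets, and the plain majority vote with ties left undecided. The paper's learners and its exact voting scheme are not reproduced.

```python
from collections import Counter

# Tiny illustrative stopword list (an assumption; a real system would use a fuller one).
STOPWORDS = {"the", "a", "an", "of", "to", "in", "with", "he", "she", "his", "her"}

def content_words(sentences):
    """Collect the non-stopword tokens appearing in a list of sentences."""
    words = set()
    for s in sentences:
        words.update(w for w in s.lower().split() if w not in STOPWORDS)
    return words

def scrub_shared_words(literal_set, nonliteral_set, target_word):
    """Drop content words that occur in both feedback sets (principle 3).

    The target word is kept, since it occurs in both sets by construction.
    """
    shared = content_words(literal_set) & content_words(nonliteral_set)
    shared -= {target_word}

    def scrub(sentences):
        return [" ".join(w for w in s.lower().split() if w not in shared)
                for s in sentences]

    return scrub(literal_set), scrub(nonliteral_set)

def majority_vote(labels):
    """Return the most common label, or 'undecided' on a tie or empty input."""
    counts = Counter(labels).most_common()
    if not counts:
        return "undecided"
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return "undecided"
    return counts[0][0]

# Hypothetical feedback sets and learner outputs.
lit, non = scrub_shared_words(
    ["she grasped the rope tightly"],
    ["he grasped the idea tightly"],
    target_word="grasped",
)
vote = majority_vote(["literal", "nonliteral", "nonliteral", "nonliteral"])
```

In this example the shared content word "tightly" is removed from both sets, and three of the four hypothetical learner outputs agree on "nonliteral".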

Result

TroFi achieved an F1-score of 0.538, outperforming the baseline by 24.4% (on human-labeled data).
The authors also built the TroFi Example Base, a freely available metaphor-annotated resource.

Discussion and Thought

This work explores an approach to metaphor identification that is discussed relatively rarely. Compared with selectional restriction modeling or lexicon-based methods, it requires less human involvement and adopts well-developed techniques borrowed from word sense disambiguation.
Models_of_metaphor_in_NLP criticizes this work for not defining its task clearly. Part of the reason is that the authors simplify the task slightly to fit the conventions of word sense disambiguation. Overall, though, it is still a very inspiring work.

Study Plan

Papers you may want to read:

The core algorithm is based on this paper: Yael Karov and Shimon Edelman. 1998. Similarity-based word sense disambiguation. Comput. Linguist. 24, 1 (Mar. 1998), 41-59.
One important feature, SuperTag, is based on this paper: Srinivas Bangalore and Aravind K. Joshi. 1999. Supertagging: an approach to almost parsing. Comput. Linguist. 25, 2 (Jun. 1999), 237-265.
