A Clustering Approach for the Nearly Unsupervised Recognition of Nonliteral Language, EACL-2006

Citation

Birke, J. and A. Sarkar. 2006. A clustering approach for the nearly unsupervised recognition of nonliteral language. In Proceedings of EACL-06, pages 329–336.

Online Version

pdf link to the paper

Method Summary

TroFi (TropeFinder) System

Task: Classifying literal and nonliteral usages of verbs
Approach: Use nearly unsupervised word-sense disambiguation and clustering techniques

Processing Steps

KE Algorithm: Similarity-based word-sense disambiguation algorithm
- Similarities are calculated between:
  1. Sentences containing the word we wish to disambiguate (the target word)
  2. Collections of seed sentences (feedback sets)
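The comparison between a target sentence and the two feedback sets can be sketched as follows. This is a minimal illustration, not the KE algorithm itself: it swaps in plain bag-of-words cosine similarity, and the names `STOPWORDS`, `attraction`, and `classify`, as well as the example sentences, are hypothetical and introduced only for this sketch.

```python
import math
from collections import Counter

# Hypothetical stopword list, assumed for this sketch only.
STOPWORDS = {"the", "a", "of", "he", "she", "with", "and"}


def bag_of_words(sentence):
    """Lowercased, whitespace-tokenized word counts, without stopwords."""
    return Counter(w for w in sentence.lower().split() if w not in STOPWORDS)


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def attraction(target, feedback_set):
    """Highest similarity between the target sentence and any seed sentence."""
    target_vec = bag_of_words(target)
    return max((cosine(target_vec, bag_of_words(s)) for s in feedback_set),
               default=0.0)


def classify(target, literal_set, nonliteral_set):
    """Assign the target sentence to the feedback set that attracts it most."""
    lit = attraction(target, literal_set)
    nonlit = attraction(target, nonliteral_set)
    if lit == nonlit:
        return "undecided"
    return "literal" if lit > nonlit else "nonliteral"


# Made-up feedback sets for the target verb "grasp".
literal_set = ["he grasped the rope with both hands",
               "she grasped the handle of the door"]
nonliteral_set = ["he grasped the idea quickly",
                  "she finally grasped the meaning of the theory"]

print(classify("she grasped the meaning of the proof",
               literal_set, nonliteral_set))
```

Taking the maximum over seed sentences is one possible design choice; averaging over the feedback set would be an equally reasonable assumption for a sketch like this.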
Clean the Feedback Sets
- Goal: remove false attraction
- Four principles of scrubbing

Learning & Voting

Nearly Unsupervised Word-Sense Disambiguation

Clustering

1. Use sentential context instead of selectional constraint violations or paths in semantic hierarchies

TroFi also uses literal and nonliteral seed sets, acquired and cleaned without human supervision, to bootstrap learning.

We adapt a word-sense disambiguation algorithm to our task and augment it with multiple seed set learners, a voting schema, and additional features like SuperTags and extrasentential context.
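To illustrate how several seed set learners might be combined, here is a minimal sketch of a voting step. This summary does not specify the voting schema, so the sketch assumes simple majority voting with an "undecided" fallback on ties; the function name `majority_vote` and the label strings are hypothetical.

```python
from collections import Counter


def majority_vote(labels, fallback="undecided"):
    """Return the label chosen by most learners, or `fallback` on a tie.

    Assumption: each learner contributes exactly one label per sentence.
    """
    ranked = Counter(labels).most_common()
    if not ranked:
        return fallback
    # A tie between the two most frequent labels gives no clear winner.
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return fallback
    return ranked[0][0]


# Three hypothetical learners label the same target sentence.
votes = ["literal", "nonliteral", "nonliteral"]
print(majority_vote(votes))
```

An odd number of learners avoids ties between two labels, which is one reason a sketch like this pairs naturally with three learners.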

Detailed experiments on hand-annotated data show that our enhanced algorithm outperforms the baseline by 24.4%.

Using the TroFi algorithm, we also build the TroFi Example Base, an extensible resource of annotated literal/nonliteral examples which is freely available to the NLP research community.

How much time did you spend reading the old wikified paper?
How much time did you spend reading the summary of the old paper?
How much time did you spend reading background material?
Was there a study plan for the old paper?
1. If so, did you read any of the items suggested by the study plan?
2. How much time did you spend reading them?
Give us any additional feedback you might have about this assignment.
