Models of metaphor in NLP
Contents
Citation
E. Shutova. 2010. Models of Metaphor in NLP. In Proceedings of ACL 2010, Uppsala, Sweden.
Online version
Introduction
This is a review paper of modeling metaphors in NLP. The author devised it into two main tasks: "metaphor recognition" and "metaphor interpretation".
Metaphor Recognition
Met* System (Fass, 1991)
- First attempt to identify and interpret metaphorical expression
- Using selectional preference and hand-coded knowledge base
- 3-Stage Approaches
- Detect selectional preference violation
- If find violations, tested for being a metonymic relation using hand-coded patterns
- If not metonymy, search the knowledge base for a relevant analogy in order to discriminate metaphorical relations
- Problem
- Detects any kind of non-literalness in language (metaphors, metonymies and others), and not only metaphors
- Fail to detect high conventionality of metaphor
Goatly (1997)
- Identify a set of linguistic cues indicate metaphor
- metaphorically speaking, utterly, completely, so to speak and, surprisingly, literally.
Peters & Peters (2000)
- Detect figurative language in the WordNet
- Search for systematic polysemy, which allows to capture metonymic and metaphorical relations
CorMet System (Mason, 2004)
- The first attempt for source-target domain mapping
- A corpus-based approach to find systematic variations in domain-specific selectional preferences
- Take Master Metaphor List (Lakoff et al., 1991) as baseline, and achieve an accuracy of 77% (judged by human)
TroFi System (Birke & Sarkar, 2006)
- Sentence clustering approach for non-literal language recognition
- Inspired by a similarity-based word sense disambiguation method
- Approach
- Use a set of seed sentences (human annotated)
- Compute similarity between (1) the sentence containing the word to be disambiguated and (2) all of the seed sentences
- Select the sense corresponding to the annotation in the most similar seed sentences
- F1-score = 0.538. But the task is not clearly defined.
Gedigan et al. (2006)
- A survey-like work about literal and metaphorical uses of verbs
- Method
- Extract the lexical items whose frames are related to MOTION and CURE from FrameNet (Fillmore et al., 2003).
- Search the PropBank Wall Street Journal corpus (Kingsbury and Palmer, 2002) for sentences containing such lexical items and annotated them with respect to metaphoricity (by hand).
- Used PropBank annotation as features to train the maximum entropy classifier
- Accuracy = 95.12%
- Very close to majority baseline (92.90%) => 92.00% of the verbs of MOTION and CURE in the Wall Street Journal corpus are used metaphorically.
Krishnakumaran & Zhu (2007)
- Deal with not only verbs, but also nouns and adjectives.
- Use hyponymy relation in WordNet and word bigram counts to predict metaphors at sentence level.
- Approach
- Noun-Noun: Given an IS-A metaphor, verify if the two nouns involved are in hyponymy relation in WordNet, if not, tag as a metaphor
- Verb-Noun and Adjective-Noun: Calculate bigram probabilities of verb-noun and adj-noun pairs, if the pair has low frequency, tag as a metaphor
- Fail to capture conventionalized metaphors (loose information compared of parsed text)
Metaphor Interpretation
MIDAS System (Martin, 1990)
- A Metaphor Interpretation, Denotation and Acquisition System (MIDAS)
- Capture hierarchical organization of conventional metaphors
- Idea
- More specific conventional metaphors descend from the general ones
- Approach (modeling how novel metaphors are acquired)
- Given a metaphor example, search for a corresponding metaphor that would explain the anomaly.
- If not find: abstract it from the example to more general concepts and repeats search.
- If find a suitable general metaphor: create a mapping for its descendant (a more specific metaphor) based on this example.
- MIDAS has been integrated with the Unix Consultant (UC), the system that answers questions about Unix.
KARMA System (Narayanan, 1997), ATT-Meta (Barnden and Lee, 2002)
- Performing inferences about entities and events in the source and target domains for metaphor interpretation.
- Metaphor-based reasoning framework
- Approach
- Reasoning process relies on manually coded knowledge of the source domain.
- The results are then projected onto the target domain (using the conceptual mapping representation).
- Problems
- ATT-Meta concerns metaphorical and metonymic description of mental states and reasoning about mental states using first order logic. But they can not take natural language sentences as input.
- KARMA doesn't have this problem, it takes parsed text as input.
Talking Points (Veale and Hao, 2008)
- Fluid knowledge representation for metaphor interpretation and generation
- Approach
- From WordNet and from the web, extract a set of characteristics of (1) concepts belonging to source and target domains and (2) related facts about the world.
- Talking Points are then organized in Slipnet, a framework that allows for a number of insertions, deletions and substitutions in definitions of such characteristics in order to establish a connection between the target and the source concepts.
- Problem
Not evaluate to which extent their knowledge base are useful.
Shutova (2010)
- Defines metaphor interpretation as a paraphrasing task
- Derive literal paraphrases for metaphorical expression
- Approach
- Given a context, apply a probabilistic model to rank all possible paraphrases for the metaphorical expression
- Use automatically induced selectional preferences to discriminate between figurative and literal paraphrases
- Tested on metaphors expressed by a verb, and achieve an accuracy of 0.81.
Metaphor Resources
- Metaphor is a knowledge-hungry phenomenon.
- Two type of resources are needed
- Extensive manually-created knowledge-base (easier)
- Robust knowledge acquisition system for interpretation of metaphorical expressions (hard task)
The latter being a hard task, a great deal of metaphor research resorted to the first option.
Although hand-coded knowledge proved useful for metaphor interpretation (Fass, 1991; Martin, 1990), it should be noted that the systems utilizing it have a very limited coverage.
Master Metaphor List (Lakoff et al., 1991)
It includes a classification of metaphorical mappings (mainly those related to mind, feelings and emotions) with the corresponding examples of language use. This resource has been criticized for the lack of clear structuring principles of the mapping ontology (L¨onneker-Rodman, 2008).
The taxonomical levels are often confused, and the same classes are referred to by different class labels. This fact and the chosen data representation in the Master Metaphor List make it not suitable for computational use. However, both the idea of the list and its actual mappings ontology inspired the creation of other metaphor resources.
MetaBank (Martin, 1994) & Mental Metaphor Databank
- Created in the framework of the ATT-meta project
The MetaBank is a knowledge-base of English metaphorical conventions, represented in the form of metaphor maps (Martin, 1988) containing detailed information about source-target concept mappings backed by empirical evidence. The ATT-meta project databank contains a large number of examples of metaphors of mind classified by source–target domain mappings taken from the Master Metaphor List.
Along with this it is worth mentioning metaphor resources in languages other than English. There has been a wealth of research on metaphor in Spanish, Chinese, Russian, German, French and Italian. The Hamburg Metaphor Database (L¨onneker, 2004; Reining and L¨onneker-Rodman, 2007) contains examples of metaphorical expressions in German and French, which are mapped to senses from EuroWordNet5 and annotated with source–target domain mappings taken from the Master Metaphor List.
Alonge and Castelli (2003) discuss how metaphors can be represented in ItalWordNet for Italian and motivate this by linguistic evidence.
Encoding metaphorical information in generaldomain lexical resources for English, e.g. Word- Net (L¨onneker and Eilts, 2004), would undoubtedly provide a new platform for experiments and enable researchers to directly compare their results.
Metaphor Annotation in Corpora
To reflect two distinct aspects of the phenomenon, metaphor annotation can be split into two stages: identifying metaphorical senses in text (akin word sense disambiguation) and annotating source – target domain mappings underlying the production of metaphorical expressions. Traditional approaches to metaphor annotation include manual search for lexical items used metaphorically (Pragglejaz Group, 2007), for source and target domain vocabulary (Deignan, 2006; Koivisto-Alanko and Tissari, 2006; Martin, 2006) or for linguistic markers of metaphor (Goatly, 1997). Although there is a consensus in the research community that the phenomenon of metaphor is not restricted to similarity-based extensions of meanings of isolated words, but rather involves reconceptualization of a whole area of experience in terms of another, there still has been surprisingly little interest in annotation of cross-domain mappings. However, a corpus annotated for conceptual mappings could provide a new starting point for both linguistic and cognitive experiments.
Metaphor & Polysemy
Metaphor Identification
Pragglejaz Procedure
Source - Target Domain Vocabulary
Annotating Source and target Domains
Related papers
The widely cited Pang et al EMNLP 2002 paper was influenced by this paper - but considers supervised learning techniques. The choice of movie reviews as the domain was suggested by the (relatively) poor performance of Turney's method on movies.
An interesting follow-up paper is Turney and Littman, TOIS 2003 which focuses on evaluation of the technique of using PMI for predicting the semantic orientation of words.