Difference between revisions of "Ling, X. and Weld, D. Temporal Information Extraction. AAAI-10"

From Cohen Courses
Jump to navigationJump to search
Line 42: Line 42:
 
Performance of '''TIE''' is measured using precision that measures the number of correct predictions over all predictions made in test data and ''temporal entropy'' (TE) that measures the number and "tightness" of the induced temporal bounds. A human or a system can give time bounds to the starting and end points of the element in the text. Temporal entropy of a time point is thus measured as the log of the length of the tightest bound produced for the time point. The lower the entropy, the better (tighter) is the bound.  
 
Performance of '''TIE''' is measured using precision that measures the number of correct predictions over all predictions made in test data and ''temporal entropy'' (TE) that measures the number and "tightness" of the induced temporal bounds. A human or a system can give time bounds to the starting and end points of the element in the text. Temporal entropy of a time point is thus measured as the log of the length of the tightest bound produced for the time point. The lower the entropy, the better (tighter) is the bound.  
  
The precision and TE of '''TIE''' are compared to three other systems: (1) [http://aclweb.org/anthology/I/I08/I08-1054.pdf PASCA], a pattern-based extraction system for question answering, (2)
+
The precision and TE of '''TIE''' are compared to three other systems: (1) [http://aclweb.org/anthology/I/I08/I08-1054.pdf PASCA], a pattern-based extraction system for question answering, (2) [http://acl.ldc.upenn.edu/W/W05/W05-06.pdf#page=195 SRL], (3) [http://timeml.org/site/tarsqi/index.html TARSQI]. In terms of precision, '''TIE''' is able to extract more correct constraints at comparable precision to PASCA and SRL. In terms of TE, '''TIE''' has temporal entropy that is the closest to a human's skill. From the experiments, it is also observed that SRL feature improves the precision of '''TIE''' while transitivity feature improves the recall of the system. On [http://www.timeml.org/tempeval/ TempEval] task, '''TIE''' is able to achieve an accuracy of 0.695 which is comparable to that of the current state of the art, 0.716 ([[RelatedPaper::Yoshikawa 2009 jointly identifying temporal relations with markov logic |Yoshikawa et al. 2009]])
  
 
== Discussion ==
 
== Discussion ==

Revision as of 00:24, 3 November 2011

Reviews of this paper

Citation

Temporal Information Extraction, by X. Ling, D.S. Weld. In Proceedings of the Twenty Fifth National Conference on Artificial Intelligence, 2010.

Online version

This paper is available online [1].

Summary

This paper addresses the problem of Temporal ordering. Unlike other works that focus on time-agnostic extraction of facts, this paper presents TIE, a pipeline system that tries to extract as many facts from text while also inducing as much temporal information as possible. Temporal relations between events and times in a sentence are identified while transitivity is enforced to bind the start and ending time points for each event. The paper also presents temporal entropy as a way to evaluate recall of temporal information extraction systems.

Brief description of the method

The objective of this paper is given a sentence, output (1) a maximal set of temporal elements that are either events (e.g. "officers were dispatched") or temporal references (e.g. "1999") and (2) the tightest set of temporal constraints as directly implied by the text.

The constraints are linear inequalities of the form where and denotes either the beginning () or ending time point () of a temporal element .

For example, the sentence 'Steve Jobs revealed the iPhone in 2007' might produce these constraints:


The paper introduces a pipeline system called TIE with these components:

  • Preprocessing: (1) parse the sentence using a syntactic parser (the Stanford parser), (2) detect semantic roles for verbs in the sentence using SRL, (3) use TARSQI to find temporal elements (events and times) in the sentence, (4) generate features for each temporal element as well as between elements.
The features include descriptional features (tense, grounded time values, etc) generated for each element by TARSQI and syntactic features (dependency features and SRL features).
  • Classification: use probabilistic model trained on TimeBank Corpus combined with transitivity rules to classify each pair of points ( or ) of elements the point-wise relation.
Markov Logic Networks is used to make predictions. Formula weights are learned from the TimeBank data. Some of the formula templates are as follows:
where the first formula is transitivity and the second formula integrates the temporal information provided by SRL.

Experimental Result

The experiment is conducted on sentences extracted from Wikipedia articles. The ground truth constructed by asking human subjects to label the temporal relations between all pairs of elements by comparing their start and ending points. Each pair is labeled by at least two people with a third person to resolve any disagreement.

Performance of TIE is measured using precision that measures the number of correct predictions over all predictions made in test data and temporal entropy (TE) that measures the number and "tightness" of the induced temporal bounds. A human or a system can give time bounds to the starting and end points of the element in the text. Temporal entropy of a time point is thus measured as the log of the length of the tightest bound produced for the time point. The lower the entropy, the better (tighter) is the bound.

The precision and TE of TIE are compared to three other systems: (1) PASCA, a pattern-based extraction system for question answering, (2) SRL, (3) TARSQI. In terms of precision, TIE is able to extract more correct constraints at comparable precision to PASCA and SRL. In terms of TE, TIE has temporal entropy that is the closest to a human's skill. From the experiments, it is also observed that SRL feature improves the precision of TIE while transitivity feature improves the recall of the system. On TempEval task, TIE is able to achieve an accuracy of 0.695 which is comparable to that of the current state of the art, 0.716 (Yoshikawa et al. 2009)

Discussion

This paper presents an interesting take on the problem of temporal information extraction by finding dynamic events that are not predefined by any schema, rather is a result of entities that co-bursting together in a time period. However, a reader cannot help but question if the paper's assumption is valid that entities which co-burst together in time indeed have a relationship. In the case of celebrities for example, as they mention in the paper, some celebrities appear to be co-occurring a lot together in documents even though they are not involved in any relationships. Buzzy entities (entities that appear a lot in documents no matter when) may also cause an issue because they will appear to co-burst with many other unrelated entities. A better evaluation of the approach may be necessary. The fact that TRJ, which produces less than useful events, results in higher precision and recall in their entity-based evaluation also highlights this need to have a better evaluation approach and measure.

Another drawback to this paper is the lack of standardized data set for training and testing the approach. The choice of entities to list and events to sample seem arbitrary: take a random 30 entities and a random 35 entities (why 30 and 35?). The novelty of the task that this paper is addressing may cause finding standard data set difficult. In future a better and more standardized data set could be created to better train, test, and compare approaches in this area.

Related papers

Due to the novelty of the problem that this paper tries to address, there are still few works that are closely related to it. Perhaps the closest work is by Q. Zhao, P. Mitra, and B. Chen. Temporal and information flow based event detection from social text streams. In AAAI, 2007 where events are detected using keywords and a variety of signals including temporal and social constraints.

Although the paper is novel in terms of its take on temporal information extraction, the idea of using co-occurrence to identify relationships among entities are widespread in other applications. This idea of using distributional clustering has been used in Banko_2007_Open_Information_Extraction_from_the_Web to mine co-occurring entities from large collection of documents while being agnostic to the real world relations causing the co-occurrence. In this paper, instead of document co-occurrence, temporal co-occurrence is used to mine related entities.

Other related work that uses distributional clustering to discover structure in documents is Chambers, N. and Jurafsky, D. Template-based information extraction without the templates, ACL 2011. Although the relationship is not immediately clear (due to the non-temporal nature of the paper), this paper uses very similar ideas of co-occurrence and clustering to discover event templates in documents. It may be interesting to draw the similarities and differences between the two papers.