Xxiong project abstract

From Cohen Courses
Revision as of 19:05, 29 September 2010 by PastStudents (talk | contribs)
Jump to navigationJump to search

Team Members

Xuehan Xiong [xxiong@andrew.cmu.edu]

Goal

1. A revisit of boosting. I will evaluate the proposed method via NER task.

2. Extend a stacked hierarchical model recently developed for computer vision tasks and apply it in the IE domain.


I will choose (1) and one of (2) and (3) as my final project.

Motivation

1. In the traditional boosting, within each iteration the mis-classified samples are weighted more in the next round. However, these errors are made from training data. In my algorithm, I will give more weight to the data that are mis-labeled from cross-validation process, as in stacking.

2. The intuition of stacked hierarchical model is that predictions from one level of the hierarchy should help to predict the entities in the level above or below. Besides using neighbors' predictions, parent or/and children predictions may also be "stacked" into one's feature vector. Different from LDA, this model can only be used in a supervised mode.


Dataset

Superpowers

Experience with CRF and stacking in the domain of computer vision.

What question you want to answer

1. I want to know whether the proposed algorithm will outperform the traditional Ada-boost.

2. I want to know whether the stacked hierarchical model will be more effective than hierarchical Bayesian models, such as LDA, in the applications of IE and whether it will improve the results upon the original stacking algorithm without hierarchy.