Conditional Random Fields

This is a method discussed in Information Extraction 10-707 in Fall 2010.

A conditional random field (CRF) is a type of discriminative probabilistic model most often used for labeling or parsing sequential data, such as natural language text or biological sequences. It builds on the maximum entropy model and the hidden Markov model, and it often outperforms them, in particular on tasks such as shallow parsing, named entity recognition, and visual object recognition.

Introduction

Linear-chain Conditional Random Fields

One commonly used variant is the linear-chain conditional random field. Such a CRF defines a conditional probability distribution p(Y|X) over label sequences Y given an input sequence X. The label and input sequences are assumed to have the same length.
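To make the idea of a conditional distribution over label sequences concrete, here is a minimal sketch. It assumes the usual log-linear form, in which a label sequence is scored by summing per-position scores and scores for adjacent label pairs, and the score is exponentiated and normalized over every label sequence of the same length. The names `sequence_score`, `conditional_prob`, `EMIT`, and `TRANS`, and the toy weights, are hypothetical and chosen only for illustration. The normalizer is computed by brute-force enumeration, which is only feasible for very short sequences.

```python
import itertools
import math


def sequence_score(x, y, emit, trans):
    """Unnormalized log-score of label sequence y for input sequence x."""
    assert len(x) == len(y)  # label and input sequences have the same length
    # Per-position scores for (observation, label) pairs; missing pairs score 0.
    score = sum(emit.get((xi, yi), 0.0) for xi, yi in zip(x, y))
    # Scores for adjacent label pairs (the "linear chain").
    score += sum(trans.get((a, b), 0.0) for a, b in zip(y, y[1:]))
    return score


def conditional_prob(x, y, labels, emit, trans):
    """p(y | x), normalized over all label sequences of length len(x)."""
    z = sum(
        math.exp(sequence_score(x, cand, emit, trans))
        for cand in itertools.product(labels, repeat=len(x))
    )
    return math.exp(sequence_score(x, tuple(y), emit, trans)) / z


# Hypothetical toy weights, for illustration only.
LABELS = ("N", "V")
EMIT = {("dogs", "N"): 2.0, ("bark", "V"): 1.5}
TRANS = {("N", "V"): 1.0}

x = ("dogs", "bark")
for y in itertools.product(LABELS, repeat=len(x)):
    print(y, conditional_prob(x, y, LABELS, EMIT, TRANS))
```

Because the normalizer depends on X, the probabilities of all label sequences for a given input sum to one. Practical implementations typically replace the enumeration with dynamic programming over the chain.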

A CRF on (X, Y) is specified by a local feature vector and a weight vector. The local features are defined as follows: