Bartlett et al., ACL-HLT 2008. Automatic Syllabification with Structured SVMs for Letter-to-Phoneme Conversion

From Cohen Courses
Jump to navigationJump to search

Citation

Susan Bartlett, Grzegorz Kondrak and Colin Cherry. 2008. Automatic syllabification with structured SVMs for letter-to-phoneme conversion. In Proceedings of ACL-08: HLT, 2008, pp. 568–576.

Online Version

Automatic syllabification with structured SVMs for letter-to-phoneme conversion.

Summary

This paper describes one of the first successful attempts at integrating automatic syllabification into a letter-to-phoneme conversion system using structured SVMs. The authors obtain substantial improvements in reducing automatic syllabification error rate (measured in WER) against the then state-of-the-art approach. The authors model the problem as an orthographic syllabification task as opposed to phonological syllabification. They treat it as a sequence tagging problem and define new tagging schemes. The method is applied to languages such as German and Dutch, in addition to English.

Method

Structured SVMs

Structured SVMs is a large-margin training method that is used for predicting structured outputs such as tag sequences. The method described in this paper uses structured SVMs that learn tag sequences from training data and perform structured output prediction by finding the highest scoring tag sequence using the Viterbi algorithm. Hence, the decoding problem in structured SVMs resembles that of an HMM.

Training structured SVMs is viewed as a multi-class classification problem. For a given training instance , a correct tag sequence is drawn from a set of possible tag sequences . Each input sequence has a feature space representation to represent a candidate output sequence .

The training process works in a manner that weights a correctly classified input sequence more than an incorrectly classified input sequence using a weight vector . Mathematically, the relationship can be expressed as:


The model then predicts the highest scoring sequence as follows,

\begin{center} Failed to parse (unknown function "\begin{center}"): {\displaystyle \begin{center} \underset{y \in Y_i}{\operatorname{argmax}} \left[ \Psi(x_i,y) \cdot w \right] \end{center} } \end{center}

The search for maximum approach is performed by the Viterbi algorithm.

Experiments and Results

Related Papers