Sum-Product Networks for Sequence Labeling
Ratajczak, Martin, Tschiatschek, Sebastian, Pernkopf, Franz
--We consider higher-order linear-chain conditional random fields (HO-LC-CRFs) for sequence modelling, and use sum-product networks (SPNs) for representing higher-order input-and output-dependent factors. SPNs are a recently introduced class of deep models for which exact and efficient inference can be performed. By combining HO-LC-CRFs with SPNs, expressive models over both the output labels and the hidden variables are instantiated while still enabling efficient exact inference. Furthermore, the use of higher-order factors allows us to capture relations of multiple input segments and multiple output labels as often present in real-world data. These relations can not be modeled by the commonly used first-order models and higher-order models with local factors including only a single output label. We demonstrate the effectiveness of our proposed models for sequence labeling. In extensive experiments, we outperform other state-of-the-art methods in optical character recognition and achieve competitive results in phone classification. For instance, they have been successfully used for speech recognition [3], optical character recognition and natural language processing [4]. Due to several advantages, LC-CRFs achieve better performance compared to their generative counterparts, i.e. hidden Markov models (HMMs) [3]. While LC-CRFs are normalized over the whole sequence, thereby counteracting the label bias problem, MEMMs are normalized locally .
Jul-6-2018
- Country:
- North America > United States
- Massachusetts (0.04)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Austria > Styria
- Graz (0.04)
- United Kingdom > England
- North America > United States
- Genre:
- Research Report (1.00)
- Technology: