Submodular Field Grammars: Representation, Inference, and Application to Image Parsing
Friesen, Abram L., Domingos, Pedro M.
Neural Information Processing Systems
Natural scenes contain many layers of part-subpart structure, and distributions over them are thus naturally represented by stochastic image grammars, with one production per decomposition of a part. Unfortunately, in contrast to language grammars, where the number of possible split points for a production $A \rightarrow BC$ is linear in the length of $A$, in an image there are exponentially many ways to split a region into subregions. This makes parsing intractable and requires image grammars to be severely restricted in practice, for example by allowing only rectangular regions. In this paper, we address this problem by associating with each production a submodular Markov random field whose labels are the subparts and whose labeling segments the current object into these subparts. We call the result a submodular field grammar (SFG).
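To make the contrast between linear and exponential split counts concrete, the following minimal Python sketch counts the two-way splits of a string versus the two-label assignments of a pixel set. It also checks the standard submodularity condition for a binary pairwise term. The function names are hypothetical and the code is not taken from the paper. The region count ignores any connectivity requirement on subregions, and the energy form actually used by SFGs is not specified in this abstract.

```python
from typing import Sequence


def string_split_points(n: int) -> int:
    """Ways to split a length-n string into two non-empty contiguous parts."""
    return max(n - 1, 0)


def region_bipartitions(n: int) -> int:
    """Ways to label n pixels with two subparts B and C, both non-empty.

    Assumption: every labeling counts, with no connectivity constraint.
    """
    return 2 ** n - 2 if n >= 2 else 0


def is_submodular_pairwise(theta: Sequence[Sequence[float]]) -> bool:
    """Standard condition for a binary pairwise term theta[a][b]:
    theta(0,0) + theta(1,1) <= theta(0,1) + theta(1,0).
    """
    return theta[0][0] + theta[1][1] <= theta[0][1] + theta[1][0]


if __name__ == "__main__":
    for n in (4, 10, 20):
        print(n, string_split_points(n), region_bipartitions(n))
    # A Potts-style smoothness term (hypothetical costs) penalizes disagreement.
    potts = [[0.0, 1.0], [1.0, 0.0]]
    print(is_submodular_pairwise(potts))
```

Binary pairwise energies that satisfy this condition can be minimized exactly with a graph cut, which is the usual reason for restricting a random field to submodular terms.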
Feb-14-2020, 14:27:47 GMT