High Recall Data-to-text Generation with Progressive Edit
Kim, Choonghan, Lee, Gary Geunbae
–arXiv.org Artificial Intelligence
Data-to-text (D2T) generation is the task of generating texts from structured inputs. We observed that when the same target sentence was repeated twice, Transformer (T5) based model generates an output made up of asymmetric sentences from structured inputs. In other words, these sentences were different in length and quality. We call this phenomenon "Asymmetric Generation" and we exploit this in D2T generation. Once asymmetric sentences are generated, we add the first part of the output with a no-repeated-target. As this goes through progressive edit (ProEdit), the recall increases. Hence, this method better covers structured inputs Figure 1: An example of generating asymmetric sentences.
arXiv.org Artificial Intelligence
Aug-9-2022
- Country:
- Asia
- Indonesia > West Nusa Tenggara
- Mataram (0.05)
- South Korea > Gyeongsangbuk-do
- Pohang (0.05)
- Indonesia > West Nusa Tenggara
- North America > United States
- Missouri (0.05)
- Asia
- Genre:
- Research Report (0.40)
- Technology: