Generating Semantic Graph Corpora with Graph Expansion Grammar
Andersson, Eric, Björklund, Johanna, Drewes, Frank, Jonsson, Anna
arXiv.org Artificial Intelligence
We introduce Lovelace, a tool for creating corpora of semantic graphs. The system uses graph expansion grammar as its representational language, allowing users to craft a grammar that describes a corpus with desired properties. Given such a grammar as input, the system generates a graph bank, i.e., a set of output graphs that are well-formed according to the grammar. The generation process can be controlled via a number of configurable parameters that allow the user to, for example, specify a range of desired output graph sizes. Central use cases are the creation of synthetic data to augment existing corpora and use as a pedagogical tool for teaching formal language theory.
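To make the grammar-in, graph-bank-out workflow concrete, here is a minimal Python sketch of the general idea. It is not Lovelace's interface and does not implement graph expansion grammar: the rule format, the names `RULES`, `REUSABLE`, `expand`, and `generate_corpus`, and the rejection-sampling strategy are all assumptions made for illustration. Each toy rule adds one labeled node plus labeled edges to its children, and a child may occasionally point back to an existing node, so the outputs are graphs rather than trees.

```python
import random

# Hypothetical toy rules (NOT Lovelace syntax): each nonterminal maps to a list
# of alternatives of the form (node_label, [(edge_label, child_nonterminal), ...]).
RULES = {
    "S": [("want", [("ARG0", "E"), ("ARG1", "S")]),
          ("sleep", [("ARG0", "E")])],
    "E": [("girl", []), ("boy", [])],
}
# Nonterminals whose existing nodes may be shared (assumption for this sketch).
REUSABLE = {"E"}


class SizeLimitExceeded(Exception):
    """Raised when a derivation would grow past the allowed node count."""


def expand(nonterminal, graph, rng, max_nodes):
    """Expand one nonterminal into a new node and return that node's id."""
    if len(graph["nodes"]) >= max_nodes:
        raise SizeLimitExceeded
    label, children = rng.choice(RULES[nonterminal])
    node_id = len(graph["nodes"])
    graph["nodes"].append((nonterminal, label))
    for edge_label, child_nt in children:
        # Existing nodes that were derived from the same nonterminal.
        candidates = [i for i, (origin, _) in enumerate(graph["nodes"])
                      if origin == child_nt]
        if child_nt in REUSABLE and candidates and rng.random() < 0.5:
            child_id = rng.choice(candidates)  # share a node (reentrancy)
        else:
            child_id = expand(child_nt, graph, rng, max_nodes)
        graph["edges"].append((node_id, edge_label, child_id))
    return node_id


def generate_corpus(n_graphs, min_nodes, max_nodes, seed=0, max_attempts=10_000):
    """Collect n_graphs graphs whose node count lies in [min_nodes, max_nodes]."""
    rng = random.Random(seed)
    corpus = []
    for _ in range(max_attempts):
        if len(corpus) == n_graphs:
            break
        graph = {"nodes": [], "edges": []}
        try:
            expand("S", graph, rng, max_nodes)
        except SizeLimitExceeded:
            continue  # derivation grew too large, discard it
        if len(graph["nodes"]) >= min_nodes:
            corpus.append(graph)
    if len(corpus) < n_graphs:
        raise RuntimeError("could not generate enough graphs in the size range")
    return corpus


if __name__ == "__main__":
    for g in generate_corpus(n_graphs=3, min_nodes=3, max_nodes=8):
        print([label for _, label in g["nodes"]], g["edges"])
```

The size range is enforced here by discarding derivations that fall outside it, which is the simplest option for a sketch; how Lovelace itself applies its size parameters is not stated in the abstract.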
Sep-15-2023
- Country:
- North America
- Canada > Quebec (0.14)
- United States > California (0.14)
- Genre:
- Research Report (0.50)
- Technology: