GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing

Yu, Tao, Wu, Chien-Sheng, Lin, Xi Victoria, Wang, Bailin, Tan, Yi Chern, Yang, Xinyi, Radev, Dragomir, Socher, Richard, Xiong, Caiming

Sep-29-2020–arXiv.org Artificial Intelligence

We present GraPPa, an effective pre-training approach for table semantic parsing that learns a compositional inductive bias in the joint representations of textual and tabular data. We construct synthetic question-SQL pairs over high-quality tables via a synchronous context-free grammar (SCFG) induced from existing text-to-SQL datasets. We pre-train our model on the synthetic data using a novel text-schema linking objective that predicts the syntactic role of a table field in the SQL for each question-SQL pair. To maintain the model's ability to represent real-world data, we also include masked language modeling (MLM) over several existing table-and-language datasets to regularize the pre-training process. On four popular fully supervised and weakly supervised table semantic parsing benchmarks, GraPPa significantly outperforms RoBERTa-large as the feature representation layers and establishes new state-of-the-art results on all of them.

artificial intelligence, natural language, rappa, (17 more...)

arXiv.org Artificial Intelligence

Sep-29-2020

arXiv.org PDF

Add feedback

Country:
- Asia (0.68)
- North America > United States (1.00)

Genre:
- Research Report (0.64)

Industry:
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis > Beverages (0.80)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Natural Language > Grammars & Parsing (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found