Conditional set generation using Seq2seq models

Madaan, Aman, Rajagopal, Dheeraj, Tandon, Niket, Yang, Yiming, Bosselut, Antoine

Oct-24-2022–arXiv.org Artificial Intelligence

Conditional set generation learns a mapping from an input sequence of tokens to a set. Several NLP tasks, such as entity typing and dialogue emotion tagging, are instances of set generation. Seq2Seq models, a popular choice for set generation, treat a set as a sequence and do not fully leverage its key properties, namely order-invariance and cardinality. We propose a novel algorithm for effectively sampling informative orders over the combinatorial space of label orders. We jointly model the set cardinality and output by prepending the set size and taking advantage of the autoregressive factorization used by Seq2Seq models. Our method is a model-independent data augmentation approach that endows any Seq2Seq model with the signals of order-invariance and cardinality. Training a Seq2Seq model on this augmented data (without any additional annotations) gets an average relative improvement of 20% on four benchmark datasets across various models: BART, T5, and GPT-3. Code to use SETAUG available at: https://setgen.structgen.com.

cardinality, large language model, machine learning, (21 more...)

arXiv.org Artificial Intelligence

Oct-24-2022

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America
  - United States
    - Washington > King County
      - Seattle (0.04)
    - Pennsylvania > Allegheny County
      - Pittsburgh (0.14)
    - New Mexico > Los Alamos County
      - Los Alamos (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - California
      - Los Angeles County > Long Beach (0.14)
      - San Diego County > San Diego (0.04)
  - Puerto Rico > San Juan
    - San Juan (0.04)
  - Canada
    - Quebec > Montreal (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.14)
- Europe
  - Switzerland (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Italy > Tuscany
    - Florence (0.04)
- Asia
  - India (0.04)
  - China > Heilongjiang Province
    - Daqing (0.04)
- Africa > Ethiopia
  - Addis Ababa > Addis Ababa (0.04)

Genre:
- Research Report
  - Experimental Study (0.67)
  - New Finding (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Natural Language > Large Language Model (0.88)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found