A Distributional Approach to Controlled Text Generation

Khalifa, Muhammad, Elsahar, Hady, Dymetman, Marc

Dec-21-2020–arXiv.org Artificial Intelligence

We propose a Distributional Approach to address Controlled Text Generation from pre-trained Language Models (LMs). This view permits to define, in a single formal framework, "pointwise" and "distributional" constraints over the target LM -- to our knowledge, this is the first approach with such generality -- while minimizing KL divergence with the initial LM distribution. The optimal target distribution is then uniquely determined as an explicit EBM (Energy-Based Model) representation. From that optimal representation we then train the target controlled autoregressive LM through an adaptive distributional variant of Policy Gradient. We conduct a first set of experiments over pointwise constraints showing the advantages of our approach over a set of baselines, in terms of obtaining a controlled LM balancing constraint satisfaction with divergence from the initial LM (GPT-2). We then perform experiments over distributional constraints, a unique feature of our approach, demonstrating its potential as a remedy to the problem of Bias in Language Models. Through an ablation study we show the effectiveness of our adaptive technique for obtaining faster convergence.

constraint, iclr 2021, wikileaks, (14 more...)

arXiv.org Artificial Intelligence

Dec-21-2020

arXiv.org PDF

Add feedback

Country:
- South America
  - Colombia (0.04)
  - Chile (0.04)
  - Brazil (0.04)
  - Argentina (0.04)
- Oceania
  - New Zealand (0.04)
  - Australia
    - Victoria > Melbourne (0.04)
    - New South Wales > Sydney (0.04)
- North America
  - United States
    - Florida (0.04)
    - New York (0.04)
    - Virginia > Richmond (0.04)
    - Ohio (0.04)
    - New Jersey (0.04)
    - Colorado (0.04)
    - Michigan > Washtenaw County
      - Ann Arbor (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Indiana > Marion County
      - Indianapolis (0.04)
    - Texas
      - Travis County > Austin (0.04)
      - Bexar County > San Antonio (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Utah > Salt Lake County
      - Salt Lake City (0.04)
    - Illinois > Cook County
      - Chicago (0.04)
    - California
      - Santa Clara County > Palo Alto (0.04)
      - San Diego County > San Diego (0.04)
      - Los Angeles County > Long Beach (0.04)
    - Wisconsin > Milwaukee County
      - Milwaukee (0.04)
  - Puerto Rico > San Juan
    - San Juan (0.04)
  - Canada
    - Saskatchewan (0.04)
    - Quebec (0.04)
    - Ontario > National Capital Region
      - Ottawa (0.04)
    - Manitoba > Winnipeg Metropolitan Region
      - Winnipeg (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.04)
- Europe
  - United Kingdom (0.27)
  - Czechia (0.05)
  - Middle East > Cyprus (0.04)
  - Austria (0.04)
  - Bulgaria (0.04)
  - Belgium (0.04)
  - Croatia (0.04)
  - Belarus (0.04)
  - France (0.04)
  - Spain (0.04)
  - Greece (0.04)
  - Russia > Central Federal District
    - Moscow Oblast > Moscow (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - Italy
    - Veneto > Venice (0.04)
    - Tuscany > Florence (0.04)
  - Netherlands > South Holland
    - Dordrecht (0.04)
- Asia
  - Brunei (0.04)
  - Russia (0.04)
  - India (0.04)
  - Singapore (0.04)
  - Bangladesh (0.04)
  - North Korea (0.04)
  - Macao (0.04)
  - Taiwan > Taiwan Province
    - Taipei (0.04)
  - China
    - Hong Kong (0.04)
    - Beijing > Beijing (0.04)
    - Guangdong Province > Shenzhen (0.04)
    - Shanghai > Shanghai (0.04)
  - Middle East
    - Israel (0.04)
    - Iraq (0.04)
    - Iran (0.04)
    - Syria (0.04)
    - Republic of Türkiye > Aksaray Province
      - Guzelyurt (0.04)
  - Afghanistan > Kabul Province
    - Kabul (0.04)
  - Japan > Shikoku
    - Tokushima Prefecture > Tokushima (0.04)
- Africa
  - Middle East
    - Somalia (0.04)
    - Egypt > Cairo Governorate
      - Cairo (0.04)
  - Ethiopia > Addis Ababa
    - Addis Ababa (0.04)

Genre:
- Personal (0.92)
- Research Report (0.82)
- Instructional Material > Course Syllabus & Notes (0.45)

Industry:
- Information Technology (1.00)
- Energy (1.00)
- Health & Medicine (1.00)
- Education (1.00)
- Consumer Products & Services > Restaurants (1.00)
- Banking & Finance > Economy (0.92)
- Retail (0.67)
- Law Enforcement & Public Safety
  - Crime Prevention & Enforcement (1.00)
  - Terrorism (0.67)
- Leisure & Entertainment
  - Games (0.92)
  - Sports
    - Soccer (0.67)
    - Football (0.67)
- Law
  - Civil Rights & Constitutional Law (1.00)
  - Government & the Courts (0.92)
- Government
  - Military (1.00)
  - Foreign Policy (1.00)
  - Voting & Elections (0.92)
  - Regional Government
    - North America Government > United States Government (1.00)
    - Europe Government (0.67)
    - Asia Government > China Government (0.67)
- Media
  - Television (1.00)
  - Film (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Machine Translation (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.88)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found