A Distributional Approach to Controlled Text Generation
Khalifa, Muhammad, Elsahar, Hady, Dymetman, Marc
–arXiv.org Artificial Intelligence
We propose a Distributional Approach to address Controlled Text Generation from pre-trained Language Models (LMs). This view permits to define, in a single formal framework, "pointwise" and "distributional" constraints over the target LM -- to our knowledge, this is the first approach with such generality -- while minimizing KL divergence with the initial LM distribution. The optimal target distribution is then uniquely determined as an explicit EBM (Energy-Based Model) representation. From that optimal representation we then train the target controlled autoregressive LM through an adaptive distributional variant of Policy Gradient. We conduct a first set of experiments over pointwise constraints showing the advantages of our approach over a set of baselines, in terms of obtaining a controlled LM balancing constraint satisfaction with divergence from the initial LM (GPT-2). We then perform experiments over distributional constraints, a unique feature of our approach, demonstrating its potential as a remedy to the problem of Bias in Language Models. Through an ablation study we show the effectiveness of our adaptive technique for obtaining faster convergence.
arXiv.org Artificial Intelligence
Dec-21-2020
- Country:
- South America
- Oceania
- New Zealand (0.04)
- Australia
- Victoria > Melbourne (0.04)
- New South Wales > Sydney (0.04)
- North America
- United States
- Florida (0.04)
- New York (0.04)
- Virginia > Richmond (0.04)
- Ohio (0.04)
- New Jersey (0.04)
- Colorado (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Indiana > Marion County
- Indianapolis (0.04)
- Texas
- Travis County > Austin (0.04)
- Bexar County > San Antonio (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California
- Santa Clara County > Palo Alto (0.04)
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- Wisconsin > Milwaukee County
- Milwaukee (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada
- Saskatchewan (0.04)
- Quebec (0.04)
- Ontario > National Capital Region
- Ottawa (0.04)
- Manitoba > Winnipeg Metropolitan Region
- Winnipeg (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- United Kingdom (0.27)
- Czechia (0.05)
- Middle East > Cyprus (0.04)
- Austria (0.04)
- Bulgaria (0.04)
- Belgium (0.04)
- Croatia (0.04)
- Belarus (0.04)
- France (0.04)
- Spain (0.04)
- Greece (0.04)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Italy
- Netherlands > South Holland
- Dordrecht (0.04)
- Asia
- Brunei (0.04)
- Russia (0.04)
- India (0.04)
- Singapore (0.04)
- Bangladesh (0.04)
- North Korea (0.04)
- Macao (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- China
- Middle East
- Israel (0.04)
- Iraq (0.04)
- Iran (0.04)
- Syria (0.04)
- Republic of Türkiye > Aksaray Province
- Guzelyurt (0.04)
- Afghanistan > Kabul Province
- Kabul (0.04)
- Japan > Shikoku
- Tokushima Prefecture > Tokushima (0.04)
- Africa
- Middle East
- Somalia (0.04)
- Egypt > Cairo Governorate
- Cairo (0.04)
- Ethiopia > Addis Ababa
- Addis Ababa (0.04)
- Middle East
- Genre:
- Personal (0.92)
- Research Report (0.82)
- Instructional Material > Course Syllabus & Notes (0.45)
- Industry:
- Information Technology (1.00)
- Energy (1.00)
- Health & Medicine (1.00)
- Education (1.00)
- Consumer Products & Services > Restaurants (1.00)
- Banking & Finance > Economy (0.92)
- Retail (0.67)
- Law Enforcement & Public Safety
- Crime Prevention & Enforcement (1.00)
- Terrorism (0.67)
- Leisure & Entertainment
- Law
- Civil Rights & Constitutional Law (1.00)
- Government & the Courts (0.92)
- Government
- Military (1.00)
- Foreign Policy (1.00)
- Voting & Elections (0.92)
- Regional Government
- North America Government > United States Government (1.00)
- Europe Government (0.67)
- Asia Government > China Government (0.67)
- Media
- Television (1.00)
- Film (1.00)
- Technology: