From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation

Malik, Ali, Mayhew, Stephen, Piech, Chris, Bicknell, Klinton

Jun-5-2024–arXiv.org Artificial Intelligence

We study the problem of controlling the difficulty level of text generated by Large Language Models (LLMs) for contexts where end-users are not fully proficient, such as language learners. Using a novel framework, we evaluate the effectiveness of several key approaches for this task, including few-shot prompting, supervised finetuning, and reinforcement learning (RL), utilising both GPT-4 and open-source alternatives like Llama2-7B and Mistral-7B. Our findings reveal a large performance gap between GPT-4 and the open-source models when using prompt-based strategies. However, we show how to bridge this gap with a careful combination of finetuning and RL alignment.

controlerror, dataset, proficiency level

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America
  - Dominican Republic (0.04)
  - United States
    - Michigan (0.04)
    - Maryland > Baltimore (0.04)
    - Washington > King County
      - Seattle (0.04)
    - Pennsylvania > Allegheny County
      - Pittsburgh (0.04)
    - New Mexico > Santa Fe County
      - Santa Fe (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - California
      - Santa Clara County > Stanford (0.04)
      - San Diego County > San Diego (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
- Asia
  - Singapore (0.04)
  - Japan (0.04)
  - China > Hong Kong (0.04)
  - Thailand > Phuket
    - Phuket (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
  - Middle East > UAE
    - Abu Dhabi Emirate > Abu Dhabi (0.04)

Genre:
- Research Report > New Finding (0.66)

Industry:
- Health & Medicine (1.00)
- Education > Curriculum
  - Subject-Specific Education (0.65)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
