DiPT: Enhancing LLM reasoning through diversified perspective-taking

Hoang Anh Just, Mahavir Dabas, Lifu Huang, Ming Jin, Ruoxi Jia

arXiv.org Artificial Intelligence 

Correct reasoning steps are important for language models to achieve high performance on many tasks, such as commonsense reasoning, question answering, and mathematical problem-solving [Wei et al., 2022, Kojima et al., 2022, Suzgun et al., 2022]. One way to elicit reasoning is through the chain-of-thought (CoT) method [Wei et al., 2022, Kojima et al., 2022], which asks the model to reason step by step. Another approach encourages the model to generate problems similar to the query [Yasunaga et al., 2024], indirectly compelling it to first understand the original query. Similarly, repeating and rephrasing the query [Deng et al., 2023, Mekala et al., 2023] requires the model to first understand the problem and then restate it in its own words, which can help simplify the problem. Additionally, reasoning can be elicited indirectly by providing reasoning examples as demonstrations, referred to as in-context learning (ICL) [Brown et al., 2020, Min et al., 2022, Xie et al., 2021]. While these methods have demonstrated significant performance improvements, language models remain prone to errors caused by incorrect context understanding or flawed analytical steps. Furthermore, they are unstable when requests are paraphrased. This instability is particularly concerning in the context of adversarial prompts, where recent research [Zou et al., 2023, Zeng et al., 2024] has shown that adversaries can intentionally rewrite prompts to coax safety-aligned language models into generating objectionable content that they would not produce otherwise.
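To make the contrast between these prompting strategies concrete, the following is a minimal sketch of how CoT, rephrase-and-respond, and few-shot ICL prompts might be assembled. The function names and the exact template wording are illustrative assumptions, not taken from the paper or from any specific library.

```python
# Illustrative prompt builders (names and template wording are assumptions,
# not from the paper); each returns a string to send to any chat-style LLM.

def cot_prompt(question: str) -> str:
    # Zero-shot chain-of-thought: append a cue asking for step-by-step reasoning.
    return f"{question}\nLet's think step by step."

def rephrase_and_respond_prompt(question: str) -> str:
    # Rephrase-and-respond: ask the model to restate the problem in its own
    # words before answering, forcing it to first interpret the query.
    return (
        f"{question}\n"
        "First, rephrase the question in your own words. "
        "Then answer the rephrased question."
    )

def icl_prompt(question: str, demonstrations: list[tuple[str, str]]) -> str:
    # In-context learning: prepend worked (question, reasoning-and-answer) pairs
    # so the model imitates the demonstrated reasoning format.
    demo_text = "\n\n".join(f"Q: {q}\nA: {a}" for q, a in demonstrations)
    return f"{demo_text}\n\nQ: {question}\nA:"

if __name__ == "__main__":
    q = "A train travels 60 km in 1.5 hours. What is its average speed?"
    demos = [("What is 12 * 4?", "12 * 4 = 48. The answer is 48.")]
    print(cot_prompt(q))
    print(rephrase_and_respond_prompt(q))
    print(icl_prompt(q, demos))
```

All three builders only change how the query is framed; the paper's concern is that such framing can be sensitive to paraphrasing, including adversarial rewrites.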
