AITopics | Problem Solving

Boosting Language Models Reasoning with Chain-of-Knowledge Prompting

Wang, Jianing, Sun, Qiushi, Chen, Nuo, Li, Xiang, Gao, Ming

arXiv.org Artificial Intelligence, Jul-20-2023

Recently, Chain-of-Thought (CoT) prompting has delivered success on complex reasoning tasks. It uses a simple prompt such as "Let's think step by step", or multiple in-context exemplars with well-designed rationales, to elicit intermediate reasoning steps from Large Language Models (LLMs). However, the generated rationales often contain mistakes, producing reasoning chains that are neither factual nor faithful. To mitigate this brittleness, we propose a novel Chain-of-Knowledge (CoK) prompting, where we aim to elicit LLMs to generate explicit pieces of knowledge evidence in the form of structured triples. This is inspired by human behavior, i.e., we can draw a mind map or knowledge map as the reasoning evidence in the brain before answering a complex question. Building on CoK, we additionally introduce an F^2-Verification method to estimate the reliability of the reasoning chains in terms of factuality and faithfulness. For an unreliable response, the wrong evidence can be indicated to prompt the LLM to rethink. Extensive experiments demonstrate that our method can further improve performance on commonsense, factual, symbolic, and arithmetic reasoning tasks.
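The abstract does not spell out the prompt format or the verification procedure, so the following is only a minimal sketch of the general idea: evidence is written as (subject, relation, object) triples, and each triple can be checked against a reference knowledge base so that the wrong ones can be pointed out. The function names, the prompt layout, and the toy knowledge base are all assumptions made for illustration, and the check covers only a crude notion of factuality, not faithfulness.

```python
from typing import List, Set, Tuple

# A piece of knowledge evidence: (subject, relation, object).
Triple = Tuple[str, str, str]


def format_evidence(triples: List[Triple]) -> str:
    """Render triples as a numbered list, one triple per line."""
    return "\n".join(
        f"{i}. ({s}, {r}, {o})" for i, (s, r, o) in enumerate(triples, start=1)
    )


def build_prompt(question: str, triples: List[Triple]) -> str:
    """Assemble a hypothetical exemplar: question, evidence triples, answer slot."""
    return (
        f"Question: {question}\n"
        f"Evidence triples:\n{format_evidence(triples)}\n"
        "Answer:"
    )


def factuality_score(triples: List[Triple], kb: Set[Triple]) -> float:
    """Fraction of triples found in the reference knowledge base (toy check)."""
    if not triples:
        return 0.0
    return sum(t in kb for t in triples) / len(triples)


def unreliable_triples(triples: List[Triple], kb: Set[Triple]) -> List[Triple]:
    """Triples not supported by the knowledge base, to be fed back for a retry."""
    return [t for t in triples if t not in kb]


# Hypothetical knowledge base and a model-generated chain with one wrong triple.
KB: Set[Triple] = {
    ("Paris", "capital of", "France"),
    ("France", "located in", "Europe"),
}
CHAIN: List[Triple] = [
    ("Paris", "capital of", "France"),
    ("Paris", "located in", "Asia"),
]
```

Under these assumptions, `unreliable_triples(CHAIN, KB)` returns the one unsupported triple, which could then be quoted back to the model in a follow-up prompt.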

explanation hint, large language model, machine learning, (18 more...)

2306.06427

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Pennsylvania (0.05)
Europe > United Kingdom (0.05)
(17 more...)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Sports (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

A Model to Support Collective Reasoning: Formalization, Analysis and Computational Assessment

Ganzer, Jordi (King's College London) | Criado, Natalia (King's College London) | Lopez-Sanchez, Maite (University of Barcelona) | Parsons, Simon (University of Lincoln) | Rodriguez-Aguilar, Juan A. (Institut d'Investigació en Intel·ligència Artificial (IIIA-CSIC))

Journal of Artificial Intelligence Research, Jul-19-2023

In this paper we propose a new model to represent human debates and methods to obtain collective conclusions from them. This model overcomes two drawbacks of existing approaches. First, our model does not assume that participants agree on the structure of the debate. It does this by allowing participants to express their opinion about all aspects of the debate. Second, our model does not assume that participants' opinions are rational, an assumption that significantly limits current approaches. Instead, we define a weaker notion of rationality that characterises coherent opinions, and we consider different scenarios based on the coherence of individual opinions and the level of consensus. We provide a formal analysis of different opinion aggregation functions that compute a collective decision based on the individual opinions and the debate structure. In particular, we demonstrate that aggregated opinions can be coherent even if there is a lack of consensus and individual opinions are not coherent. We conclude with an empirical evaluation demonstrating that collective opinions can be computed efficiently for real-sized debates.
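The abstract does not define the aggregation functions it analyses, so the sketch below only illustrates what an opinion aggregation function looks like in general: it maps a list of individual opinions to one collective opinion. The label set (`accept`, `reject`, `undecided`) and the per-argument majority rule are assumptions chosen for illustration and are not taken from the paper.

```python
from collections import Counter
from typing import Dict, List

# An opinion maps an argument identifier to a label.
# The three labels used here are a hypothetical choice.
Opinion = Dict[str, str]


def majority_aggregate(opinions: List[Opinion], arguments: List[str]) -> Opinion:
    """Per-argument majority vote; ties and missing votes yield 'undecided'."""
    collective: Opinion = {}
    for arg in arguments:
        votes = Counter(op.get(arg, "undecided") for op in opinions)
        accept, reject = votes["accept"], votes["reject"]
        if accept > reject:
            collective[arg] = "accept"
        elif reject > accept:
            collective[arg] = "reject"
        else:
            collective[arg] = "undecided"
    return collective


# Three participants, two arguments; the second participant skips argument "b".
OPINIONS: List[Opinion] = [
    {"a": "accept", "b": "reject"},
    {"a": "accept"},
    {"a": "reject", "b": "accept"},
]
```

With these inputs, argument `a` is collectively accepted (two votes to one) while `b` ends up undecided because its votes are tied.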

aggregation function, argumentation, opinion profile, (13 more...)

doi: 10.1613/jair.1.14409

AI Access Foundation

14409

Country:

Europe > Finland > Uusimaa > Helsinki (0.04)
North America > United States > New York (0.04)
North America > United States > District of Columbia > Washington (0.04)
(8 more...)

Genre:

Overview (0.67)
Research Report > New Finding (0.45)

Industry:

Law (0.67)
Government > Regional Government > Europe Government > United Kingdom Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (0.72)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.67)
Information Technology > Communications > Collaboration (0.67)

Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation

Zheng, Wenqing, Sharan, S P, Jaiswal, Ajay Kumar, Wang, Kevin, Xi, Yihan, Xu, Dejia, Wang, Zhangyang

arXiv.org Artificial Intelligence, Jul-18-2023

For a complicated algorithm, its implementation by a human programmer usually starts with outlining a rough control flow followed by iterative enrichments, eventually yielding carefully generated syntactic structures and variables in a hierarchy. However, state-of-the-art large language models generate code in a single pass, without intermediate warm-ups to reflect the structured thought process of "outline-then-detail". Inspired by the recent success of chain-of-thought prompting, we propose ChainCoder, a program synthesis language model that generates Python code progressively, i.e., from coarse to fine in multiple passes. We first decompose source code into layout frame components and accessory components via abstract syntax tree parsing to construct a hierarchical representation. We then reformulate our prediction target as a multi-pass objective, in which each pass generates a subsequence and the subsequences are concatenated in the hierarchy. Finally, a tailored transformer architecture is leveraged to jointly encode the natural language descriptions and syntactically aligned I/O data samples. Extensive evaluations show that ChainCoder outperforms state-of-the-art methods, demonstrating that our progressive generation eases the reasoning procedure and guides the language model to generate higher-quality solutions. Our code is available at: https://github.com/VITA-Group/ChainCoder.
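To make the decomposition step concrete, here is a rough sketch that uses Python's standard `ast` module to separate a program into a coarse outline (statement-level node types) and finer details (identifiers and constants). This is not ChainCoder's actual tokenization, which the abstract does not specify; the choice of which node types count as "layout" and the function name `outline_and_details` are assumptions for illustration.

```python
import ast
from typing import List, Tuple

# Hypothetical choice of statement types that make up the coarse outline.
LAYOUT_NODES = (ast.FunctionDef, ast.For, ast.While, ast.If, ast.Return, ast.Assign)


def outline_and_details(source: str) -> Tuple[List[str], List[str]]:
    """Split source code into layout node names and accessory tokens."""
    tree = ast.parse(source)
    outline: List[str] = []
    details: List[str] = []
    for node in ast.walk(tree):
        if isinstance(node, LAYOUT_NODES):
            # Coarse level: only the kind of statement is recorded.
            outline.append(type(node).__name__)
        elif isinstance(node, ast.Name):
            # Fine level: variable names used in expressions and targets.
            details.append(node.id)
        elif isinstance(node, ast.Constant):
            # Fine level: literal values.
            details.append(repr(node.value))
    return outline, details


SAMPLE = """
def total(xs):
    s = 0
    for x in xs:
        s = s + x
    return s
"""

OUTLINE, DETAILS = outline_and_details(SAMPLE)
```

For the sample function, the outline holds one function definition, one loop, one return, and two assignments, while the details hold the names `s`, `x`, `xs` and the constant `0`. A multi-pass generator in this spirit would predict the outline first and fill in the details afterwards.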

chaincoder, large language model, machine learning, (17 more...)

2305.00909

Country:

North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
(2 more...)

Large Language Models Perform Diagnostic Reasoning

Wu, Cheng-Kuang, Chen, Wei-Lin, Chen, Hsin-Hsi

arXiv.org Artificial Intelligence, Jul-17-2023

We explore the extension of chain-of-thought (CoT) prompting to medical reasoning for the task of automatic diagnosis. Motivated by doctors' underlying reasoning process, we present Diagnostic-Reasoning CoT (DR-CoT). Empirical results demonstrate that by simply prompting large language models trained only on a general text corpus with two DR-CoT exemplars, the diagnostic accuracy improves by 15% compared to standard prompting. Moreover, the gap reaches a pronounced 18% in out-domain settings. Our findings suggest that expert-knowledge reasoning in large language models can be elicited through proper prompting.

artificial intelligence, large language model, natural language, (17 more...)

2307.08922

Country:

Asia > Taiwan (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
Health & Medicine > Therapeutic Area > Otolaryngology (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.91)

MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning

Liang, Zhenwen, Yu, Dian, Pan, Xiaoman, Yao, Wenlin, Zeng, Qingkai, Zhang, Xiangliang, Yu, Dong

arXiv.org Artificial Intelligence, Jul-16-2023

Reasoning in mathematical domains remains a significant challenge for relatively small language models (LMs). Many current methods focus on specializing LMs in mathematical reasoning and rely heavily on knowledge distillation from powerful but inefficient large LMs (LLMs). In this work, we explore a new direction that avoids over-reliance on LLM teachers, introducing a multi-view fine-tuning method that efficiently exploits existing mathematical problem datasets with diverse annotation styles. Our approach uniquely considers the various annotation formats as different "views" and leverages them in training the model. By postpending distinct instructions to input questions, models can learn to generate solutions in diverse formats in a flexible manner. Experimental results show that our strategy enables a LLaMA-7B model to outperform prior approaches that utilize knowledge distillation, as well as carefully established baselines. Additionally, the proposed method grants the models promising generalization ability across various views and datasets, and the capability to learn from inaccurate or incomplete noisy data. We hope our multi-view training paradigm could inspire future studies in other machine reasoning domains.
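The idea of tagging each training example with a view-specific instruction can be sketched as follows. The view names and instruction strings are hypothetical, since the abstract does not list the ones used in the paper; the sketch only shows how one question with differently annotated solutions might yield several training pairs.

```python
from typing import Dict, List

# Hypothetical instruction per annotation view; not the paper's wording.
VIEW_INSTRUCTIONS: Dict[str, str] = {
    "rationale": "Answer with step-by-step reasoning.",
    "equation": "Answer with a single equation.",
}


def make_example(question: str, solution: str, view: str) -> Dict[str, str]:
    """Append the view's instruction after the question text."""
    instruction = VIEW_INSTRUCTIONS[view]
    return {"input": f"{question.strip()} {instruction}", "target": solution}


def make_multi_view(question: str, solutions: Dict[str, str]) -> List[Dict[str, str]]:
    """Build one training pair per available annotation view."""
    return [make_example(question, sol, view) for view, sol in solutions.items()]


EXAMPLES = make_multi_view(
    "Tom has 3 apples and buys 2 more. How many apples does he have?",
    {
        "rationale": "He starts with 3 and adds 2, so he has 5.",
        "equation": "3 + 2 = 5",
    },
)
```

At inference time, the same instruction string would select the output format, which is what lets a single model serve several annotation styles.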

large language model, machine learning, natural language, (16 more...)

2307.07951

Genre: Research Report > New Finding (0.66)

Industry: Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.94)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

A Study of Situational Reasoning for Traffic Understanding

Zhang, Jiarui, Ilievski, Filip, Ma, Kaixin, Kollaa, Aravinda, Francis, Jonathan, Oltramari, Alessandro

arXiv.org Artificial Intelligence, Jul-15-2023

Intelligent Traffic Monitoring (ITMo) technologies hold the potential for improving road safety/security and for enabling smart city infrastructure. Understanding traffic situations requires a complex fusion of perceptual information with domain-specific and causal commonsense knowledge. Whereas prior work has provided benchmarks and methods for traffic monitoring, it remains unclear whether models can effectively align these information sources and reason in novel scenarios. To address this assessment gap, we devise three novel text-based tasks for situational reasoning in the traffic domain: i) BDD-QA, which evaluates the ability of Language Models (LMs) to perform situational decision-making, ii) TV-QA, which assesses LMs' abilities to reason about complex event causality, and iii) HDT-QA, which evaluates the ability of models to solve human driving exams. We adopt four knowledge-enhanced methods that have shown generalization capability across language reasoning tasks in prior work, based on natural language inference, commonsense knowledge-graph self-supervision, multi-QA joint training, and dense retrieval of domain information. We associate each method with a relevant knowledge source, including knowledge graphs, relevant benchmarks, and driving manuals. In extensive experiments, we benchmark various knowledge-aware methods against the three datasets, under zero-shot evaluation; we provide in-depth analyses of model performance on data partitions and examine model predictions categorically, to yield useful insights on traffic understanding, given different background knowledge and reasoning strategies.

large language model, machine learning, natural language, (18 more...)

doi: 10.1145/3580305.3599246

2306.0252

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > California > Los Angeles County > Long Beach (0.05)
(2 more...)

Genre: Research Report > New Finding (0.46)

Industry: Transportation > Ground > Road (0.88)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Classification and Generation of real-world data with an Associative Memory Model

Simas, Rodrigo, Sa-Couto, Luis, Wichert, Andreas

arXiv.org Artificial Intelligence, Jul-13-2023

Drawing from memory the face of a friend you have not seen in years is a difficult task. However, if you happen to cross paths, you would easily recognize each other. Biological memory is equipped with an impressive compression algorithm that can store the essential, and then infer the details to match perception. The Willshaw Memory is a simple abstract model for cortical computations which implements mechanisms of biological memories. Using our recently proposed sparse coding prescription for visual patterns, this model can store and retrieve an impressive amount of real-world data in a fault-tolerant manner. In this paper, we extend the capabilities of the basic Associative Memory Model by using a Multiple-Modality framework. In this setting, the memory stores several modalities (e.g., visual or textual) of each pattern simultaneously. After training, the memory can be used to infer missing modalities when just a subset is perceived. Using a simple encoder-memory-decoder architecture, and a newly proposed iterative retrieval algorithm for the Willshaw Model, we perform experiments on the MNIST dataset. By storing both the images and labels as modalities, a single Memory can be used not only to retrieve and complete patterns but also to classify and generate new ones. We further discuss how this model could be used for other learning tasks, thus serving as a biologically-inspired framework for learning.
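For readers unfamiliar with the Willshaw model, the sketch below shows the classic textbook form: a binary weight matrix is built by clipped Hebbian learning (a weight is set to 1 whenever an input unit and an output unit are active together), and retrieval thresholds each output unit at the number of active cue units. It does not reproduce the paper's sparse encoder, multiple-modality setup, or iterative retrieval algorithm; the class and method names are illustrative assumptions.

```python
from typing import List, Set


class WillshawMemory:
    """Binary associative memory over sparse patterns given as sets of active indices."""

    def __init__(self, n_in: int, n_out: int) -> None:
        self.n_in = n_in
        self.n_out = n_out
        # weights[i][j] == 1 if output unit i and input unit j were ever co-active.
        self.weights: List[List[int]] = [[0] * n_in for _ in range(n_out)]

    def store(self, x: Set[int], y: Set[int]) -> None:
        """Clipped Hebbian learning: switch on every (output, input) pair in y x x."""
        for i in y:
            for j in x:
                self.weights[i][j] = 1

    def retrieve(self, cue: Set[int]) -> Set[int]:
        """One-step retrieval: an output fires if it is connected to all cue units."""
        threshold = len(cue)
        if threshold == 0:
            return set()
        return {
            i
            for i in range(self.n_out)
            if sum(self.weights[i][j] for j in cue) >= threshold
        }


# Store two sparse input patterns, each associated with one output unit (a "label").
memory = WillshawMemory(n_in=8, n_out=4)
memory.store({0, 1, 2}, {0})
memory.store({3, 4, 5}, {1})
```

Because the threshold equals the size of the cue, a partial cue such as `{0, 1}` still retrieves output unit 0, which is the fault tolerance the abstract refers to. A cue that mixes units from both stored patterns retrieves nothing in this toy setup.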

information, modality, vector, (14 more...)

doi: 10.1016/j.neucom.2023.126514

2207.04827

Country:

Europe > Portugal > Lisbon > Lisbon (0.05)
North America > United States > New York > New York County > New York City (0.04)
Europe > Germany > Saarland (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.62)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.62)

IR Design for Application-Specific Natural Language: A Case Study on Traffic Data

Hu, Wei, Wang, Xuhong, Wang, Ding, Yao, Shengyue, Mao, Zuqiu, Li, Li, Wang, Fei-Yue, Lin, Yilun

arXiv.org Artificial Intelligence, Jul-13-2023

In the realm of software applications in the transportation industry, Domain-Specific Languages (DSLs) have enjoyed widespread adoption due to their ease of use and various other benefits. With the ceaseless progress in computer performance and the rapid development of large-scale models, the possibility of programming using natural language in specified applications - referred to as Application-Specific Natural Language (ASNL) - has emerged. ASNL exhibits greater flexibility and freedom, which, in turn, leads to an increase in computational complexity for parsing and a decrease in processing performance. To tackle this issue, our paper advances a design for an intermediate representation (IR) that caters to ASNL and can uniformly process transportation data into graph data format, improving data processing performance. Experimental comparisons reveal that in standard data query operations, our proposed IR design can achieve a speed improvement of over forty times compared to direct usage of standard XML format data.

application, artificial intelligence, natural language, (18 more...)

2307.06983

Country:

Asia > China > Shanghai > Shanghai (0.05)
Asia > Macao (0.05)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Ground > Road (0.69)
Transportation > Infrastructure & Services (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.49)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.46)

An Open-Source Knowledge Graph Ecosystem for the Life Sciences

Callahan, Tiffany J., Tripodi, Ignacio J., Stefanski, Adrianne L., Cappelletti, Luca, Taneja, Sanya B., Wyrwa, Jordan M., Casiraghi, Elena, Matentzoglu, Nicolas A., Reese, Justin, Silverstein, Jonathan C., Hoyt, Charles Tapley, Boyce, Richard D., Malec, Scott A., Unni, Deepak R., Joachimiak, Marcin P., Robinson, Peter N., Mungall, Christopher J., Cavalleri, Emanuele, Fontana, Tommaso, Valentini, Giorgio, Mesiti, Marco, Gillenwater, Lucas A., Santangelo, Brook, Vasilevsky, Nicole A., Hoehndorf, Robert, Bennett, Tellen D., Ryan, Patrick B., Hripcsak, George, Kahn, Michael G., Bada, Michael, Baumgartner, William A. Jr, Hunter, Lawrence E.

arXiv.org Artificial Intelligence, Jul-11-2023

Translational research requires data at multiple scales of biological organization. Advancements in sequencing and multi-omics technologies have increased the availability of these data but researchers face significant integration challenges. Knowledge graphs (KGs) are used to model complex phenomena, and methods exist to automatically construct them. However, tackling complex biomedical integration problems requires flexibility in the way knowledge is modeled. Moreover, existing KG construction methods provide robust tooling at the cost of fixed or limited choices among knowledge representation models. PheKnowLator (Phenotype Knowledge Translator) is a semantic ecosystem for automating the FAIR (Findable, Accessible, Interoperable, and Reusable) construction of ontologically grounded KGs with fully customizable knowledge representation. The ecosystem includes KG construction resources (e.g., data preparation APIs), analysis tools (e.g., SPARQL endpoints and abstraction algorithms), and benchmarks (e.g., prebuilt KGs and embeddings). We evaluate the ecosystem by surveying open-source KG construction methods and analyzing its computational performance when constructing 12 large-scale KGs. With flexible knowledge representation, PheKnowLator enables fully customizable KGs without compromising performance or usability.

github, knowledge graph, ontology, (9 more...)

2307.05727

Country:

North America > United States > Colorado > Boulder County > Boulder (0.14)
North America > United States > Colorado > Adams County > Aurora (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(12 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Technology (0.67)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Communications > Web > Semantic Web (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
(2 more...)

Some Preliminary Steps Towards Metaverse Logic

Furtado, Antonio L., Casanova, Marco A., de Lima, Edirlei Soares

arXiv.org Artificial Intelligence, Jul-10-2023

Assuming that the term 'metaverse' could be understood as a computer-based implementation of multiverse applications, we started to look in the present work for a logic that would be powerful enough to handle the situations arising both in the real and in the fictional underlying application domains. Realizing that first-order logic fails to account for the unstable behavior of even the most simpleminded information system domains, we resorted to non-conventional extensions, in an attempt to sketch a minimal composite logic strategy. The discussion was kept at a rather informal level, always trying to convey the intuition behind the theoretical notions in natural language terms, and appealing to an AI agent, namely ChatGPT, in the hope that algorithmic and common-sense approaches can be usefully combined.

logic & formal reasoning, machine learning, natural language, (20 more...)

2307.05574

Country:

Oceania > New Zealand (0.04)
North America > Bermuda (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(8 more...)

Genre:

Instructional Material (0.93)
Research Report (0.64)

Industry:

Law (1.00)
Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)
