AITopics | Problem Solving

Collaborating Authors

Problem Solving

News Overviews Instructional Materials AI-Alerts Classics

REFINER: Reasoning Feedback on Intermediate Representations

Paul, Debjit, Ismayilzada, Mete, Peyrard, Maxime, Borges, Beatriz, Bosselut, Antoine, West, Robert, Faltings, Boi

arXiv.org Artificial IntelligenceApr-4-2023

Language models (LMs) have recently shown remarkable performance on reasoning tasks by explicitly generating intermediate inferences, e.g., chain-of-thought prompting. However, these intermediate inference steps may be inappropriate deductions from the initial context and lead to incorrect final predictions. Here we introduce REFINER, a framework for finetuning LMs to explicitly generate intermediate reasoning steps while interacting with a critic model that provides automated feedback on the reasoning. Specifically, the critic provides structured feedback that the reasoning LM uses to iteratively improve its intermediate arguments. Empirical evaluations of REFINER on three diverse reasoning tasks show significant improvements over baseline LMs of comparable scale. Furthermore, when using GPT3.5 as the reasoner, the trained critic significantly improves reasoning without finetuning the reasoner. Finally, our critic model is trained without expensive human-in-the-loop data but can be substituted with humans at inference time.

computational linguistic, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2304.01904

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Washington > King County > Seattle (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(6 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.66)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.51)
Information Technology > Artificial Intelligence > Representation & Reasoning > Abductive Reasoning (0.48)

Add feedback

Vitalik Buterin reveals major challenge for Ethereum's future – and how to solve it

#artificialintelligenceApr-2-2023, 00:05:05 GMT

Ethereum Co-Founder Vitalik Buterin shared his musing on an "underdiscussed, but nevertheless very important" aspect of the Ethereum ecosystem in a recent blog post this weekend. The post entitled "How will Ethereum's multi-client philosophy interact with ZK-EVMs?" focused on the technical challenges, trade-offs, and potential solutions for creating a multi-client ecosystem for ZK-EVMs. Vitalik believes ZK-EVMs will evolve to become an essential part of Ethereum's layer-1 security and verification process in the future. Zero Knowledge (ZK) technology allows developers to prove the authenticity of a transaction or message without revealing any additional information. Thus, it allows one party to convince another that a message is true without disclosing any knowledge beyond the message's validity.

ethereum, vitalik, zk-evm implementation, (9 more...)

#artificialintelligence

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.40)

Add feedback

On the Complexity of Finding Set Repairs for Data-Graphs

Abriola, Sergio (Conicet UBA) | Martínez, María Vanina (Conicet UBA) | Pardal, Nina (Conicet UBA) | Cifuentes, Santiago (a:1:{s:5:"en_US";s:8:"FCEN UBA";}) | Pin Baque, Edwin (FCEN UBA)

Journal of Artificial Intelligence ResearchMar-30-2023

In the deeply interconnected world we live in, pieces of information link domains all around us. As graph databases embrace effectively relationships among data and allow processing and querying these connections efficiently, they are rapidly becoming a popular platform for storage that supports a wide range of domains and applications. As in the relational case, it is expected that data preserves a set of integrity constraints that define the semantic structure of the world it represents. When a database does not satisfy its integrity constraints, a possible approach is to search for a ‘similar’ database that does satisfy the constraints, also known as a repair. In this work, we study the problem of computing subset and superset repairs for graph databases with data values using a notion of consistency based on having a set of Reg-GXPath expressions as integrity constraints. We show that for positive fragments of Reg-GXPath these problems admit a polynomial-time algorithm, while the full expressive power of the language renders them intractable.

constraint, expression, node, (14 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.13994

AI Access Foundation

13994

Journal of Artificial Intelligence Research

Country:

South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Indiana > Madison County > Anderson (0.04)

Industry:

Leisure & Entertainment (0.67)
Media > Film (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.65)

Add feedback

Three-way causal attribute partial order structure analysis

Zaifa, Xue, Huibin, Lu, Tao, Zhang, Tao, Li, Xin, Lu

arXiv.org Artificial IntelligenceMar-28-2023

As an emerging concept cognitive learning model, partial order formal structure analysis (POFSA) has been widely used in the field of knowledge processing. In this paper, we propose the method named three-way causal attribute partial order structure (3WCAPOS) to evolve the POFSA from set coverage to causal coverage in order to increase the interpretability and classification performance of the model. First, the concept of causal factor (CF) is proposed to evaluate the causal correlation between attributes and decision attributes in the formal decision context. Then, combining CF with attribute partial order structure, the concept of causal attribute partial order structure is defined and makes set coverage evolve into causal coverage. Finally, combined with the idea of three-way decision, 3WCAPOS is formed, which makes the purity of nodes in the structure clearer and the changes between levels more obviously. In addition, the experiments are carried out from the classification ability and the interpretability of the structure through the six datasets. Through these experiments, it is concluded the accuracy of 3WCAPOS is improved by 1% - 9% compared with classification and regression tree, and more interpretable and the processing of knowledge is more reasonable compared with attribute partial order structure. Keywords: Formal concept analysis, Three-way decision, Attribute partial order structure, Causal inference, Causal factor 1. Introduction Attribute partial order structure analysis (APOSA) is an important method in the field of Concept-cognitive learning (CCL) [4, 31, 32, 19], which explores the relationship between attributes from the perspective of human cognition.

artificial intelligence, decision tree learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2303.17482

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Hebei Province (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)

Add feedback

When Brain-inspired AI Meets AGI

Zhao, Lin, Zhang, Lu, Wu, Zihao, Chen, Yuzhong, Dai, Haixing, Yu, Xiaowei, Liu, Zhengliang, Zhang, Tuo, Hu, Xintao, Jiang, Xi, Li, Xiang, Zhu, Dajiang, Shen, Dinggang, Liu, Tianming

arXiv.org Artificial IntelligenceMar-28-2023

Artificial General Intelligence (AGI) has been a long-standing goal of humanity, with the aim of creating machines capable of performing any intellectual task that humans can do. To achieve this, AGI researchers draw inspiration from the human brain and seek to replicate its principles in intelligent machines. Brain-inspired artificial intelligence is a field that has emerged from this endeavor, combining insights from neuroscience, psychology, and computer science to develop more efficient and powerful AI systems. In this article, we provide a comprehensive overview of brain-inspired AI from the perspective of AGI. We begin with the current progress in brain-inspired AI and its extensive connection with AGI. We then cover the important characteristics for both human intelligence and AGI (e.g., scaling, multimodality, and reasoning). We discuss important technologies toward achieving AGI in current AI systems, such as in-context learning and prompt tuning. We also investigate the evolution of AGI systems from both algorithmic and infrastructural perspectives. Finally, we explore the limitations and future of AGI.

arxiv preprint arxiv, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2303.15935

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > China > Shanghai > Shanghai (0.04)
North America > United States > Texas (0.04)
(7 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(4 more...)

Add feedback

Thermodynamics of bidirectional associative memories

Barra, Adriano, Catania, Giovanni, Decelle, Aurélien, Seoane, Beatriz

arXiv.org Artificial IntelligenceMar-27-2023

In this paper we investigate the equilibrium properties of bidirectional associative memories (BAMs). Introduced by Kosko in 1988 as a generalization of the Hopfield model to a bipartite structure, the simplest architecture is defined by two layers of neurons, with synaptic connections only between units of different layers: even without internal connections within each layer, information storage and retrieval are still possible through the reverberation of neural activities passing from one layer to another. We characterize the computational capabilities of a stochastic extension of this model in the thermodynamic limit, by applying rigorous techniques from statistical physics. A detailed picture of the phase diagram at the replica symmetric level is provided, both at finite temperature and in the noiseless regimes. Also for the latter, the critical load is further investigated up to one step of replica symmetry breaking. An analytical and numerical inspection of the transition curves (namely critical lines splitting the various modes of operation of the machine) is carried out as the control parameters - noise, load and asymmetry between the two layer sizes - are tuned. In particular, with a finite asymmetry between the two layers, it is shown how the BAM can store information more efficiently than the Hopfield model by requiring less parameters to encode a fixed number of patterns. Comparisons are made with numerical simulations of neural dynamics. Finally, a low-load analysis is carried out to explain the retrieval mechanism in the BAM by analogy with two interacting Hopfield models. A potential equivalence with two coupled Restricted Boltmzann Machines is also discussed.

artificial intelligence, hopfield model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1088/1751-8121/accc60

2211.09694

Country:

Europe > Spain > Galicia > Madrid (0.04)
Europe > Italy (0.04)
North America > United States > Colorado (0.04)
(2 more...)

Genre: Research Report (0.81)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.90)

Add feedback

A Survey on the Densest Subgraph Problem and its Variants

Lanciano, Tommaso, Miyauchi, Atsushi, Fazzone, Adriano, Bonchi, Francesco

arXiv.org Artificial IntelligenceMar-25-2023

The Densest Subgraph Problem requires to find, in a given graph, a subset of vertices whose induced subgraph maximizes a measure of density. The problem has received a great deal of attention in the algorithmic literature over the last five decades, with many variants proposed and many applications built on top of this basic definition. Recent years have witnessed a revival of research interest on this problem with several interesting contributions, including some groundbreaking results, published in 2022 and 2023. This survey provides a deep overview of the fundamental results and an exhaustive coverage of the many variants proposed in the literature, with a special attention on the most recent results. The survey also presents a comprehensive overview of applications and discusses some interesting open problems for this evergreen research topic.

bioinformatics, data mining, machine learning, (26 more...)

arXiv.org Artificial Intelligence

2303.14467

Country:

Asia > Afghanistan > Parwan Province > Charikar (0.05)
Asia > Japan > Honshū > Chūbu > Nagano Prefecture > Nagano (0.04)
North America > United States > Nebraska > Douglas County > Omaha (0.04)
(5 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Information Technology (0.67)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
(9 more...)

Add feedback

Heuristic Search for Multi-Objective Probabilistic Planning

Chen, Dillon, Trevizan, Felipe, Thiébaux, Sylvie

arXiv.org Artificial IntelligenceMar-25-2023

Heuristic search is a powerful approach that has successfully been applied to a broad class of planning problems, including classical planning, multi-objective planning, and probabilistic planning modelled as a stochastic shortest path (SSP) problem. Here, we extend the reach of heuristic search to a more expressive class of problems, namely multi-objective stochastic shortest paths (MOSSPs), which require computing a coverage set of non-dominated policies. We design new heuristic search algorithms MOLAO* and MOLRTDP, which extend well-known SSP algorithms to the multi-objective case. We further construct a spectrum of domain-independent heuristic functions differing in their ability to take into account the stochastic and multi-objective features of the problem to guide the search. Our experiments demonstrate the benefits of these algorithms and the relative merits of the heuristics.

artificial intelligence, improper policy, mossp, (15 more...)

arXiv.org Artificial Intelligence

2303.14363

Country:

North America > United States > Oklahoma > Payne County > Cushing (0.04)
Europe > United Kingdom (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Add feedback

Learning to Operate in Open Worlds by Adapting Planning Models

Piotrowski, Wiktor, Stern, Roni, Sher, Yoni, Le, Jacob, Klenk, Matthew, deKleer, Johan, Mohan, Shiwali

arXiv.org Artificial IntelligenceMar-24-2023

Planning agents are ill-equipped to act in novel situations in which their domain model no longer accurately represents the world. We introduce an approach for such agents operating in open worlds that detects the presence of novelties and effectively adapts their domain models and consequent action selection. It uses observations of action execution and measures their divergence from what is expected, according to the environment model, to infer existence of a novelty. Then, it revises the model through a heuristics-guided search over model changes. We report empirical evaluations on the CartPole problem, a standard Reinforcement Learning (RL) benchmark. The results show that our approach can deal with a class of novelties very quickly and in an interpretable fashion.

agent, artificial intelligence, planning & scheduling, (15 more...)

arXiv.org Artificial Intelligence

2303.14272

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.10)
Europe > United Kingdom > England > Greater London > London (0.05)
Asia > Middle East > Israel (0.05)

Genre: Research Report (0.85)

Industry: Government (0.31)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.74)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.72)

Add feedback

Planning Goals for Exploration

Hu, Edward S., Chang, Richard, Rybkin, Oleh, Jayaraman, Dinesh

arXiv.org Artificial IntelligenceMar-22-2023

Dropped into an unknown environment, what should an agent do to quickly learn about the environment and how to accomplish diverse tasks within it? We address this question within the goal-conditioned reinforcement learning paradigm, by identifying how the agent should set its goals at training time to maximize exploration. We propose "Planning Exploratory Goals" (PEG), a method that sets goals for each training episode to directly optimize an intrinsic exploration reward. PEG first chooses goal commands such that the agent's goal-conditioned policy, at its current level of training, will end up in states with high exploration potential. It then launches an exploration policy starting at those promising states. To enable this direct optimization, PEG learns world models and adapts sampling-based planning algorithms to "plan goal commands". In challenging simulated robotics environments including a multi-legged ant robot in a maze, and a robot arm on a cluttered tabletop, PEG exploration enables more efficient and effective training of goal-conditioned policies relative to baselines and ablations. Our ant successfully navigates a long maze, and the robot arm successfully builds a stack of three blocks upon command. Website: https://penn-pal-lab.github.io/peg/

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2303.13002

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Pennsylvania (0.04)
Europe > Russia (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry:

Education (0.67)
Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.54)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.51)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.50)

Add feedback