AITopics | Search

Collaborating Authors

Search

"Search is a problem-solving technique that systematically explores a space of problem states, i.e., successive and alternative stages in the problem-solving process. Examples of problem states might include the different board configurations in a game or intermediate steps in a reasoning process. This space of alternative solutions is then searched to find an answer. Newell and Simon (1976) have argued that this is the essential basis of human problem solving. Indeed, when a chess player examines the effects of different moves or a doctor considers a number of alternative diagnoses, they are searching among alternatives."
– from Section 1.2 of Chapter One of George F. Luger's textbook, Artificial Intelligence: Structures and Strategies for Complex Problem Solving, 5th Edition (Addison-Wesley; 2005).

News Overviews Instructional Materials AI-Alerts Classics

Minimax Value Interval for Off-Policy Evaluation and Policy Optimization

Neural Information Processing SystemsJan-22-2025, 08:13:23 GMT

We study minimax methods for off-policy evaluation (OPE) using value functions and marginalized importance weights. Despite that they hold promises of overcoming the exponential variance in traditional importance sampling, several key problems remain: (1) They require function approximation and are generally biased. For the sake of trustworthy OPE, is there anyway to quantify the biases? In this paper we answer both questions positively. By slightly altering the derivation of previous methods (one from each style), we unify them into a single value interval that comes with a special type of double robustness: when either the value-function or the importance-weight class is well specified, the interval is valid and its length quantifies the misspecification of the other class.

minimax value interval, off-policy evaluation and policy optimization, quantify, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.65)

Add feedback

Reviews: DetNAS: Backbone Search for Object Detection

Neural Information Processing SystemsJan-22-2025, 07:58:19 GMT

This paper proposes a neural network search strategy for object detection task. The problem is interesting and useful for many real applications. This paper gives a three stage solution that can search pre-training based detectors effectively and efficiently. Experiments on both COCO and VOC are conducted to show the effectiveness of the proposed solution, and detection based models are superior than classification based models. The idea of searching network structure for detection with pre-training stage is novel and interesting.

backbone search, detnas, object detection, (1 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.82)
Information Technology > Artificial Intelligence > Vision (0.66)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.42)

Add feedback

Boosting MCTS with Free Energy Minimization

Dao, Mawaba Pascal, Peter, Adrian M.

arXiv.org Artificial IntelligenceJan-22-2025

Active Inference, grounded in the Free Energy Principle, provides a powerful lens for understanding how agents balance exploration and goal-directed behavior in uncertain environments. Here, we propose a new planning framework, that integrates Monte Carlo Tree Search (MCTS) with active inference objectives to systematically reduce epistemic uncertainty while pursuing extrinsic rewards. Our key insight is that MCTS already renowned for its search efficiency can be naturally extended to incorporate free energy minimization by blending expected rewards with information gain. Concretely, the Cross-Entropy Method (CEM) is used to optimize action proposals at the root node, while tree expansions leverage reward modeling alongside intrinsic exploration bonuses. This synergy allows our planner to maintain coherent estimates of value and uncertainty throughout planning, without sacrificing computational tractability. Empirically, we benchmark our planner on a diverse set of continuous control tasks, where it demonstrates performance gains over both standalone CEM and MCTS with random rollouts.

artificial intelligence, machine learning, planning & scheduling, (12 more...)

arXiv.org Artificial Intelligence

2501.13083

Genre: Research Report (0.82)

Industry: Health & Medicine (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

Reviews: Theoretical Analysis of Adversarial Learning: A Minimax Approach

Neural Information Processing SystemsJan-21-2025, 23:21:17 GMT

Originality: I find the approach original and interesting, I find that other works have been cited and the section of related work is written clearly and detailed, it gives a nice overview. I think only that it is important to highlight more clearly the differences between [40] and the current work. In particular, it is unclear what is the penalty parameter, and how their method of adversarial training relates to this work - do they optimize a different bound or what quantities do they optimize, and do these quantities show up in the proposed bound? Quality: the work seems complete, and sound for as far as I could check. I could not check all the proofs in detail but I read the work in great detail.

adversarial learning, minimax approach, theoretical analysis, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.41)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.40)

Add feedback

Review for NeurIPS paper: Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms

Neural Information Processing SystemsJan-21-2025, 20:42:30 GMT

Additional Feedback: Post-rebuttal comments: I've read the rebuttal and other reviews. The authors have addressed most of my concerns and hence I increase my score. I hope the authors would make the suggested edits in the revised version and explain the role of their main assumption. Can you explain why things fail if this assumption does not hold? Can you make use of a prior (in the case it is informative)?

greedy algorithm, multi-armed bandit, unreasonable effectiveness, (3 more...)

Neural Information Processing Systems

Genre: Personal > Interview (0.39)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.48)
Information Technology > Data Science > Data Mining > Big Data (0.40)

Add feedback

Review for NeurIPS paper: Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms

Neural Information Processing SystemsJan-21-2025, 20:42:22 GMT

All reviewers agree that the paper considers a problem of relevance (bandits with many arms) and shows interesting results about simple-to-implement learning algorithms based on the greedy principle. However, one lingering concern that arose during the discussions among the reviewers was whether/how the results obtained in the paper applied for the case when the number of arms is larger than the time horizon of the game (k T). It appears that the author response to this question has not been substantial. Though I can see that this will not be an issue -- the proof of Lemma 2 bounds regret with respect to the best possible reward of 1, the author(s) is/are requested to add a precise clarification of this regime in the updated version.

greedy algorithm, multi-armed bandit, unreasonable effectiveness, (2 more...)

Neural Information Processing Systems

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.40)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.40)

Add feedback

Reviews: Learning to Perform Local Rewriting for Combinatorial Optimization

Neural Information Processing SystemsJan-21-2025, 19:37:56 GMT

After rebuttal: The discussion of the method applicability in the rebuttal is convinced for me. I upgrade my score to 7. This paper proposes a learning-based approach for combinatorial optimization problems. Starting from an initial complete solution of the problem, several local rewriting updates are applied to the solution iteratively. In each rewriting step, a local region and an updating rule are picked to update the solution and two networks are trained by reinforcement learning to pick local regions and updating rules.

combinatorial optimization, learning, perform local rewriting, (2 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.63)
Information Technology > Artificial Intelligence > Machine Learning (0.60)

Add feedback

Reviews: Learning to Perform Local Rewriting for Combinatorial Optimization

Neural Information Processing SystemsJan-21-2025, 19:37:46 GMT

The reviewers liked the paper and were further convinced by the response. Please take their suggestions into account when preparing the final version of your paper.

combinatorial optimization, learning, perform local rewriting

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.40)

Add feedback

Reviews: Learning Local Search Heuristics for Boolean Satisfiability

Neural Information Processing SystemsJan-21-2025, 19:36:22 GMT

This work is original in its use of deep reinforcement learning and graph neural networks to learn novel search control heuristics for SAT solving. While the techniques used are not novel themselves, the application domain is. The authors do a good job of surveying related work in this area and situating their contributions in this landscape. The paper is well-written and I found it very easy to follow the details of the proposed approach and the authors' results. Technically, the work presented is solid, though I have a few comments/suggestions here.

boolean satisfiability, learning local search heuristic, solution time, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.58)

Add feedback

Reviews: Learning Local Search Heuristics for Boolean Satisfiability

Neural Information Processing SystemsJan-21-2025, 19:36:10 GMT

The reviewers were positive about this paper based upon their initial read. The authors response addressed their concerns, so they were even more comfortable with a positive outcome after the author response. I encourage the authors to incorporate their responses to the reviewer concerns into any final version of the paper.

author response, boolean satisfiability, learning local search heuristic

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.85)

Add feedback