Goto

Collaborating Authors

 Search


Generalizing Bayesian Optimization with Decision-theoretic Entropies

arXiv.org Artificial Intelligence

Bayesian optimization (BO) is a popular method for efficiently inferring optima of an expensive black-box function via a sequence of queries. Existing information-theoretic BO procedures aim to make queries that most reduce the uncertainty about optima, where the uncertainty is captured by Shannon entropy. However, an optimal measure of uncertainty would, ideally, factor in how we intend to use the inferred quantity in some downstream procedure. In this paper, we instead consider a generalization of Shannon entropy from work in statistical decision theory (DeGroot 1962, Rao 1984), which contains a broad class of uncertainty measures parameterized by a problem-specific loss function corresponding to a downstream task. We first show that special cases of this entropy lead to popular acquisition functions used in BO procedures such as knowledge gradient, expected improvement, and entropy search. We then show how alternative choices for the loss yield a flexible family of acquisition functions that can be customized for use in novel optimization settings. Additionally, we develop gradient-based methods to efficiently optimize our proposed family of acquisition functions, and demonstrate strong empirical performance on a diverse set of sequential decision making tasks, including variants of top-$k$ optimization, multi-level set estimation, and sequence search.


HPC Storage Service Autotuning Using Variational-Autoencoder-Guided Asynchronous Bayesian Optimization

arXiv.org Artificial Intelligence

They range from Empirical performance tuning, also known as autotuning, is multiuser, high-speed storage systems such as burst buffers [2], a hot topic in software optimization nowadays, and a promising [3], [4], to transient, application-specific services providing approach for HPC storage service tuning. In this approach, processing capabilities such as in situ analysis [5], [6], [7]. the user exposes the tunable parameters and defines the range These systems aim to improve I/O and storage performance of values that each parameter can take; a search method by moving away from file-based interfaces and from the is then used to explore the parameter space by executing POSIX semantics, instead providing specific interfaces and optimizations different parameter configurations on the target platform. The that can be tailored to individual applications. An challenge for HPC storage services autotuning stems from example of such a distributed storage service is HEPnOS [8], the complexity of the workflow and the search space. First, an in-memory object store for high-energy physics (HEP) several tunable parameters can be interdependent, requiring an applications developed by Argonne National Laboratory and execution of the complete workflow on the target platform for FermiLab.


On the Generalization of Neural Combinatorial Optimization Heuristics

arXiv.org Artificial Intelligence

Neural Combinatorial Optimization approaches have recently leveraged the expressiveness and flexibility of deep neural networks to learn efficient heuristics for hard Combinatorial Optimization (CO) problems. However, most of the current methods lack generalization: for a given CO problem, heuristics which are trained on instances with certain characteristics underperform when tested on instances with different characteristics. While some previous works have focused on varying the training instances properties, we postulate that a one-size-fit-all model is out of reach. Instead, we formalize solving a CO problem over a given instance distribution as a separate learning task and investigate meta-learning techniques to learn a model on a variety of tasks, in order to optimize its capacity to adapt to new tasks. Through extensive experiments, on two CO problems, using both synthetic and realistic instances, we show that our proposed meta-learning approach significantly improves the generalization of two state-of-the-art models.


Unsupervised Search Algorithm Configuration using Query Performance Prediction

arXiv.org Artificial Intelligence

Search engine configuration can be quite difficult for inexpert developers. Instead, an auto-configuration approach can be used to speed up development time. Yet, such an automatic process usually requires relevance labels to train a supervised model. In this work, we suggest a simple solution based on query performance prediction that requires no relevance labels but only a sample of queries in a given domain. Using two example usecases we demonstrate the merits of our solution.


Persistent Homology Guided Monte-Carlo Tree Search for Effective Non-Prehensile Manipulation

arXiv.org Artificial Intelligence

Performing object retrieval tasks in messy real-world workspaces involves the challenges of \emph{uncertainty} and \emph{clutter}. One option is to solve retrieval problems via a sequence of prehensile pick-n-place operations, which can be computationally expensive to compute in highly-cluttered scenarios and also inefficient to execute. The proposed framework selects the option of performing non-prehensile actions, such as pushing, to clean a cluttered workspace to allow a robotic arm to retrieve a target object. Non-prehensile actions, allow to interact simultaneously with multiple objects, which can speed up execution. At the same time, they can significantly increase uncertainty as it is not easy to accurately estimate the outcome of a pushing operation in clutter. The proposed framework integrates topological tools and Monte-Carlo tree search to achieve effective and robust pushing for object retrieval problems. In particular, it proposes using persistent homology to automatically identify manageable clustering of blocking objects in the workspace without the need for manually adjusting hyper-parameters. Furthermore, MCTS uses this information to explore feasible actions to push groups of objects together, aiming to minimize the number of pushing actions needed to clear the path to the target object. Real-world experiments using a Baxter robot, which involves some noise in actuation, show that the proposed framework achieves a higher success rate in solving retrieval tasks in dense clutter compared to state-of-the-art alternatives. Moreover, it produces high-quality solutions with a small number of pushing actions improving the overall execution time. More critically, it is robust enough that it allows to plan the sequence of actions offline and then execute them reliably online with Baxter.


Inability of a graph neural network heuristic to outperform greedy algorithms in solving combinatorial optimization problems like Max-Cut

arXiv.org Artificial Intelligence

In Ref. [1], Schuetz et al provide a scheme to employ Among a variety of QUBO problems Ref. [1] consider The cut results for the GNN (for both, d = 3 and 5) are presented in Figure 1 of Ref. [1], whose flip (x After only fit to the GNN data obtained from averaging over 0.4n such flips, typically no further improvements were randomly generated instances of the problem for a progression possible and GD converged; very scalable and fast (done of different problem sizes n. Like in Ref. [1], I have also Figure 1(a), the results all look rather good, although it is included what they describe as a rigorous upper bound, already noticeable that results for GD are barely distinguishable cut While the GNN results To discern further details, it is essential to present appear impressively close to that upper bound, however, the data in a form that, at least, eliminates some of including two other sets of data puts these results in a its trivial aspects. The second set is achieved by a simple gradient for better comparison with Refs. Also, energy results (blue line) are systematical far (> 15% at any n) provides a fair reference point to assess relative error because from optimal (1-RSB, green line) and hardly provide any a purely random assignment of variables results in improvement over pure gradient descent (GD, maroon an energy of zero, the ultimate null model. It appears that the GNN learns what is indeed point is lacking for the errors quoted in Tab. 1 of the most typical about the energy landscape: the vast Ref. [1], for example.) Since we care about the scalability of the algorithm in In fact, extending GD by a subsequent 5n spin flips, say, the asymptotic limit for large problem sizes n, each flip adjusting one among the least-stable spins (even which in the form of Figure 1(a) is out of view, it expedient if not always unstable), allows this greedy local search to to visualize the data plotted for an inverse of the explore several local minima, still at linear cost.


Incentive Mechanism and Path Planning for UAV Hitching over Traffic Networks

arXiv.org Artificial Intelligence

Package delivery via the UAVs is a promising transport mode to provide efficient and green logistic services, especially in urban areas or complicated topography. However, the energy storage limit of the UAV makes it difficult to perform long-distance delivery tasks. In this paper, we propose a novel multimodal logistics framework, in which the UAVs can call on ground vehicles to provide hitch services to save their own energy and extend their delivery distance. This multimodal logistics framework is formulated as a two-stage model to jointly consider the incentive mechanism design for ground vehicles and path planning for UAVs. In Stage I, to deal with the motivations for ground vehicles to assist UAV delivery, a dynamic pricing scheme is proposed to best balance the vehicle response time and payments to ground vehicles. It shows that a higher price should be decided if the vehicle response time is long to encourage more vehicles to offer a ride. In Stage II, the task allocation and path planning of the UAVs over traffic network is studied based on the vehicle response time obtained in Stage I. To address pathfinding with restrictions and the performance degradation of the pathfinding algorithm due to the rising number of conflicts in multi-agent pathfinding, we propose the suboptimal conflict avoidance-based path search (CABPS) algorithm, which has polynomial time complexity. Finally, we validate our results via simulations. It is shown that our approach is able to increase the success rate of UAV package delivery. Moreover, we estimate the delivery time of the UAV in a pessimistic case, it is still twice as fast as the delivery time of the ground vehicle only.


NAS-based Recursive Stage Partial Network (RSPNet) for Light-Weight Semantic Segmentation

arXiv.org Artificial Intelligence

Current NAS-based semantic segmentation methods focus on accuracy improvements rather than light-weight design. In this paper, we proposed a two-stage framework to design our NAS-based RSPNet model for light-weight semantic segmentation. The first architecture search determines the inner cell structure, and the second architecture search considers exponentially growing paths to finalize the outer structure of the network. It was shown in the literature that the fusion of high- and low-resolution feature maps produces stronger representations. To find the expected macro structure without manual design, we adopt a new path-attention mechanism to efficiently search for suitable paths to fuse useful information for better segmentation. Our search for repeatable micro-structures from cells leads to a superior network architecture in semantic segmentation. In addition, we propose an RSP (recursive Stage Partial) architecture to search a light-weight design for NAS-based semantic segmentation. The proposed architecture is very efficient, simple, and effective that both the macro- and micro- structure searches can be completed in five days of computation on two V100 GPUs. The light-weight NAS architecture with only 1/4 parameter size of SoTA architectures can achieve SoTA performance on semantic segmentation on the Cityscapes dataset without using any backbones.


Computer Vision - Richard Szeliski

#artificialintelligence

As humans, we perceive the three-dimensional structure of the world around us with apparent ease. Think of how vivid the three-dimensional percept is when you look at a vase of flowers sitting on the table next to you. You can tell the shape and translucency of each petal through the subtle patterns of light and shading that play across its surface and effortlessly segment each flower from the background of the scene (Figure 1.1). Looking at a framed group por- trait, you can easily count (and name) all of the people in the picture and even guess at their emotions from their facial appearance. Perceptual psychologists have spent decades trying to understand how the visual system works and, even though they can devise optical illusions1 to tease apart some of its principles (Figure 1.3), a complete solution to this puzzle remains elusive (Marr 1982; Palmer 1999; Livingstone 2008).


Nested Search versus Limited Discrepancy Search

arXiv.org Artificial Intelligence

Limited Discrepancy Search (LDS) is a popular algorithm to search a state space with a heuristic to order the possible actions. Nested Search (NS) is another algorithm to search a state space with the same heuristic. NS spends more time on the move associated to the best heuristic playout while LDS spends more time on the best heuristic move. They both use similar times for the same level of search. We advocate in this paper that it is often better to follow the best heuristic playout as in NS than to follow the heuristic as in LDS.