AITopics | Optimization

Optimization

Submodular Optimization with Submodular Cover and Submodular Knapsack Constraints

Neural Information Processing Systems | Feb-14-2020, 18:26:32 GMT

We investigate two new optimization problems: minimizing a submodular function subject to a submodular lower bound constraint (submodular cover) and maximizing a submodular function subject to a submodular upper bound constraint (submodular knapsack). We are motivated by a number of real-world applications in machine learning, including sensor placement and data subset selection, which require maximizing a certain submodular function (like coverage or diversity) while simultaneously minimizing another (like cooperative cost). These problems are often posed as minimizing the difference between submodular functions [9, 23], which is inapproximable in the worst case. We show, however, that by phrasing these problems as constrained optimization, which is more natural for many applications, we achieve a number of bounded approximation guarantees. We also show that the two problems are closely related, and that an approximation algorithm solving one can be used to obtain an approximation guarantee for the other.
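The two problem statements in this abstract can be made concrete on a toy instance. The sketch below is not the paper's algorithm: it solves both problems by brute-force enumeration, which is only feasible for tiny ground sets. The sensor data, the names `coverage`, `cost`, `submodular_cover`, and `submodular_knapsack`, and the choice of a square-root cooperative cost are all assumptions made for illustration.

```python
from itertools import combinations
from math import sqrt

# Hypothetical toy instance: four candidate sensors, the areas each one covers,
# and a per-sensor weight used to build a cooperative cost.
AREAS = {0: {1, 2}, 1: {2, 3}, 2: {4}, 3: {1, 2, 3, 4}}
WEIGHTS = {0: 1.0, 1: 1.0, 2: 1.0, 3: 9.0}


def coverage(X):
    """g(X): number of distinct areas covered (monotone submodular)."""
    return len(set().union(*(AREAS[i] for i in X)))


def cost(X):
    """f(X): square root of the total weight (concave of modular, so submodular)."""
    return sqrt(sum(WEIGHTS[i] for i in X))


def all_subsets(ground):
    """Yield every subset of the ground set as a frozenset."""
    items = sorted(ground)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)


def submodular_cover(c):
    """Minimize cost(X) subject to coverage(X) >= c, by exhaustive search."""
    feasible = [X for X in all_subsets(AREAS) if coverage(X) >= c]
    return min(feasible, key=cost)


def submodular_knapsack(b):
    """Maximize coverage(X) subject to cost(X) <= b, by exhaustive search."""
    feasible = [X for X in all_subsets(AREAS) if cost(X) <= b]
    return max(feasible, key=coverage)


cover_set = submodular_cover(4)
# Feeding the optimal cover cost back in as a budget recovers full coverage,
# a small instance of the link between the two problems.
knapsack_set = submodular_knapsack(cost(cover_set))
```

On this instance the cheapest full cover combines the three low-weight sensors rather than the single expensive one, because the cooperative cost grows sublinearly in total weight.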

cover and submodular knapsack constraint, submodular function, submodular optimization, (5 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.64)
Information Technology > Artificial Intelligence > Machine Learning (0.49)

Automating Bayesian optimization with Bayesian optimization

Malkomes, Gustavo, Garnett, Roman

Neural Information Processing Systems | Feb-14-2020, 17:42:25 GMT

Bayesian optimization is a powerful tool for global optimization of expensive functions. One of its key components is the underlying probabilistic model used for the objective function f. In practice, however, it is often unclear how one should appropriately choose a model, especially when gathering data is expensive. In this work, we introduce a novel automated Bayesian optimization approach that dynamically selects promising models for explaining the observed data, using Bayesian optimization in the model space. Crucially, we account for the uncertainty in the choice of model; our method is capable of using multiple models to represent its current belief about f and subsequently using this information for decision making.

automating bayesian optimization, bayesian optimization, optimization

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)

Approximate Dynamic Programming Finally Performs Well in the Game of Tetris

Gabillon, Victor, Ghavamzadeh, Mohammad, Scherrer, Bruno

Neural Information Processing Systems | Feb-14-2020, 17:42:00 GMT

Tetris is a popular video game that has been widely used as a benchmark for various optimization techniques, including approximate dynamic programming (ADP) algorithms. A close look at the literature on this game shows that ADP algorithms, which have been (almost) entirely based on approximating the value function (value function based), have performed poorly in Tetris, while methods that search directly in the space of policies by learning the policy parameters with a black-box optimizer, such as the cross entropy (CE) method, have achieved the best reported results. This leads us to conjecture that Tetris is a game in which good policies are easier to represent, and thus to learn, than their corresponding value functions. So, in order to obtain good performance with ADP, we should use ADP algorithms that search in a policy space instead of the more traditional ones that search in a value function space. In this paper, we put our conjecture to the test by applying such an ADP algorithm, called classification-based modified policy iteration (CBMPI), to the game of Tetris.
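For readers unfamiliar with the cross entropy (CE) method mentioned in this abstract, the following is a minimal sketch of the general idea on a one-parameter toy problem. It is not the Tetris setup and not CBMPI: the function `cross_entropy_maximize`, the score `toy_score`, and every hyperparameter value are assumptions chosen for illustration. In a policy-search setting, the score would instead be something like the average game return of a policy with the sampled parameters.

```python
import random
import statistics


def cross_entropy_maximize(score, mean=0.0, std=5.0, n_samples=50,
                           n_elite=10, n_iters=30, seed=0):
    """Maximize a black-box score over one real parameter with the CE method."""
    rng = random.Random(seed)
    for _ in range(n_iters):
        # Sample candidate parameters from the current Gaussian.
        samples = [rng.gauss(mean, std) for _ in range(n_samples)]
        # Keep the best-scoring candidates (the elite set).
        elite = sorted(samples, key=score, reverse=True)[:n_elite]
        # Refit the sampling distribution to the elite set.
        mean = statistics.fmean(elite)
        std = statistics.pstdev(elite) + 1e-6  # small floor avoids a zero std
    return mean


def toy_score(theta):
    """Hypothetical stand-in for a policy's return; the best parameter is 3."""
    return -(theta - 3.0) ** 2


best_theta = cross_entropy_maximize(toy_score)
```

The optimizer only ever calls `score`, which is what makes it a black box: no gradient or value function of the underlying problem is needed.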

adp algorithm, tetris, value function, (4 more...)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.64)

Adversarially Robust Optimization with Gaussian Processes

Bogunovic, Ilija, Scarlett, Jonathan, Jegelka, Stefanie, Cevher, Volkan

Neural Information Processing Systems | Feb-14-2020, 17:26:47 GMT

In this paper, we consider the problem of Gaussian process (GP) optimization with an added robustness requirement: The returned point may be perturbed by an adversary, and we require the function value to remain as high as possible even after this perturbation. This problem is motivated by settings in which the underlying functions during optimization and implementation stages are different, or when one is interested in finding an entire region of good inputs rather than only a single point. We show that standard GP optimization algorithms do not exhibit the desired robustness properties, and provide a novel confidence-bound based algorithm StableOpt for this purpose. We rigorously establish the required number of samples for StableOpt to find a near-optimal point, and we complement this guarantee with an algorithm-independent lower bound. We experimentally demonstrate several potential applications of interest using real-world data sets, and we show that StableOpt consistently succeeds in finding a stable maximizer where several baseline methods fail.
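The robustness requirement described in this abstract can be illustrated without any Gaussian process machinery. The sketch below is not StableOpt: it assumes the function is fully known on a small integer grid and that the adversary may shift the returned index by at most `eps` positions. The values in `values` and the helper names are made up for the example.

```python
def robust_value(values, i, eps):
    """Worst-case function value after an adversary moves index i by up to eps."""
    lo = max(0, i - eps)
    hi = min(len(values), i + eps + 1)
    return min(values[lo:hi])


def argmax(scores):
    """Index of the largest entry (first one on ties)."""
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical function values on a 1-D grid: a sharp peak and a broad plateau.
values = [0, 1, 10, 1, 0, 6, 7, 6, 0]

nominal_best = argmax(values)  # picks the sharp peak
robust_scores = [robust_value(values, i, eps=1) for i in range(len(values))]
robust_best = argmax(robust_scores)  # picks the center of the plateau
```

The standard maximizer sits on the sharp peak, where a one-step perturbation drops the value from 10 to 1, while the robust maximizer sits on the plateau, where the value never falls below 6.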

gaussian process, modeling & simulation, optimization problem, (3 more...)

Technology:

Information Technology > Modeling & Simulation (0.65)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.64)

A primal-dual method for conic constrained distributed optimization problems

Aybat, Necdet Serhat, Hamedani, Erfan Yazdandoost

Neural Information Processing Systems | Feb-14-2020, 17:12:46 GMT

We consider cooperative multi-agent consensus optimization problems over an undirected network of agents, where only those agents connected by an edge can directly communicate. The objective is to minimize the sum of agent-specific composite convex functions over agent-specific private conic constraint sets; hence, the optimal consensus decision should lie in the intersection of these private sets. We provide convergence rates in sub-optimality, infeasibility and consensus violation; examine the effect of underlying network topology on the convergence rates of the proposed decentralized algorithms; and show how to extend these methods to handle time-varying communication networks. Papers published at the Neural Information Processing Systems Conference.

convergence rate, optimization problem, primal-dual method, (1 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Bayesian Optimization with Gradients

Wu, Jian, Poloczek, Matthias, Wilson, Andrew G., Frazier, Peter

Neural Information Processing Systems | Feb-14-2020, 17:11:37 GMT

Bayesian optimization has shown success in global optimization of expensive-to-evaluate multimodal objective functions. However, unlike most optimization methods, Bayesian optimization typically does not use derivative information. In this paper we show how Bayesian optimization can exploit derivative information to find good solutions with fewer objective function evaluations. In particular, we develop a novel Bayesian optimization algorithm, the derivative-enabled knowledge-gradient (dKG), which is one-step Bayes-optimal, asymptotically consistent, and provides greater one-step value of information than in the derivative-free setting. We also compute the dKG acquisition function and its gradient using a novel fast discretization-free technique.

bayesian optimization, derivative information, gradient, (1 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

A Dual Framework for Low-rank Tensor Completion

Nimishakavi, Madhav, Jawanpuria, Pratik Kumar, Mishra, Bamdev

Neural Information Processing Systems | Feb-14-2020, 16:57:06 GMT

One of the popular approaches for low-rank tensor completion is to use the latent trace norm regularization. However, most existing works in this direction learn a sparse combination of tensors. In this work, we fill this gap by proposing a variant of the latent trace norm that helps in learning a non-sparse combination of tensors. We develop a dual framework for solving the low-rank tensor completion problem. Overall, the optimal solution is shown to lie on a Cartesian product of Riemannian manifolds.

dual framework, low-rank tensor completion, optimal solution, (1 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.53)

Learning Supervised PageRank with Gradient-Based and Gradient-Free Optimization Methods

Bogolubsky, Lev, Dvurechenskii, Pavel, Gasnikov, Alexander, Gusev, Gleb, Nesterov, Yurii, Raigorodskii, Andrei M., Tikhonov, Aleksey, Zhukovskii, Maksim

Neural Information Processing Systems | Feb-14-2020, 16:57:02 GMT

In this paper, we consider a non-convex loss-minimization problem of learning Supervised PageRank models, which can account for features of nodes and edges. We propose gradient-based and random gradient-free methods to solve this problem. Our algorithms are based on the concept of an inexact oracle and, unlike the state-of-the-art gradient-based method, we provide theoretical convergence rate guarantees for both of them. Finally, we compare the performance of the proposed optimization methods with the state of the art applied to a ranking task.

gradient-based and gradient-free optimization method, learning supervised pagerank

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)

Maximum Margin Interval Trees

Drouin, Alexandre, Hocking, Toby, Laviolette, Francois

Neural Information Processing Systems | Feb-14-2020, 16:28:02 GMT

Learning a regression function using censored or interval-valued output data is an important problem in fields such as genomics and medicine. The goal is to learn a real-valued prediction function, and the training output labels indicate an interval of possible values. Whereas most existing algorithms for this task are linear models, in this paper we investigate learning nonlinear tree models. We propose to learn a tree by minimizing a margin-based discriminative objective function, and we provide a dynamic programming algorithm for computing the optimal solution in log-linear time. We show empirically that this algorithm achieves state-of-the-art speed and prediction accuracy in a benchmark of several data sets.

algorithm, maximum margin interval tree

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Memoized Online Variational Inference for Dirichlet Process Mixture Models

Hughes, Michael C., Sudderth, Erik

Neural Information Processing Systems | Feb-14-2020, 16:27:05 GMT

Variational inference algorithms provide the most effective framework for large-scale training of Bayesian nonparametric models. Stochastic online approaches are promising, but are sensitive to the chosen learning rate and often converge to poor local optima. We present a new algorithm, memoized online variational inference, which scales to very large (yet finite) datasets while avoiding the complexities of stochastic gradient. Our algorithm maintains finite-dimensional sufficient statistics from batches of the full dataset, requiring some additional memory but still scaling to millions of examples. Exploiting nested families of variational bounds for infinite nonparametric models, we develop principled birth and merge moves allowing non-local optimization.

dirichlet process mixture model, memoized online variational inference, nonparametric model, (3 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.72)
