AITopics | Optimization

Collaborating Authors

Optimization

News Overviews Instructional Materials AI-Alerts Classics

Optimal DR-Submodular Maximization and Applications to Provable Mean Field Inference

Bian, An, Buhmann, Joachim M., Krause, Andreas

arXiv.org Machine LearningMay-18-2018

Mean field inference in probabilistic models is generally a highly nonconvex problem. Existing optimization methods, e.g., coordinate ascent algorithms, can only generate local optima. In this work we propose provable mean filed methods for probabilistic log-submodular models and its posterior agreement (PA) with strong approximation guarantees. The main algorithmic technique is a new Double Greedy scheme, termed DR-DoubleGreedy, for continuous DR-submodular maximization with box-constraints. It is a one-pass algorithm with linear time complexity, reaching the optimal 1/2 approximation ratio, which may be of independent interest. We validate the superior performance of our algorithms against baseline algorithms on both synthetic and real-world datasets.

algorithm, artificial intelligence, machine learning, (15 more...)

arXiv.org Machine Learning

1805.07482

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Computing Kantorovich-Wasserstein Distances on $d$-dimensional histograms using $(d+1)$-partite graphs

Auricchio, Gennaro, Bassetti, Federico, Gualandi, Stefano, Veneroni, Marco

arXiv.org Machine LearningMay-18-2018

This paper presents a novel method to compute the exact Kantorovich-Wasserstein distance between a pair of $d$-dimensional histograms having $n$ bins each. We prove that this problem is equivalent to an uncapacitated minimum cost flow problem on a $(d+1)$-partite graph with $(d+1)n$ nodes and $dn^{\frac{d+1}{d}}$ arcs, whenever the cost is separable along the principal $d$-dimensional directions. We show numerically the benefits of our approach by computing the Kantorovich-Wasserstein distance of order 2 among two sets of instances: gray scale images and $d$-dimensional biomedical histograms. On these types of instances, our approach is competitive with state-of-the-art optimal transport algorithms.

artificial intelligence, kantorovich-wasserstein distance, machine learning, (17 more...)

arXiv.org Machine Learning

1805.07416

Genre: Research Report (0.84)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)

Add feedback

Optimizing for Generalization in Machine Learning with Cross-Validation Gradients

Barratt, Shane, Sharma, Rishi

arXiv.org Machine LearningMay-18-2018

Cross-validation is the workhorse of modern applied statistics and machine learning, as it provides a principled framework for selecting the model that maximizes generalization performance. In this paper, we show that the cross-validation risk is differentiable with respect to the hyperparameters and training data for many common machine learning algorithms, including logistic regression, elastic-net regression, and support vector machines. Leveraging this property of differentiability, we propose a cross-validation gradient method (CVGM) for hyperparameter optimization. Our method enables efficient optimization in high-dimensional hyperparameter spaces of the cross-validation risk, the best surrogate of the true generalization ability of our learning algorithm.

algorithm, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

1805.07072

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Cross Validation (1.00)

Add feedback

Preference Elicitation and Robust Optimization with Multi-Attribute Quasi-Concave Choice Functions

Haskell, William B., Huang, Wenjie, Xu, Huifu

arXiv.org Artificial IntelligenceMay-17-2018

Decision maker's preferences are often captured by some choice functions which are used to rank prospects. In this paper, we consider ambiguity in choice functions over a multi-attribute prospect space. Our main result is a robust preference model where the optimal decision is based on the worst-case choice function from an ambiguity set constructed through preference elicitation with pairwise comparisons of prospects. Differing from existing works in the area, our focus is on quasi-concave choice functions rather than concave functions and this enables us to cover a wide range of utility/risk preference problems including multi-attribute expected utility and $S$-shaped aspirational risk preferences. The robust choice function is increasing and quasi-concave but not necessarily translation invariant, a key property of monetary risk measures. We propose two approaches based respectively on the support functions and level functions of quasi-concave functions to develop tractable formulations of the maximin preference robust optimization model. The former gives rise to a mixed integer linear programming problem whereas the latter is equivalent to solving a sequence of convex risk minimization problems. To assess the effectiveness of the proposed robust preference optimization model and numerical schemes, we apply them to a security budget allocation problem and report some preliminary results from experiments.

choice function, law enforcement, optimization problem, (20 more...)

arXiv.org Artificial Intelligence

1805.06632

Country: North America > United States > California > Los Angeles County (0.14)

Genre: Research Report (0.82)

Industry:

Law Enforcement & Public Safety (0.93)
Energy > Oil & Gas (0.67)
Government > Regional Government > North America Government > United States Government (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

Add feedback

Improved Multi-Objective Binary Fish School for Feature Selection

Macedo, Mariana (University of Pernambuco, Recife) | Bastos-Filho, Carmelo (University of Pernambuco, Recife) | Menezes, Ronaldo (Florida Institute of Technology)

AAAI ConferencesMay-17-2018

The Multi-Objective Binary Fish School Search (MOBFSS) algorithm was proposed to solve optimization problems with two or three conflicting objectives and operating on discrete binary variables. The original proposal revealed good accuracy but it also exhibited a high computational cost. Here, we present strategies to obtain an improved version of MOBFSS that reaches lower Pareto fronts for minimization problems at a better computational cost. We also deploy local search procedures as proposed in BMOPSO-CDRLS to find solutions closer to the optimal solution. The achieved results outperform the state-of-art algorithms BMOPSO-CDR and BMOPSO-CDRLS in feature selection problems for hypervolume optimization. Hence, this paper contributes to the literature in Swarm Intelligence by introducing several algorithms that can be applied to improve feature selection in the context of classification programs.

feature selection, improved multi-objective binary fish school

AAAI Conferences

The Thirty-First International Flairs Conference

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.53)

Add feedback

Faster Rates for Convex-Concave Games

Abernethy, Jacob, Lai, Kevin A., Levy, Kfir Y., Wang, Jun-Kun

arXiv.org Machine LearningMay-17-2018

We consider the use of no-regret algorithms to compute equilibria for particular classes of convex-concave games. While standard regret bounds would lead to convergence rates on the order of $O(T^{-1/2})$, recent work \citep{RS13,SALS15} has established $O(1/T)$ rates by taking advantage of a particular class of optimistic prediction algorithms. In this work we go further, showing that for a particular class of games one achieves a $O(1/T^2)$ rate, and we show how this applies to the Frank-Wolfe method and recovers a similar bound \citep{D15}. We also show that such no-regret techniques can even achieve a linear rate, $O(\exp(-T))$, for equilibrium computation under additional curvature assumptions.

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

1805.06792

Country: Europe (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Independent Component Analysis via Energy-based and Kernel-based Mutual Dependence Measures

Jin, Ze, Matteson, David S.

arXiv.org Machine LearningMay-17-2018

We apply both distance-based (Jin and Matteson, 2017) and kernel-based (Pfister et al., 2016) mutual dependence measures to independent component analysis (ICA), and generalize dCovICA (Matteson and Tsay, 2017) to MDMICA, minimizing empirical dependence measures as an objective function in both deflation and parallel manners. Solving this minimization problem, we introduce Latin hypercube sampling (LHS) (McKay et al., 2000), and a global optimization method, Bayesian optimization (BO) (Mockus, 1994) to improve the initialization of the Newton-type local optimization method. The performance of MDMICA is evaluated in various simulation studies and an image data example. When the ICA model is correct, MDMICA achieves competitive results compared to existing approaches. When the ICA model is misspecified, the estimated independent components are less mutually dependent than the observed components using MDMICA, while they are prone to be even more mutually dependent than the observed components using other approaches.

artificial intelligence, mdmica, optimization problem, (17 more...)

arXiv.org Machine Learning

1805.06639

Country: Europe > Austria (0.28)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

Add feedback

Accelerating Nonnegative Matrix Factorization Algorithms using Extrapolation

Ang, Andersen Man Shun, Gillis, Nicolas

arXiv.org Machine LearningMay-17-2018

In this paper, we propose a general framework to accelerate significantly the algorithms for nonnegative matrix factorization (NMF). This framework is inspired from the extrapolation scheme used to accelerate gradient methods in convex optimization and from the method of parallel tangents. However, the use of extrapolation in the context of the two-block coordinate descent algorithms tackling the non-convex NMF problems is novel. We illustrate the performance of this approach on two state-of-the-art NMF algorithms, namely, accelerated hierarchical alternating least squares (A-HALS) and alternating nonnegative least squares (ANLS), using synthetic, image and document data sets.

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

1805.06604

Country: North America (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Add feedback

A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings

Artetxe, Mikel, Labaka, Gorka, Agirre, Eneko

arXiv.org Artificial IntelligenceMay-17-2018

Recent work has managed to learn cross-lingual word embeddings without parallel data by mapping monolingual embeddings to a shared space through adversarial training. However, their evaluation has focused on favorable conditions, using comparable corpora or closely-related languages, and we show that they often fail in more realistic scenarios. This work proposes an alternative approach based on a fully unsupervised initialization that explicitly exploits the structural similarity of the embeddings, and a robust self-learning algorithm that iteratively improves this solution. Our method succeeds in all tested scenarios and obtains the best published results in standard datasets, even surpassing previous supervised systems. Our implementation is released as an open source project at https://github.com/artetxem/vecmap

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

1805.06297

Country:

Europe (1.00)
North America > United States > California (0.46)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)

Add feedback

Analyzing high-dimensional time-series data using kernel transfer operator eigenfunctions

Klus, Stefan, Peitz, Sebastian, Schuster, Ingmar

arXiv.org Machine LearningMay-16-2018

Kernel transfer operators, which can be regarded as approximations of transfer operators such as the Perron-Frobenius or Koopman operator in reproducing kernel Hilbert spaces, are defined in terms of covariance and cross-covariance operators and have been shown to be closely related to the conditional mean embedding framework developed by the machine learning community. The goal of this paper is to show how the dominant eigenfunctions of these operators in combination with gradient-based optimization techniques can be used to detect long-lived coherent patterns in high-dimensional time-series data. The results will be illustrated using video data and a fluid flow example.

artificial intelligence, eigenfunction, upstream oil & gas, (16 more...)

arXiv.org Machine Learning

1805.10118

Country:

North America > United States (0.29)
Europe > Germany (0.14)
Africa (0.14)

Genre: Research Report (0.40)

Industry: Energy > Oil & Gas > Upstream (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback