AITopics

1810.1246

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.35)

Erdogdu, Murat A., Mackey, Lester, Shamir, Ohad

Global Non-convex Optimization with Discretized Diffusions

arXiv.org Machine LearningOct-29-2018

An Euler discretization of the Langevin diffusion is known to converge to the global minimizers of certain convex and non-convex optimization problems. We show that this property holds for any suitably smooth diffusion and that different diffusions are suitable for optimizing different classes of convex and non-convex functions. This allows us to design diffusions suitable for globally optimizing convex and non-convex functions not covered by the existing Langevin theory. Our non-asymptotic analysis delivers computable optimization and integration error bounds based on easily accessed properties of the objective and chosen diffusion. Central to our approach are new explicit Stein factor bounds on the solutions of Poisson equations. We complement these results with improved optimization guarantees for targets other than the standard Gibbs measure.

artificial intelligence, diffusion, machine learning, (19 more...)

1810.12361

Country: North America > Canada (0.45)

Genre: Research Report (0.63)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)

arXiv.org Artificial IntelligenceOct-29-2018

Staff dimensioning in homecare services with uncertain demands

Rodriguez, C., Garaix, Thierry, Xie, X., Augusto, V.

The problem addressed in this paper is how to calculate the amount of personnel required to ensure the activity of a home health care (HHC) center on a tactical horizon. Design of quantitative approaches for this question is challenging. The number of caregivers has to be determined for each profession in order to balance the coverage of patients in a region and the workforce cost over several months. Unknown demand in care and spatial dimensions, combination of skills to cover a care and individual trips visiting patients make the underlaying optimization problem very hard. Few studies are dedicated to staff dimensioning for HHC compared to patient to nurses assignment/sequencing and centers location problems. We propose an original two-stage approach based on integer linear stochastic programming, that exploits historical medical data. The first stage calculates (near-)optimal levels of resources for possible demand scenarios , while the second stage computes the optimal number of caregiver for each profession to meet a target coverage indicator. For decision-makers, our algorithm gives the number of employees for each category required to satisfy the demand without any recourse (overtime, external resources) with fixed probability and confidence interval. The approach has been tested on various instances built from data of the French agency of hospitalization data (ATIH).

artificial intelligence, machine learning, optimization problem, (20 more...)

doi: 10.1080/00207543.2015.1081427

1811.06363

Country:

North America (0.67)
Europe > France (0.48)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
Government > Regional Government > Europe Government > France Government (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.93)

Eriksson, David, Dong, Kun, Lee, Eric Hans, Bindel, David, Wilson, Andrew Gordon

Scaling Gaussian Process Regression with Derivatives

arXiv.org Artificial IntelligenceOct-29-2018

Gaussian processes (GPs) with derivatives are useful in many applications, including Bayesian optimization, implicit surface reconstruction, and terrain reconstruction. Fitting a GP to function values and derivatives at $n$ points in $d$ dimensions requires linear solves and log determinants with an ${n(d+1) \times n(d+1)}$ positive definite matrix -- leading to prohibitive $\mathcal{O}(n^3d^3)$ computations for standard direct methods. We propose iterative solvers using fast $\mathcal{O}(nd)$ matrix-vector multiplications (MVMs), together with pivoted Cholesky preconditioning that cuts the iterations to convergence by several orders of magnitude, allowing for fast kernel learning and prediction. Our approaches, together with dimensionality reduction, enables Bayesian optimization with derivatives to scale to high-dimensional problems and large evaluation budgets.

artificial intelligence, machine learning, modeling & simulation, (16 more...)

1810.12283

Country: North America (0.46)

Genre: Research Report (0.64)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

arXiv.org Artificial IntelligenceOct-29-2018

Incorporating Diversity into Influential Node Mining

Zhang, Yu, Xu, Frank F., Lyu, Tianshu, Ren, Xiang, Han, Jiawei

Diversity is a crucial criterion in many ranking and mining tasks. In this paper, we study how to incorporate node diversity into influence maximization (IM). We consider diversity as a reverse measure of the average similarity between selected nodes, which can be specified using node embedding or community detection results. Our goal is to identify a set of nodes which are simultaneously influential and diverse. Three most commonly used utilities in economics (i.e., Perfect Substitutes, Perfect Complements, and Cobb-Douglas) are proposed to jointly model influence spread and diversity as two factors. We formulate diversified IM as an optimization problem of these utilities, for which we present two approximation algorithms based on non-monotonic submodular maximization and traditional IM respectively. Experimental results show that our diversified IM framework outperforms other natural heuristics, such as embedding and diversified ranking, both in utility maximization and result diversification.

artificial intelligence, machine learning, optimization problem, (17 more...)

1810.05959

Country: North America > United States > California (0.28)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.93)
Information Technology > Data Science (0.90)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.88)

arXiv.org Artificial IntelligenceOct-28-2018

Differential Evolution with Nearest & Better Option for Function Optimization

Dong, Haozhen, Gao, Liang, Li, Xinyu, Zhong, Haoran, Zeng, Bing

Abstract--Differential evolution is the conventional algorithm with the fastest convergence speed, but it may be trapped local optimal solution easily, so many researchers devote themselves into improve DE. Whale swarm algorithm (WSA) is a new algorithm with niching strategy we proposed previously, it's featured with simple mutation strategy and powerful global search capability, but for functions with high dimensions, it converges slower than conventional algorithms. Based on this fact, we proposed a new DE algorithm, called DE with nearest & better option (NbDE). In order to evaluate the performance of NbDE, we compare NbDE with several meta-heuristic algorithms in nine classical benchmark functions with different dimensions. The result have shown that NbDE outperforms other algorithms in convergence speed and accuracy.

artificial intelligence, evolutionary algorithm, machine learning, (12 more...)

1812.07608

Country: Asia > China > Hubei Province (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.91)

Struminsky, Kirill, Lacoste-Julien, Simon, Osokin, Anton

Quantifying Learning Guarantees for Convex but Inconsistent Surrogates

arXiv.org Machine LearningOct-26-2018

We study consistency properties of machine learning methods based on minimizing convex surrogates. We extend the recent framework of Osokin et al. (2017) for the quantitative analysis of consistency properties to the case of inconsistent surrogates. Our key technical contribution consists in a new lower bound on the calibration function for the quadratic surrogate, which is non-trivial (not always zero) for inconsistent cases. The new bound allows to quantify the level of inconsistency of the setting and shows how learning with inconsistent surrogates can have guarantees on sample complexity and optimization difficulty. We apply our theory to two concrete cases: multi-class classification with the tree-structured loss and ranking with the mean average precision loss. The results show the approximation-computation trade-offs caused by inconsistent surrogates and their potential benefits.

artificial intelligence, machine learning, optimization problem, (20 more...)

1810.11544

Country:

Europe > Russia (0.28)
Asia (0.28)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

arXiv.org Machine LearningOct-26-2018

Empirical Evaluation of Contextual Policy Search with a Comparison-based Surrogate Model and Active Covariance Matrix Adaptation

Fabisch, Alexander

Contextual policy search (CPS) is a class of multi-task reinforcement learning algorithms that is particularly useful for robotic applications. A recent state-of-the-art method is Contextual Covariance Matrix Adaptation Evolution Strategies (C-CMA-ES). It is based on the standard black-box optimization algorithm CMA-ES. There are two useful extensions of CMA-ES that we will transfer to C-CMA-ES and evaluate empirically: ACM-ES, which uses a comparison-based surrogate model, and aCMA-ES, which uses an active update of the covariance matrix. We will show that improvements with these methods can be impressive in terms of sample-efficiency, although this is not relevant any more for the robotic domain.

evolutionary algorithm, machine learning, reinforcement learning, (16 more...)

1810.11491

Country: North America > Canada (0.28)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.67)

arXiv.org Machine LearningOct-26-2018

Benefits of over-parameterization with EM

Xu, Ji, Hsu, Daniel, Maleki, Arian

Expectation Maximization (EM) is among the most popular algorithms for maximum likelihood estimation, but it is generally only guaranteed to find its stationary points of the log-likelihood objective. The goal of this article is to present theoretical and empirical evidence that over-parameterization can help EM avoid spurious local optima in the log-likelihood. We consider the problem of estimating the mean vectors of a Gaussian mixture model in a scenario where the mixing weights are known. Our study shows that the global behavior of EM, when one uses an over-parameterized model in which the mixing weights are treated as unknown, is better than that when one uses the (correct) model with the mixing weights fixed to the known values. For symmetric Gaussians mixtures with two components, we prove that introducing the (statistically redundant) weight parameters enables EM to find the global maximizer of the log-likelihood starting from almost any initial mean parameters, whereas EM without this over-parameterization may very often fail. For other Gaussian mixtures, we provide empirical evidence that shows similar behavior. Our results corroborate the value of over-parameterization in solving non-convex optimization problems, previously observed in other domains.

artificial intelligence, converge, machine learning, (20 more...)

1810.11344

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.54)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

arXiv.org Machine LearningOct-25-2018

Efficient Load Sampling for Worst-Case Structural Analysis Under Force Location Uncertainty

Wang, Yining, Ulu, Erva, Singh, Aarti, Kara, Levent Burak

An important task in structural design is to quantify the structural performance of an object under the external forces it may experience during its use. The problem proves to be computationally very challenging as the external forces' contact locations and magnitudes may exhibit significant variations. We present an efficient analysis approach to determine the most critical force contact location in such problems with force location uncertainty. Given an input 3D model and regions on its boundary where arbitrary normal forces may make contact, our algorithm predicts the worst-case force configuration responsible for creating the highest stress within the object. Our approach uses a computationally tractable experimental design method to select number of sample force locations based on geometry only, without inspecting the stress response that requires computationally expensive finite-element analysis. Then, we construct a simple regression model on these samples and corresponding maximum stresses. Combined with a simple ranking based post-processing step, our method provides a practical solution to worst-case structural analysis problem. The results indicate that our approach achieves significant improvements over the existing work and brute force approaches. We demonstrate that further speed- up can be obtained when small amount of an error tolerance in maximum stress is allowed.

algorithm, artificial intelligence, machine learning, (15 more...)

1810.10977

Country:

North America > United States (0.28)
North America > Canada > Quebec (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.36)