AITopics | Optimization

Optimization

Fast Moving Natural Evolution Strategy for High-Dimensional Problems

arXiv.org Machine Learning, Jan-27-2022

In this work, we propose a new variant of natural evolution strategies (NES) for high-dimensional black-box optimization problems. The proposed method, CR-FM-NES, extends a recently proposed state-of-the-art NES, Fast Moving Natural Evolution Strategy (FM-NES), so that it is applicable to high-dimensional problems. CR-FM-NES builds on the idea of using a restricted representation of the covariance matrix instead of a full covariance matrix, while inheriting the efficiency of FM-NES. The restricted representation enables CR-FM-NES to update the parameters of a multivariate normal distribution in linear time and space complexity, which makes it applicable to high-dimensional problems. Our experimental results reveal that CR-FM-NES does not lose the efficiency of FM-NES; on the contrary, it achieves a significant speedup over FM-NES on some benchmark problems. Furthermore, our numerical experiments on 200-, 600-, and 1000-dimensional benchmark problems demonstrate that CR-FM-NES is more effective than the scalable baseline methods VD-CMA and Sep-CMA.
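
To illustrate why a restricted covariance representation gives linear cost, here is a minimal sketch of sampling from a normal distribution whose covariance is sigma^2 * D (I + v v^T) D, with D diagonal. This diagonal-plus-rank-one form is the one used by VD-CMA and is assumed here for illustration; the abstract itself does not spell out the representation, and the function names below are hypothetical rather than taken from the paper.

```python
import math
import random


def apply_sqrt_factor(z, v):
    """Return (I + v v^T)^(1/2) z using O(n) operations.

    For u = v / |v| and a = sqrt(1 + |v|^2) - 1 we have
    (I + a u u^T)^2 = I + v v^T, so the square root never
    needs to be formed as an n-by-n matrix.
    """
    norm_sq = sum(vi * vi for vi in v)
    if norm_sq == 0.0:
        return list(z)
    coeff = (math.sqrt(1.0 + norm_sq) - 1.0) / norm_sq
    proj = sum(vi * zi for vi, zi in zip(v, z))
    return [zi + coeff * proj * vi for zi, vi in zip(z, v)]


def sample_restricted_gaussian(mean, sigma, d, v, rng):
    """Draw x ~ N(mean, sigma^2 * D (I + v v^T) D) with D = diag(d).

    Time and memory are linear in the dimension because only the
    vectors mean, d and v are stored.
    """
    z = [rng.gauss(0.0, 1.0) for _ in mean]
    y = apply_sqrt_factor(z, v)
    return [m + sigma * di * yi for m, di, yi in zip(mean, d, y)]


if __name__ == "__main__":
    rng = random.Random(0)
    n = 1000  # one of the dimensions mentioned in the abstract
    x = sample_restricted_gaussian([0.0] * n, 1.0, [1.0] * n, [0.1] * n, rng)
    print(len(x))
```

A full covariance matrix would need O(n^2) storage and at least O(n^2) work per sample; the sketch above keeps both at O(n), which is the property the abstract attributes to the restricted representation.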

covariance matrix, cr-fm-nes, evolution strategy, (11 more...)

arXiv.org Machine Learning

2201.11422

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

DiGamma: Domain-aware Genetic Algorithm for HW-Mapping Co-optimization for DNN Accelerators

Kao, Sheng-Chun, Pellauer, Michael, Parashar, Angshuman, Krishna, Tushar

arXiv.org Artificial Intelligence, Jan-26-2022

The design of DNN accelerators includes two key parts: HW resource configuration and mapping strategy. Intensive research has been conducted to optimize each of them independently. Unfortunately, optimizing both together is extremely challenging due to the very large cross-coupled search space. To address this, in this paper, we propose a HW-Mapping co-optimization framework, an efficient encoding of the immense design space constructed by HW and Mapping, and a domain-aware genetic algorithm, named DiGamma, with specialized operators for improving search efficiency. We evaluate DiGamma with seven popular DNN models with different properties. Our evaluations show DiGamma can achieve (geomean) 3.0x and 10.0x speedup, compared to the best-performing baseline optimization algorithms, in edge and cloud settings, respectively.

algorithm, digamma, mapping, (16 more...)

arXiv.org Artificial Intelligence

2201.11220

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

DNNFuser: Generative Pre-Trained Transformer as a Generalized Mapper for Layer Fusion in DNN Accelerators

Kao, Sheng-Chun, Huang, Xiaoyu, Krishna, Tushar

arXiv.org Artificial Intelligence, Jan-26-2022

Dataflow/mapping determines the compute and energy efficiency of DNN accelerators. Many mappers have been proposed to tackle the intra-layer map-space. However, mappers for the inter-layer map-space (aka layer-fusion map-space) have rarely been discussed. In this work, we propose a mapper, DNNFuser, specifically focusing on this layer-fusion map-space. While existing SOTA DNN mapping explorations rely on search-based mappers, this is the first work, to the best of our knowledge, to propose a one-shot inference-based mapper. We leverage the well-known language model GPT as our DNN architecture to learn layer-fusion optimization as a sequence modeling problem. Further, the trained DNNFuser can generalize its knowledge and infer new solutions for unseen conditions. Within one inference pass, DNNFuser can infer solutions with performance comparable to those found by a highly optimized search-based mapper while being 66x-127x faster.

dnnfuser, mapper, transformer, (12 more...)

arXiv.org Artificial Intelligence

2201.11218

Country: North America > United States > Georgia > Fulton County > Atlanta (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.87)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.83)
(3 more...)

Minimax Demographic Group Fairness in Federated Learning

Papadaki, Afroditi, Martinez, Natalia, Bertran, Martin, Sapiro, Guillermo, Rodrigues, Miguel

arXiv.org Artificial Intelligence, Jan-25-2022

Machine learning models are increasingly being adopted to make decisions in a range of domains, such as finance, insurance, medical diagnosis, recruitment, and many more [2]. Therefore, we are often confronted with the need, sometimes imposed by regulatory bodies, to ensure that such machine learning models do not lead to decisions that discriminate against individuals from a certain demographic group. The development of machine learning models that are fair across different (demographic) groups has been well studied in traditional learning setups where a single entity is responsible for learning a model based on a local dataset holding data from individuals of the various groups. However, there are settings where the data representing different demographic groups is spread across multiple entities rather than concentrated on a single entity/server. For example, consider a scenario where various hospitals wish to learn a diagnostic machine learning model that is fair (or performs reasonably well) across different demographic groups, but each hospital may only hold training data from certain groups because, in view of its geo-location, it serves predominantly individuals of a given demographic [5]. This new setup and the conventional centralized one are depicted in Figure 1.

artificial intelligence, fedminmax, machine learning, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3531146.3533081

2201.08304

Country:

North America > United States > Alaska (0.04)
North America > United States > Virginia (0.04)
North America > United States > New York > New York County > New York City (0.04)
(3 more...)

Genre: Research Report (0.82)

Industry:

Health & Medicine (0.88)
Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

The First AI4TSP Competition: Learning to Solve Stochastic Routing Problems

Bliek, Laurens, da Costa, Paulo, Afshar, Reza Refaei, Zhang, Yingqian, Catshoek, Tom, Vos, Daniël, Verwer, Sicco, Schmitt-Ulms, Fynn, Hottung, André, Shah, Tapan, Sellmann, Meinolf, Tierney, Kevin, Perreault-Lafleur, Carl, Leboeuf, Caroline, Bobbio, Federico, Pepin, Justine, Silva, Warley Almeida, Gama, Ricardo, Fernandes, Hugo L., Zaefferer, Martin, López-Ibáñez, Manuel, Irurozki, Ekhine

arXiv.org Artificial Intelligence, Jan-25-2022

The TSP is one of the classical combinatorial optimization problems, with many variants inspired by real-world applications. This first competition asked the participants to develop algorithms to solve a time-dependent orienteering problem with stochastic weights and time windows (TD-OPSWTW). It focused on two types of learning approaches: surrogate-based optimization and deep reinforcement learning. In this paper, we describe the problem, the setup of the competition, the winning methods, and give an overview of the results. The winning methods described in this work have advanced the state-of-the-art in using AI for stochastic routing problems. Overall, by organizing this competition we have introduced routing problems as an interesting problem setting for AI researchers. The simulator of the problem has been made open-source and can be used by other researchers as a benchmark for new AI methods.

competition, node, travel time, (17 more...)

arXiv.org Artificial Intelligence

2201.10453

Country:

North America > Canada > Quebec > Montreal (0.14)
Europe > Netherlands > South Holland > Delft (0.04)
Europe > Netherlands > North Brabant > Eindhoven (0.04)
(6 more...)

Genre:

Research Report (1.00)
Workflow (0.93)
Overview (0.74)

Industry: Transportation (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Safe AI -- How is this Possible?

Rueß, Harald, Burton, Simon

arXiv.org Artificial Intelligence, Jan-25-2022

A new generation of increasingly autonomous and self-learning cyber-physical systems (CPS) is being developed for control applications in the real world. These systems are AI-based in that they leverage techniques from the field of artificial intelligence (AI) to flexibly cope with imprecision, inconsistency, and incompleteness, to learn from experience, and to adapt to changing and even unforeseen situations. This extra flexibility of AI systems makes it harder to predict their behavior. Moreover, AI systems are usually safety-critical in that they may cause real harm in (and to) the real world. Consequently, the central question regarding the development of such systems is how to handle or even overcome this basic dichotomy between unpredictable and safe behavior of AI systems. In other words, how can we best construct systems that exploit AI techniques, without incurring the frailties of "AI-like" behavior?

ai system, international conference, neural network, (15 more...)

arXiv.org Artificial Intelligence

2201.10436

Country:

North America > United States > Alaska (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia > India (0.04)

Genre: Research Report (0.64)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (1.00)
Automobiles & Trucks (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(3 more...)

AI-Aided Integrated Terrestrial and Non-Terrestrial 6G Solutions for Sustainable Maritime Networking

Saafi, Salwa, Vikhrova, Olga, Fodor, Gábor, Hosek, Jiri, Andreev, Sergey

arXiv.org Artificial Intelligence, Jan-25-2022

The maritime industry is experiencing a technological revolution that affects shipbuilding, operation of both seagoing and inland vessels, cargo management, and working practices in harbors. This ongoing transformation is driven by the ambition to make the ecosystem more sustainable and cost-efficient. Digitalization and automation help achieve these goals by transforming shipping and cruising into a much more cost- and energy-efficient, and decarbonized industry segment. The key enablers in these processes are always-available connectivity and content delivery services, which can not only aid shipping companies in improving their operational efficiency and reducing carbon emissions but also contribute to enhanced crew welfare and passenger experience. Due to recent advancements in integrating high-capacity and ultra-reliable terrestrial and non-terrestrial networking technologies, ubiquitous maritime connectivity is becoming a reality. To cope with the increased complexity of managing these integrated systems, this article advocates the use of artificial intelligence and machine learning-based approaches to meet the service requirements and energy efficiency targets in various maritime communications scenarios.

application, communication, operation, (16 more...)

arXiv.org Artificial Intelligence

2201.06947

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > Finland > Pirkanmaa > Tampere (0.05)
Europe > Czechia > South Moravian Region > Brno (0.05)
(7 more...)

Genre: Research Report (0.82)

Industry:

Telecommunications (1.00)
Information Technology (1.00)
Energy (1.00)
Transportation > Marine (0.89)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)
Information Technology > Architecture > Real Time Systems (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)

Convex Analysis of the Mean Field Langevin Dynamics

Nitanda, Atsushi, Wu, Denny, Suzuki, Taiji

arXiv.org Machine Learning, Jan-25-2022

As an example of the nonlinear Fokker-Planck equation, the mean field Langevin dynamics attracts attention due to its connection to (noisy) gradient descent on infinitely wide neural networks in the mean field regime, and hence the convergence property of the dynamics is of great theoretical interest. In this work, we give a simple and self-contained convergence rate analysis of the mean field Langevin dynamics with respect to the (regularized) objective function in both continuous and discrete time settings. The key ingredient of our proof is a proximal Gibbs distribution $p_q$ associated with the dynamics, which, in combination with techniques in [Vempala and Wibisono (2019)], allows us to develop a convergence theory parallel to classical results in convex optimization. Furthermore, we reveal that $p_q$ connects to the duality gap in the empirical risk minimization setting, which enables efficient empirical evaluation of the algorithm's convergence.

convergence, inequality, neural network, (11 more...)

arXiv.org Machine Learning

2201.10469

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > Middle East > Jordan (0.04)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Asia > Japan > Kyūshū & Okinawa > Kyūshū (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.36)

Sharpness-Aware Minimization

#artificialintelligence, Jan-24-2022, 14:10:31 GMT

This post deals with a recent optimization method for training neural networks, described in the paper Sharpness-Aware Minimization for Efficiently Improving Generalization by P. Foret et al. (December 2020). Honestly, the first time I read the paper's details, I really thought the procedure described therein (or something similar) had already been explored many years before by tons of people… I was even surprised to read that it worked in some contexts. Modern models are trained with optimization methods that rely only on the training loss. These models can easily memorize the training data and are prone to overfitting. They have more parameters than needed, and this large number of parameters provides no guarantee of proper generalization to the test set.
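
For readers who want to see the shape of the procedure, here is a minimal sketch of one sharpness-aware update as commonly summarized from Foret et al.: perturb the weights by rho * g / |g| along the current gradient, then take the descent step using the gradient evaluated at the perturbed point. The toy quadratic loss, the function names, and the values of `lr` and `rho` are illustrative assumptions, not taken from the post.

```python
import math


def sam_step(w, grad_fn, lr=0.1, rho=0.05):
    """One sharpness-aware step on the weight vector w (a list of floats).

    grad_fn(w) must return the gradient of the training loss at w.
    """
    g = grad_fn(w)
    norm = math.sqrt(sum(gi * gi for gi in g))
    if norm == 0.0:
        # Zero gradient: no ascent direction is defined, so leave w unchanged.
        return list(w)
    # Ascent step to the (approximate) worst point within radius rho.
    w_adv = [wi + rho * gi / norm for wi, gi in zip(w, g)]
    # Descent step from the ORIGINAL weights, using the perturbed gradient.
    g_adv = grad_fn(w_adv)
    return [wi - lr * gi for wi, gi in zip(w, g_adv)]


def quad_grad(w):
    """Gradient of the toy loss 0.5 * |w|^2 (an assumption for the demo)."""
    return list(w)


if __name__ == "__main__":
    w = [1.0, -2.0]
    for _ in range(3):
        w = sam_step(w, quad_grad)
    print(w)
```

Each step costs two gradient evaluations instead of one, which is the main practical overhead of this kind of update compared with plain gradient descent.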

argument, population loss, sharpness-aware minimization, (14 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.31)

Evolutionary Computation for Expensive Optimization: A Survey - Machine Intelligence Research

#artificialintelligence, Jan-24-2022, 03:49:20 GMT

Expensive optimization problems (EOPs) arise widely in significant real-world applications. However, evaluating candidate solutions for an EOP incurs expensive or even unaffordable costs, which makes it costly for an algorithm to find a satisfactory solution. Moreover, due to fast-growing application demands in the economy and society, such as the emergence of smart cities, the internet of things, and the big data era, solving EOPs more efficiently has become increasingly essential in various fields, which poses great challenges to the problem-solving ability of optimization approaches for EOPs. Among various optimization approaches, evolutionary computation (EC) is a promising global optimization tool that has been widely used to solve EOPs efficiently in the past decades. Given the fruitful advancements of EC for EOPs, it is essential to review these advancements in order to synthesize previous research experience and provide references to aid the development of relevant research fields and real-world applications.

evolutionary computation, expensive optimization, machine intelligence research, (6 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
