Evolutionary Systems
A survey on policy search algorithms for learning robot controllers in a handful of trials
Chatzilygeroudis, Konstantinos, Vassiliades, Vassilis, Stulp, Freek, Calinon, Sylvain, Mouret, Jean-Baptiste
Most policy search algorithms require thousands of training episodes to find an effective policy, which is often infeasible with a physical robot. This survey article focuses on the extreme other end of the spectrum: how can a robot adapt with only a handful of trials (a dozen) and a few minutes? By analogy with the word "big-data", we refer to this challenge as "micro-data reinforcement learning". We show that a first strategy is to leverage prior knowledge on the policy structure (e.g., dynamic movement primitives), on the policy parameters (e.g., demonstrations), or on the dynamics (e.g., simulators). A second strategy is to create data-driven surrogate models of the expected reward (e.g., Bayesian optimization) or the dynamical model (e.g., model-based policy search), so that the policy optimizer queries the model instead of the real system. Overall, all successful micro-data algorithms combine these two strategies by varying the kind of model and prior knowledge. The current scientific challenges essentially revolve around scaling up to complex robots (e.g., humanoids), designing generic priors, and optimizing the computing time.
Why don't the modules dominate - Investigating the Structure of a Well-Known Modularity-Inducing Problem Domain
Qin, Zhenyue, McKay, Robert, Gedeon, Tom
Wagner's modularity inducing problem domain is a key contribution to the study of the evolution of modularity, including both evolutionary theory and evolutionary computation. We study its behavior under classical genetic algorithms. Unlike what we seem to observe in nature, the emergence of modularity is highly conditional and dependent, for example, on the eagerness of search. In nature, modular solutions generally dominate populations, whereas in this domain, modularity, when it emerges, is a relatively rare variant. Emergence of modularity depends heavily on random fluctuations in the fitness function; with a randomly varied but unchanging fitness function, modularity evolved far more rarely. Interestingly, high-fitness non-modular solutions could frequently be converted into even-higher-fitness modular solutions by manually removing all inter-module edges. Despite careful exploration, we do not yet have a full explanation of why the genetic algorithm was unable to find these better solutions.
Predicting A Better Future With Swarm Intelligence Big Cloud Recruitment
Have you put a bet on the FIFA World Cup? If yes, the chances are you've made a pretty educated guess, right? You know which team has the strongest players or most favourable odds. Or maybe you've put some cash on your country's team, (which normally I'd avoid England, but given their recent performance, I could be wrong to!) Either way, you might be best casting your bets in line with San Francisco based Unanimous AI. They use a technology called Swarm AI โ algorithms modelled on swarms in nature that amplifies human intelligence.
An Empirical Approach For Probing the Definiteness of Kernels
Zaefferer, Martin, Bartz-Beielstein, Thomas, Rudolph, Gรผnter
Models like support vector machines or Gaussian process regression often require positive semi-definite kernels. These kernels may be based on distance functions. While definiteness is proven for common distances and kernels, a proof for a new kernel may require too much time and effort for users who simply aim at practical usage. Furthermore, designing definite distances or kernels may be equally intricate. Finally, models can be enabled to use indefinite kernels. This may deteriorate the accuracy or computational cost of the model. Hence, an efficient method to determine definiteness is required. We propose an empirical approach. We show that sampling as well as optimization with an evolutionary algorithm may be employed to determine definiteness. We provide a proof-of-concept with 16 different distance measures for permutations. Our approach allows to disprove definiteness if a respective counter-example is found. It can also provide an estimate of how likely it is to obtain indefinite kernel matrices. This provides a simple, efficient tool to decide whether additional effort should be spent on designing/selecting a more suitable kernel or algorithm.
Using AI to Predict the Growth of Cities - DZone AI
Cities are some of the clearest and well-used examples of a complex system, and whilst we are certainly better than we have been at managing their growth, they are, to a large extent, unmanageable. A recent study by a team of Spanish researchers at the Universidade da Coruna highlights how AI can be used to better understand how cities grow and evolve, at least in a vertical sense. The researchers use an evolutionary algorithm that's trained on historical and economic data of an urban area to predict how the skyline could look in a few years time. The method was successfully deployed in the Minato Ward, in Tokyo. The team believes that cities grow in a similar way to self-organized biological systems.
Whale swarm algorithm with the mechanism of identifying and escaping from extreme point for multimodal function optimization
Zeng, Bing, Li, Xinyu, Gao, Liang, Zhang, Yuyan, Dong, Haozhen
Noname manuscript No. (will be inserted by the editor) Abstract Most real-world optimization problems often come with multiple global optima or local optima. Therefore, increasing niching metaheuristic algorithms, which devote to finding multiple optima in a single run, are developed to solve these multimodal optimization problems. However, there are two difficulties urgently to be solved for most existing niching metaheuristic algorithms: how to set the optimal values of niching parameters for different optimization problems, and how to jump out of the local optima efficiently. Based on Whale Swarm Algorithm (WSA) we proposed previously, this paper presents a new multimodal optimizer named WSA with Iterative Counter (WSA-IC) to address these two difficulties. In the one hand, WSA-IC improves the iteration rule of the original WSA for multimodal optimization, which removes the need of specifying different values of attenuation coefficient for different problems to form multiple subpopulations, without introducing any niching parameter. In the other hand, WSA-IC enables the identification of extreme point during iterations relying on two new parameters (i.e., stability threshold T Moreover, the convergence of WSA-IC is proved. Finally, the proposed WSA-IC is compared with several niching metaheuristic algorithms on CEC2015 niching benchmark test functions and five additional classical multimodal functions with high dimensions. The experimental results demonstrate that WSA-IC statistically outperforms other niching metaheuristic algorithms on most test functions. Keywords Whale swarm algorithm ยท multimodal optimization ยท metaheuristic algorithm ยท niching ยท extreme point 1 Introduction Most of the real-world optimization problems are multimodal [1-6], i.e., their objective functions often contain multiple global optima or local optima. In such a scenario, using metaheuristic algorithms, no matter evolutionary algorithms (EAs) or swarm based algorithms, to solve these problems has become a hot research topic, as they are easy to implement and can converge to as good as possible solutions.
Top 20 Best Free Books To Get Jump Started with Artificial Intelligence
This is one of only a handful couple of writings that consolidates three fundamental postulations in the investigation of rationale programming: the logic that gives logic programs their extraordinary character: the act of programming viably utilizing the logic; and the productive usage of logic software on PCs.
Travel Time Optimization With Machine Learning And Genetic Algorithm
What is the relationship between machine learning and optimization? On the other hand, what happens when machine learning is used to solve optimization problems? Consider this: a UPS driver with 25 packages has 15 trillion possible routes to choose from. And if each driver drives just one more mile each day than necessary, the company would be losing $30 million a year. While UPS would have all the data for their trucks and routes, there is no way they can run 15 trillion computations per each driver with 25 packages.
How Roboticists Are Copying Nature to Make Fantastical Machines
If nature knows what it's doing, it sure does a good job hiding it. Like, why would evolution produce an elephant with a shovel for a face? For very good reasons, as it turns out. Natural selection is an astoundingly creative phenomenon, molding species to fit their environments, even if that means turning their faces into shovels. It's also created a galaxy of ways for animals to move about, from walking to crawling to flying.
Guided evolutionary strategies: escaping the curse of dimensionality in random search
Maheswaranathan, Niru, Metz, Luke, Tucker, George, Sohl-Dickstein, Jascha
Many applications in machine learning require optimizing a function whose true gradient is unknown, but where surrogate gradient information (directions that may be correlated with, but not necessarily identical to, the true gradient) is available instead. This arises when an approximate gradient is easier to compute than the full gradient (e.g. in meta-learning or unrolled optimization), or when a true gradient is intractable and is replaced with a surrogate (e.g. in certain reinforcement learning applications, or when using synthetic gradients). We propose Guided Evolutionary Strategies, a method for optimally using surrogate gradient directions along with random search. We define a search distribution for evolutionary strategies that is elongated along a guiding subspace spanned by the surrogate gradients. This allows us to estimate a descent direction which can then be passed to a first-order optimizer. We analytically and numerically characterize the tradeoffs that result from tuning how strongly the search distribution is stretched along the guiding subspace, and we use this to derive a setting of the hyperparameters that works well across problems. Finally, we apply our method to example problems including truncated unrolled optimization and a synthetic gradient problem, demonstrating improvement over both standard evolutionary strategies and first-order methods that directly follow the surrogate gradient.