Agents
Multi-agent Soft Actor-Critic Based Hybrid Motion Planner for Mobile Robots
He, Zichen, Dong, Lu, Song, Chunwei, Sun, Changyin
In this paper, a novel hybrid multi-robot motion planner that can be applied under non-communication and local observable conditions is presented. The planner is model-free and can realize the end-to-end mapping of multi-robot state and observation information to final smooth and continuous trajectories. The planner is a front-end and back-end separated architecture. The design of the front-end collaborative waypoints searching module is based on the multi-agent soft actor-critic algorithm under the centralized training with decentralized execution diagram. The design of the back-end trajectory optimization module is based on the minimal snap method with safety zone constraints. This module can output the final dynamic-feasible and executable trajectories. Finally, multi-group experimental results verify the effectiveness of the proposed motion planner.
PantheonRL: A MARL Library for Dynamic Training Interactions
Sarkar, Bidipta, Talati, Aditi, Shih, Andy, Sadigh, Dorsa
We present PantheonRL, a multiagent reinforcement learning software package for dynamic training interactions such as round-robin, adaptive, and ad-hoc training. Our package is designed around flexible agent objects that can be easily configured to support different training interactions, and handles fully general multiagent environments with mixed rewards and n agents. Built on top of StableBaselines3, our package works directly with existing powerful deep RL algorithms. Finally, PantheonRL comes with an intuitive yet functional web user interface for configuring experiments and launching multiple asynchronous jobs. Our package can be found at https://github.com/Stanford-ILIAD/PantheonRL.
Sampling-Based Robust Control of Autonomous Systems with Non-Gaussian Noise
Badings, Thom S., Abate, Alessandro, Jansen, Nils, Parker, David, Poonawala, Hasan A., Stoelinga, Marielle
Controllers for autonomous systems that operate in safety-critical settings must account for stochastic disturbances. Such disturbances are often modelled as process noise, and common assumptions are that the underlying distributions are known and/or Gaussian. In practice, however, these assumptions may be unrealistic and can lead to poor approximations of the true noise distribution. We present a novel planning method that does not rely on any explicit representation of the noise distributions. In particular, we address the problem of computing a controller that provides probabilistic guarantees on safely reaching a target. First, we abstract the continuous system into a discrete-state model that captures noise by probabilistic transitions between states. As a key contribution, we adapt tools from the scenario approach to compute probably approximately correct (PAC) bounds on these transition probabilities, based on a finite number of samples of the noise. We capture these bounds in the transition probability intervals of a so-called interval Markov decision process (iMDP). This iMDP is robust against uncertainty in the transition probabilities, and the tightness of the probability intervals can be controlled through the number of samples. We use state-of-the-art verification techniques to provide guarantees on the iMDP, and compute a controller for which these guarantees carry over to the autonomous system. Realistic benchmarks show the practical applicability of our method, even when the iMDP has millions of states or transitions.
How is AI Revolutionizing the Telecommunications Industry?
Robotic Process Automation (RPA): Robotic Process Automation is a technology that configures computer software to capture data and manipulate applications in the way it is done by humans. With RPA telecommunication providers can automate back-end activities such as data entry, reconciliation, or validation, streamline customer support as well as perform cross-sell and up-sell utilizing AI-powered assisted calls. RPA applications allow CSPs to reduce costs, enhance accuracy, improve efficiency and deliver a better customer experience. Intelligent Virtual Agents: Intelligent Virtual Agents based on AI technologies gain traction in the telecommunication sector, resulting in improved customer experience and satisfaction. Telecom providers have turned to virtual assistance to optimize the processing of the huge number of support requests for troubleshooting, billing inquiries, maintenance, device settings, etc. AI-powered assistants handle all service-type questions and process transactions efficiently and at high speed.
Representing Knowledge as Predictions (and State as Knowledge)
This paper shows how a single mechanism allows knowledge to be constructed layer by layer directly from an agent's raw sensorimotor stream. This mechanism, the General Value Function (GVF) or "forecast," captures high-level, abstract knowledge as a set of predictions about existing features and knowledge, based exclusively on the agent's low-level senses and actions. Thus, forecasts provide a representation for organizing raw sensorimotor data into useful abstractions over an unlimited number of layers--a long-sought goal of AI and cognitive science. The heart of this paper is a detailed thought experiment providing a concrete, step-by-step formal illustration of how an artificial agent can build true, useful, abstract knowledge from its raw sensorimotor experience alone. The knowledge is represented as a set of layered predictions (forecasts) about the agent's observed consequences of its actions. This illustration shows twelve separate layers: the lowest consisting of raw pixels, touch and force sensors, and a small number of actions; the higher layers increasing in abstraction, eventually resulting in rich knowledge about the agent's world, corresponding roughly to doorways, walls, rooms, and floor plans. I then argue that this general mechanism may allow the representation of a broad spectrum of everyday human knowledge.
Multi-Agent Vulnerability Discovery for Autonomous Driving with Hazard Arbitration Reward
Liu, Weilin, Mu, Ye, Yu, Chao, Ning, Xuefei, Cao, Zhong, Wu, Yi, Liang, Shuang, Yang, Huazhong, Wang, Yu
Discovering hazardous scenarios is crucial in testing and further improving driving policies. However, conducting efficient driving policy testing faces two key challenges. On the one hand, the probability of naturally encountering hazardous scenarios is low when testing a well-trained autonomous driving strategy. Thus, discovering these scenarios by purely real-world road testing is extremely costly. On the other hand, a proper determination of accident responsibility is necessary for this task. Collecting scenarios with wrong-attributed responsibilities will lead to an overly conservative autonomous driving strategy. To be more specific, we aim to discover hazardous scenarios that are autonomous-vehicle responsible (AV-responsible), i.e., the vulnerabilities of the under-test driving policy. To this end, this work proposes a Safety Test framework by finding Av-Responsible Scenarios (STARS) based on multi-agent reinforcement learning. STARS guides other traffic participants to produce Av-Responsible Scenarios and make the under-test driving policy misbehave via introducing Hazard Arbitration Reward (HAR). HAR enables our framework to discover diverse, complex, and AV-responsible hazardous scenarios. Experimental results against four different driving policies in three environments demonstrate that STARS can effectively discover AV-responsible hazardous scenarios. These scenarios indeed correspond to the vulnerabilities of the under-test driving policies, thus are meaningful for their further improvements.
Towards Autonomous Satellite Communications: An AI-based Framework to Address System-level Challenges
Garau-Luis, Juan Jose, Eiskowitz, Skylar, Pachler, Nils, Crawley, Edward, Cameron, Bruce
The next generation of satellite constellations is designed to better address the future needs of our connected society: highly-variable data demand, mobile connectivity, and reaching more under-served regions. Artificial Intelligence (AI) and learning-based methods are expected to become key players in the industry, given the poor scalability and slow reaction time of current resource allocation mechanisms. While AI frameworks have been validated for isolated communication tasks or subproblems, there is still not a clear path to achieve fully-autonomous satellite systems. Part of this issue results from the focus on subproblems when designing models, instead of the necessary system-level perspective. In this paper we try to bridge this gap by characterizing the system-level needs that must be met to increase satellite autonomy, and introduce three AI-based components (Demand Estimator, Offline Planner, and Real Time Engine) that jointly address them. We first do a broad literature review on the different subproblems and identify the missing links to the system-level goals. In response to these gaps, we outline the three necessary components and highlight their interactions. We also discuss how current models can be incorporated into the framework and possible directions of future work.
5 amazing books about AI that you should be reading
I always think that people, especially in the scientific and engineering society, underestimate the importance of simple explanations of difficult concepts, especially concerning people that are new in the field; these books help on making difficult concepts seem not so difficult! Also, I've just published my new ebook on Amazon, and I'm already working to publish some other books across this year… keep in touch, follow me and let's do it together. If you want to go further on your learning journey, I've prepared for you an amazing list with more than 60 training courses about AI, Machine Learning, Deep Learning, and Data Science that you can do right now for free: If you want to continue to discover new resources and learn about AI, In my ebook (link below), I am sharing the best articles, websites, and free training courses online about Artificial Intelligence, Machine Learning, Deep Learning, Data Science, Business Intelligence, Analytics, and others to help you start learning and develop your career.
How Natural Language Programming and Conversational AI Are Taking on the Call Center
Natural language processing (NLP) and conversational AI are often used together with machine learning, natural language understanding (NLU) to create sophisticated applications that enable machines to communicate with human beings. This article will look at how NLP and conversational AI are being used to improve and enhance the Call Center. NLP is a technological process that facilitates the ability to convert text or speech into encoded, structured information. By using NLP and NLU, machines are able to understand human speech and can respond appropriately, which, in turn, enables humans to interact with them using conversational, natural speech patterns. Predictive algorithmic forecasting is a method of AI-based estimation in which statistical algorithms are provided with historical data in order to predict what is likely to happen in the future.
Towards autonomous artificial agents with an active self: modeling sense of control in situated action
Kahl, Sebastian, Wiese, Sebastian, Russwinkel, Nele, Kopp, Stefan
In this paper we present a computational modeling account of an active self in artificial agents. In particular we focus on how an agent can be equipped with a sense of control and how it arises in autonomous situated action and, in turn, influences action control. We argue that this requires laying out an embodied cognitive model that combines bottom-up processes (sensorimotor learning and fine-grained adaptation of control) with top-down processes (cognitive processes for strategy selection and decision-making). We present such a conceptual computational architecture based on principles of predictive processing and free energy minimization. Using this general model, we describe how a sense of control can form across the levels of a control hierarchy and how this can support action control in an unpredictable environment. We present an implementation of this model as well as first evaluations in a simulated task scenario, in which an autonomous agent has to cope with un-/predictable situations and experiences corresponding sense of control. We explore different model parameter settings that lead to different ways of combining low-level and high-level action control. The results show the importance of appropriately weighting information in situations where the need for low/high-level action control varies and they demonstrate how the sense of control can facilitate this.