Agents
Reinforcement Learning for Intelligent Healthcare Systems: A Comprehensive Survey
Abdellatif, Alaa Awad, Mhaisen, Naram, Chkirbene, Zina, Mohamed, Amr, Erbad, Aiman, Guizani, Mohsen
The rapid increase in the percentage of chronic disease patients along with the recent pandemic pose immediate threats on healthcare expenditure and elevate causes of death. This calls for transforming healthcare systems away from one-on-one patient treatment into intelligent health systems, to improve services, access and scalability, while reducing costs. Reinforcement Learning (RL) has witnessed an intrinsic breakthrough in solving a variety of complex problems for diverse applications and services. Thus, we conduct in this paper a comprehensive survey of the recent models and techniques of RL that have been developed/used for supporting Intelligent-healthcare (I-health) systems. This paper can guide the readers to deeply understand the state-of-the-art regarding the use of RL in the context of I-health. Specifically, we first present an overview for the I-health systems challenges, architecture, and how RL can benefit these systems. We then review the background and mathematical modeling of different RL, Deep RL (DRL), and multi-agent RL models. After that, we provide a deep literature review for the applications of RL in I-health systems. In particular, three main areas have been tackled, i.e., edge intelligence, smart core network, and dynamic treatment regimes. Finally, we highlight emerging challenges and outline future research directions in driving the future success of RL in I-health systems, which opens the door for exploring some interesting and unsolved problems.
Building a Foundation for Data-Driven, Interpretable, and Robust Policy Design using the AI Economist
Trott, Alexander, Srinivasa, Sunil, van der Wal, Douwe, Haneuse, Sebastien, Zheng, Stephan
Optimizing economic and public policy is critical to address socioeconomic issues and trade-offs, e.g., improving equality, productivity, or wellness, and poses a complex mechanism design problem. A policy designer needs to consider multiple objectives, policy levers, and behavioral responses from strategic actors who optimize for their individual objectives. Moreover, real-world policies should be explainable and robust to simulation-to-reality gaps, e.g., due to calibration issues. Existing approaches are often limited to a narrow set of policy levers or objectives that are hard to measure, do not yield explicit optimal policies, or do not consider strategic behavior, for example. Hence, it remains challenging to optimize policy in real-world scenarios. Here we show that the AI Economist framework enables effective, flexible, and interpretable policy design using two-level reinforcement learning (RL) and data-driven simulations. We validate our framework on optimizing the stringency of US state policies and Federal subsidies during a pandemic, e.g., COVID-19, using a simulation fitted to real data. We find that log-linear policies trained using RL significantly improve social welfare, based on both public health and economic outcomes, compared to past outcomes. Their behavior can be explained, e.g., well-performing policies respond strongly to changes in recovery and vaccination rates. They are also robust to calibration errors, e.g., infection rates that are over or underestimated. As of yet, real-world policymaking has not seen adoption of machine learning methods at large, including RL and AI-driven simulations. Our results show the potential of AI to guide policy design and improve social welfare amidst the complexity of the real world.
Communicative Learning with Natural Gestures for Embodied Navigation Agents with Human-in-the-Scene
Wu, Qi, Wu, Cheng-Ju, Zhu, Yixin, Joo, Jungseock
Human-robot collaboration is an essential research topic in artificial intelligence (AI), enabling researchers to devise cognitive AI systems and affords an intuitive means for users to interact with the robot. Of note, communication plays a central role. To date, prior studies in embodied agent navigation have only demonstrated that human languages facilitate communication by instructions in natural languages. Nevertheless, a plethora of other forms of communication is left unexplored. In fact, human communication originated in gestures and oftentimes is delivered through multimodal cues, e.g. "go there" with a pointing gesture. To bridge the gap and fill in the missing dimension of communication in embodied agent navigation, we propose investigating the effects of using gestures as the communicative interface instead of verbal cues. Specifically, we develop a VR-based 3D simulation environment, named Ges-THOR, based on AI2-THOR platform. In this virtual environment, a human player is placed in the same virtual scene and shepherds the artificial agent using only gestures. The agent is tasked to solve the navigation problem guided by natural gestures with unknown semantics; we do not use any predefined gestures due to the diversity and versatile nature of human gestures. We argue that learning the semantics of natural gestures is mutually beneficial to learning the navigation task--learn to communicate and communicate to learn. In a series of experiments, we demonstrate that human gesture cues, even without predefined semantics, improve the object-goal navigation for an embodied agent, outperforming various state-of-the-art methods.
Model-Based Opponent Modeling
Yu, Xiaopeng, Jiang, Jiechuan, Jiang, Haobin, Lu, Zongqing
When one agent interacts with a multi-agent environment, it is challenging to deal with various opponents unseen before. Modeling the behaviors, goals, or beliefs of opponents could help the agent adjust its policy to adapt to different opponents. In addition, it is also important to consider opponents who are learning simultaneously or capable of reasoning. However, existing work usually tackles only one of the aforementioned types of opponent. In this paper, we propose model-based opponent modeling (MBOM), which employs the environment model to adapt to all kinds of opponent. MBOM simulates the recursive reasoning process in the environment model and imagines a set of improving opponent policies. To effectively and accurately represent the opponent policy, MBOM further mixes the imagined opponent policies according to the similarity with the real behaviors of opponents. Empirically, we show that MBOM achieves more effective adaptation than existing methods in competitive and cooperative environments, respectively with different types of opponent, i.e., fixed policy, na\"ive learner, and reasoning learner.
Citizen crime app releases Protect, an on-demand subscription security feature
After months of testing, Citizen, the crime and neighborhood watch app, is releasing Protect, a subscription-based feature that lets users contact virtual agents for help if they feel they're in danger. According to Citizen, the feature can connect users with a Protect agent either through video, audio, or text available around the clock. The company said audio and text-only communication allows users to discreetly call for help "in difficult situations" where they might not be able to or are scared to be seen calling 911. Protect began beta testing earlier this year as the feature has been available to 100,000 users, Citizen said. The new feature comes as Citizen currently has more than 8 million users who have sent out more than billion alerts in major U.S. cities including New York, Los Angeles, Chicago, Atlanta, Houston and the San Francisco Bay Area.
Interesting Books to Read on Artificial Intelligence for Tech Enthusiasts
Artificial intelligence has made its place in all our lives, from correcting our bad grammar, personalizing our music on apps, to automating work in several industries. AI holds a massive potential to transform the future of work. But to understand this disruptive technology, the general public needs to have a working knowledge of the capabilities. To start slow and avoid the feeling of being overwhelming, here are 10 books that will help you grasp the concept. This book is beginner-friendly and gives a less technical overview of several AI topics.
Dynamic communication topologies for distributed heuristics in energy system optimization algorithms
Holly, Stefanie, Nieße, Astrid
ISTRIBUTED heuristics are a promising field for current and future energy systems control and optimization tasks, In [12] we showed that different communication topologies and have been designed and evaluated in recent years on have an effect on the performance of the reflected algorithm agent-based systems [1] [2] [3]. While conventional control class: Highly meshed topologies converged into good solutions systems - centralized or hierarchical in their control paradigm - reliably and quickly, but increased communication overhead and perfectly fit to centralized generation and transmission systems, premature convergence. In contrast, results for sparsely meshed distributed renewable energy systems show properties that topologies were much less reliable. In the application domain promote the application of distributed optimization systems: of energy systems as critical infrastructures, this behavior is First, future energy systems can be regarded as complex highly unwanted. We presume that dynamically adjusting the systems of systems, sometimes framed as cyber-physical multienergy topology during runtime leads to a beneficial transition of systems, coupling communication systems, power, heat exploration and exploitation of the search space for distributed and gas systems.
Goal Recognition for Deceptive Human Agents through Planning and Gaze
Le, Thao, Singh, Ronal, Miller, Tim
Eye gaze has the potential to provide insight into the minds of individuals, and this idea has been used in prior research to improve human goal recognition by combining human's actions and gaze. However, most existing research assumes that people are rational and honest. In adversarial scenarios, people may deliberately alter their actions and gaze, which presents a challenge to goal recognition systems. In this paper, we present new models for goal recognition under deception using a combination of gaze behaviour and observed movements of the agent. These models aim to detect when a person is deceiving by analysing their gaze patterns and use this information to adjust the goal recognition. We evaluated our models in two human-subject studies: (1) using data collected from 30 individuals playing a navigation game inspired by an existing deception study and (2) using data collected from 40 individuals playing a competitive game (Ticket To Ride). We found that one of our models (Modulated Deception Gaze Ontic) offers promising results compared to the previous state-of-the-art model in both studies. Our work complements existing adversarial goal recognition systems by equipping these systems with the ability to tackle ambiguous gaze behaviours.
Scheduling Aerial Vehicles in an Urban Air Mobility Scheme
Rigas, Emmanouil S., Kolios, Panayiotis, Ellinas, Georgios
Highly populated cities face several challenges, one of them being the intense traffic congestion. In recent years, the concept of Urban Air Mobility has been put forward by large companies and organizations as a way to address this problem, and this approach has been rapidly gaining ground. This disruptive technology involves aerial vehicles (AVs) for hire than can be utilized by customers to travel between locations within large cities. This concept has the potential to drastically decrease traffic congestion and reduce air pollution, since these vehicles typically use electric motors powered by batteries. This work studies the problem of scheduling the assignment of AVs to customers, having as a goal to maximize the serviced customers and minimize the energy consumption of the AVs by forcing them to fly at the lowest possible altitude. Initially, an Integer Linear Program (ILP) formulation is presented, that is solved offline and optimally, followed by a near-optimal algorithm, that solves the problem incrementally, one AV at a time, to address scalability issues, allowing scheduling in problems involving large numbers of locations, AVs, and customer requests.
The application of artificial intelligence in software engineering: a review challenging conventional wisdom
Batarseh, Feras A., Mohod, Rasika, Kumar, Abhinav, Bui, Justin
The field of artificial intelligence (AI) is witnessing a recent upsurge in research, tools development, and deployment of applications. Multiple software companies are shifting their focus to developing intelligent systems; and many others are deploying AI paradigms to their existing processes. In parallel, the academic research community is injecting AI paradigms to provide solutions to traditional engineering problems. Similarly, AI has evidently been proved useful to software engineering (SE). When one observes the SE phases (requirements, design, development, testing, release, and maintenance), it becomes clear that multiple AI paradigms (such as neural networks, machine learning, knowledge-based systems, natural language processing) could be applied to improve the process and eliminate many of the major challenges that the SE field has been facing. This survey chapter is a review of the most commonplace methods of AI applied to SE. The review covers methods between years 1975-2017, for the requirements phase, 46 major AI-driven methods are found, 19 for design, 15 for development, 68 for testing, and 15 for release and maintenance. Furthermore, the purpose of this chapter is threefold; firstly, to answer the following questions: is there sufficient intelligence in the SE lifecycle? What does applying AI to SE entail? Secondly, to measure, formulize, and evaluate the overlap of SE phases and AI disciplines. Lastly, this chapter aims to provide serious questions to challenging the current conventional wisdom (i.e., status quo) of the state-of-the-art, craft a call for action, and to redefine the path forward.