AITopics

2001.06627

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California (0.04)
(2 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Transportation (0.90)
Information Technology (0.66)

Technology:

Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

arXiv.org Artificial IntelligenceJan-18-2020

Smooth markets: A basic mechanism for organizing gradient-based learners

Balduzzi, David, Czarnecki, Wojciech M, Anthony, Thomas W, Gemp, Ian M, Hughes, Edward, Leibo, Joel Z, Piliouras, Georgios, Graepel, Thore

With the success of modern machine learning, it is becoming increasingly important to understand and control how learning algorithms interact. Unfortunately, negative results from game theory show there is little hope of understanding or controlling general n-player games. We therefore introduce smooth markets (SM-games), a class of n-player games with pairwise zero sum interactions. SM-games codify a common design pattern in machine learning that includes (some) GANs, adversarial training, and other recent algorithms. We show that SM-games are amenable to analysis and optimization using first-order methods.

conference paper, forecast, smgame, (15 more...)

2001.04678

Country:

North America > United States > New Jersey > Mercer County > Princeton (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Singapore (0.04)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Rădulescu, Roxana, Mannion, Patrick, Zhang, Yijie, Roijers, Diederik M., Nowé, Ann

A utility-based analysis of equilibria in multi-objective normal form games

arXiv.org Artificial IntelligenceJan-17-2020

Example application domains include urban and air traffic control (Mannion et al., 2016a; Yliniemi et al., 2015), autonomous vehicles (R adulescu et al., 2018; Talpert et al., 2019) and energy systems (Walraven and Spaan, 2016; Mannion et al., 2016b; Reymond et al., 2018). Although many such problems feature multiple conflicting objectives to optimise, most MAS research focuses on agents maximising their return w.r.t. a single objective. By contrast, in multi-objective multi-agent systems (MOMAS), agents explicitly consider the possible tradeoffs between conflicting objective functions. Agents in a MOMAS receive vector-valued payoffs for their actions, where each component of a payoff vector represents the performance on a different objective. Following the utility-based approach (Roijers et al., 2013), we assume that each agent has a utility function which maps vector-valued payoffs to scalar utility values. Compromises between competing objectives are then considered on the the basis of the utility that these tradeoffs have for the users of a MOMAS. The utility-based approach naturally leads to two different optimisation criteria for agents in a MOMAS: expected scalarised returns (ESR) and scalarised expected returns (SER). To date, the differences between the SER and ESR approaches have received little attention in multi-agent settings, despite having received some attention in single-agent settings (see e.g.

equilibria, equilibrium, utility function, (14 more...)

2001.08177

Country:

Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > Belgium (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre:

Overview (0.67)
Research Report (0.64)

Industry:

Energy (1.00)
Transportation > Infrastructure & Services (0.86)
Leisure & Entertainment > Games (0.68)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Papangelis, Alexandros, Namazifar, Mahdi, Khatri, Chandra, Wang, Yi-Chia, Molino, Piero, Tur, Gokhan

Plato Dialogue System: A Flexible Conversational AI Research Platform

arXiv.org Artificial IntelligenceJan-17-2020

As the field of Spoken Dialogue Systems and Conversational AI grows, so does the need for tools and environments that abstract away implementation details in order to expedite the development process, lower the barrier of entry to the field, and offer a common test-bed for new ideas. In this paper, we present Plato, a flexible Conversational AI platform written in Python that supports any kind of conversational agent architecture, from standard architectures to architectures with jointly-trained components, single- or multi-party interactions, and offline or online training of any conversational agent component. Plato has been designed to be easy to understand and debug and is agnostic to the underlying learning frameworks that train each component.

agent, conversational agent, plato, (16 more...)

2001.06463

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > France (0.04)
(7 more...)

Genre: Research Report (0.64)

Industry: Education > Educational Setting > Online (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Laureano-Cruces, Ana Lilia, Hernández-Domínguez, Laura, Mora-Torres, Martha, Torres-Moreno, Juan-Manuel, Cabrera-López, Jaime Enrique

Visual Simplified Characters' Emotion Emulator Implementing OCC Model

arXiv.org Artificial IntelligenceJan-17-2020

In this paper, we present a visual emulator of the emotions seen in characters in stories. This system is based on a simplified view of the cognitive structure of emotions proposed by Ortony, Clore and Collins (OCC Model). The goal of this paper is to provide a visual platform that allows us to observe changes in the characters' different emotions, and the intricate interrelationships between: 1) each character's emotions, 2) their affective relationships and actions, 3) The events that take place in the development of a plot, and 4) the objects of desire that make up the emotional map of any story. This tool was tested on stories with a contrasting variety of emotional and affective environments: Othello, Twilight, and Harry Potter, behaving sensibly and in keeping with the atmosphere in which the characters were immersed.

agent, emotion, reaction, (15 more...)

2001.0619

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
North America > United States > New York (0.04)
(3 more...)

Genre: Research Report (0.40)

Industry:

Leisure & Entertainment (1.00)
Education > Educational Setting (0.47)
Media > Theater (0.38)
Media > Film (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.94)

Bohlouli, Mahdi, Holland, Alexander, Fathi, Madjid

Knowledge Integration of Collaborative Product Design Using Cloud Computing Infrastructure

-- T he pivotal key for the success of manufacturing enterprises is sustainable and innovative product design and development. In collaborative design, stakehol ders are heterogeneously distributed chain - like . Due to the growing volume of data and knowledge, an effective management of the knowledge acquired in the product design and development is one of the key challenges facing most manufacturing enterprises. Opportunities for improving efficiency and performance of IT - based product design applications through centralization of resources such as knowledge and computation have increased in the last few years with maturation of technologies such as SOA, virtualization, grid computing, and /or cloud computing. The main focus of this paper is the concept of ongoing research in providing the knowledge integration service for collaborative product design and development using cloud computing infra structure . P otential s of the cloud computing to support the Knowledge integration functionalities as a Service by providing functionalities such as knowledge mapping, merging, searching, and transferring in product design procedure are described in this paper . Proposed knowledge integration services support users by giving real - time access to knowledge resources. The framework has the advantage of availability, efficiency, cost reduction, less time to result, and scalability . Changes made during the early design stage do not cause the significant increase in costs, while during the production stage, sharp increase in costs will occur since many blueprints, design documents or components would require re - work and re - design [ 5 ] . Today's research is focused on optimising the development methodologies to enable shorter time, lower costs and higher quality of the systems [ 2 ] . The pivotal key for the success of manufacturing enterprises is sustainable and innovative product design and development . In order to achieve this goal, it is required to have a real and deep knowledge of former and current procedures in the manufacturing enterprise [4] and future needs as well as customer feedback s and various stages of production cha in activities. Realization of an efficient knowledge transfer between different stakeholders of product development process such as linking customers and suppliers proactively throughout the entire value chain, and collaborating across boundaries in distri buted enterprise s is reinforcing this trend.

cloud, knowledge, product design, (14 more...)

2001.09796

Country:

Europe > Germany > Baden-Württemberg > Karlsruhe Region > Heidelberg (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
Europe > Hungary (0.04)
(4 more...)

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
(2 more...)

Elrakaiby, Yehia, Spoletini, Paola, Nuseibeh, Bashar

Optimal by Design: Model-Driven Synthesis of Adaptation Strategies for Autonomous Systems

--Many software systems have become too large and complex to be managed efficiently by human administrators, particularly when they operate in uncertain and dynamic environments and require frequent changes. Requirements-driven adaptation techniques have been proposed to endow systems with the necessary means to autonomously decide ways to satisfy their requirements. However, many current approaches rely on general-purpose languages, models and/or frameworks to design, develop and analyze autonomous systems. Unfortunately, these tools are not tailored towards the characteristics of adaptation problems in autonomous systems. D proposes a model (and a language) for the high-level description of the basic elements of self-adaptive systems, namely the system, capabilities, requirements and environment. Based on those elements, a Markov Decision Process (MDP) is constructed to compute the optimal strategy or the most rewarding system behavior . Furthermore, this defines a reflex controller that can ensure timely responses to changes. One novel feature of the framework is that it benefits both from goal-oriented techniques, developed for requirement elicitation, refinement and analysis, and synthesis capabilities and extensive research around MDPs, their extensions and tools. Our preliminary evaluation results demonstrate the practicality and advantages of the framework. Autonomous systems such as unmanned vehicles and robots play an increasingly relevant role in our societies. Many factors contribute to the complexity in the design and development of those systems. First, they typically operate in dynamic and uncontrollable environments [1]-[5]. Therefore, they must continuously adapt their configuration in response to changes, both in their operating environment and in themselves. Since the frequency of change cannot be controlled, decision-making must be almost instantaneous to ensure timely responses. From a design and management perspective, it is desirable to minimize the effort needed to design the system and to enable its runtime updating and maintenance. A promising technique to address those challenges is requirements-driven adaptation that endow systems with the necessary means to autonomously operate based on their requirements. Requirements are prescriptive statements of intent to be satisfied by cooperation of the agents forming the system [6]. They say what the system will do and not how it will do it [7].

controller, requirement, transition, (15 more...)

2001.08525

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > United Kingdom > England > Buckinghamshire > Milton Keynes (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.88)

Adversarially Guided Self-Play for Adopting Social Conventions

Tucker, Mycal, Zhou, Yilun, Shah, Julie

Robotic agents must adopt existing social conventions in order to be effective teammates. These social conventions, such as driving on the right or left side of the road, are arbitrary choices among optimal policies, but all agents on a successful team must use the same convention. Prior work has identified a method of combining self-play with paired input-output data gathered from existing agents in order to learn their social convention without interacting with them. We build upon this work by introducing a technique called Adversarial Self-Play (ASP) that uses adversarial training to shape the space of possible learned policies and substantially improves learning efficiency. ASP only requires the addition of unpaired data: a dataset of outputs produced by the social convention without associated inputs. Theoretical analysis reveals how ASP shapes the policy space and the circumstances (when behaviors are clustered or exhibit some other structure) under which it offers the greatest benefits. Empirical results across three domains confirm ASP's advantages: it produces models that more closely match the desired social convention when given as few as two paired datapoints.

agent, asp, social convention, (12 more...)

2001.05994

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.28)
North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report (0.83)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)

Arisaka, Ryuta, Ito, Takayuki

Broadening Label-based Argumentation Semantics with May-Must Scales

The semantics as to which set of arguments in a given argumentation graph may be acceptable (acceptability semantics) can be characterised in a few different ways. Among them, labelling-based approach allows for concise and flexible determination of acceptability statuses of arguments through assignment of a label indicating acceptance, rejection, or undecided to each argument. In this work, we contemplate a way of broadening it by accommodating may- and must- conditions for an argument to be accepted or rejected, as determined by the number(s) of rejected and accepted attacking arguments. We show that the broadened label-based semantics can be used to express more mild indeterminacy than inconsistency for acceptability judgement when, for example, it may be the case that an argument is accepted and when it may also be the case that it is rejected. We identify that finding which conditions a labelling satisfies for every argument can be an undecidable problem, which has an unfavourable implication to semantics. We propose to address this problem by enforcing a labelling to maximally respect the conditions, while keeping the rest that would necessarily cause non-termination labelled undecided.

argument, argumentation, rejection, (15 more...)

2001.0573

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Japan > Honshū > Chūbu > Aichi Prefecture > Nagoya (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (0.90)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)

Chawla, Ronshee, Sankararaman, Abishek, Ganesh, Ayalvadi, Shakkottai, Sanjay

The Gossiping Insert-Eliminate Algorithm for Multi-Agent Bandits

arXiv.org Machine LearningJan-15-2020

We consider a decentralized multi-agent Multi Armed Bandit (MAB) setup consisting of $N$ agents, solving the same MAB instance to minimize individual cumulative regret. In our model, agents collaborate by exchanging messages through pairwise gossip style communications. We develop two novel algorithms, where each agent only plays from a subset of all the arms. Agents use the communication medium to recommend only arm-IDs (not samples), and thus update the set of arms from which they play. We establish that, if agents communicate $\Omega(\log(T))$ times through any connected pairwise gossip mechanism, then every agent's regret is a factor of order $N$ smaller compared to the case of no collaborations. Furthermore, we show that the communication constraints only have a second order effect on the regret of our algorithm. We then analyze this second order term of the regret to derive bounds on the regret-communication tradeoffs. Finally, we empirically evaluate our algorithm and conclude that the insights are fundamental and not artifacts of our bounds. We also show a lower bound which gives that the regret scaling obtained by our algorithm cannot be improved even in the absence of any communication constraints. Our results demonstrate that even a minimal level of collaboration among agents greatly reduces regret for all agents.

agent, algorithm, communication budget, (14 more...)

arXiv.org Machine Learning

2001.05452

Country:

North America > United States > Texas > Travis County > Austin (0.04)
Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.04)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)