AITopics | Agents

Collaborating Authors

Agents

News Overviews Instructional Materials AI-Alerts Classics

Can AI simulations predict the future?

#artificialintelligenceNov-17-2021, 14:45:05 GMT

The recent U.S. backflip on Syria has certainly not helped the nation's residents. Before the Syrian Civil War in 2017, the estimated population was 22 million; today it is roughly five million fewer, with another six million "internally displaced." With Turkey launching an invasion, we can expect more Syrian citizens to become refugees. Beyond the occasional news feature inside of refugee camps, you hear very little about where Syrians end up, save when far right leaders demand it not be in their backyard. How can you tell if they will successfully integrate into the foreign populations they must seek aid from?

ai simulation predict, maai, syrian refugee, (5 more...)

#artificialintelligence

Country:

Asia > Middle East > Syria (0.27)
Asia > Middle East > Republic of Türkiye (0.25)
North America > Canada > Ontario > Toronto (0.05)
(2 more...)

Industry: Government (0.73)

Technology:

Information Technology > Communications > Social Media (0.53)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.35)

Add feedback

SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition

Mao, Hangyu, Wang, Chao, Hao, Xiaotian, Mao, Yihuan, Lu, Yiming, Wu, Chengjie, Hao, Jianye, Li, Dong, Tang, Pingzhong

arXiv.org Artificial IntelligenceNov-16-2021

The MineRL competition is designed for the development of reinforcement learning and imitation learning algorithms that can efficiently leverage human demonstrations to drastically reduce the number of environment interactions needed to solve the complex ObtainDiamond task with sparse rewards. To address the challenge, in this paper, we present SEIHAI, a Sample-efficient Hierarchical AI, that fully takes advantage of the human demonstrations and the task structure. Specifically, we split the task into several sequentially dependent subtasks, and train a suitable agent for each subtask using reinforcement learning and imitation learning. We further design a scheduler to select different agents for different subtasks automatically. SEIHAI takes the first place in the preliminary and final of the NeurIPS-2020 MineRL competition.

agent, competition, demonstration, (13 more...)

arXiv.org Artificial Intelligence

2111.08857

Country:

North America > United States > California > Alameda County > Oakland (0.04)
Asia > Middle East > Jordan (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)

Genre: Research Report (1.00)

Industry: Materials > Metals & Mining (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Add feedback

The Partially Observable History Process

Morrill, Dustin, Greenwald, Amy R., Bowling, Michael

arXiv.org Artificial IntelligenceNov-15-2021

We introduce the partially observable history process (POHP) formalism for reinforcement learning. POHP centers around the actions and observations of a single agent and abstracts away the presence of other players without reducing them to stochastic processes. Our formalism provides a streamlined interface for designing algorithms that defy categorization as exclusively single or multi-agent, and for developing theory that applies across these domains. We show how the POHP formalism unifies traditional models including the Markov decision process, the Markov game, the extensive-form game, and their partially observable extensions, without introducing burdensome technical machinery or violating the philosophical underpinnings of reinforcement learning. We illustrate the utility of our formalism by concisely exploring observable sequential rationality, re-deriving the extensive-form regret minimization (EFR) algorithm, and examining EFR's theoretical properties in greater generality.

agent, information state, pohp, (16 more...)

arXiv.org Artificial Intelligence

2111.08102

Country:

North America > United States > Florida > Hillsborough County > University (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.88)

Add feedback

Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics

Schubert, Ingmar, Driess, Danny, Oguz, Ozgur S., Toussaint, Marc

arXiv.org Artificial IntelligenceNov-15-2021

Applications of Reinforcement Learning (RL) in robotics are often limited by high data demand. On the other hand, approximate models are readily available in many robotics scenarios, making model-based approaches like planning a data-efficient alternative. Still, the performance of these methods suffers if the model is imprecise or wrong. In this sense, the respective strengths and weaknesses of RL and model-based planners are. In the present work, we investigate how both approaches can be integrated into one framework that combines their strengths. We introduce Learning to Execute (L2E), which leverages information contained in approximate plans to learn universal policies that are conditioned on plans. In our robotic manipulation experiments, L2E exhibits increased performance when compared to pure RL, pure planning, or baseline methods combining learning and planning.

agent, baseline, learning, (15 more...)

arXiv.org Artificial Intelligence

2111.07908

Country:

Europe > Germany > Baden-Württemberg > Stuttgart Region > Stuttgart (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > Germany > Berlin (0.04)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Add feedback

AI in Games: Techniques, Challenges and Opportunities

Yin, Qiyue, Yang, Jun, Ni, Wancheng, Liang, Bin, Huang, Kaiqi

arXiv.org Artificial IntelligenceNov-15-2021

With breakthrough of AlphaGo, AI in human-computer game has become a very hot topic attracting researchers all around the world, which usually serves as an effective standard for testing artificial intelligence. Various game AI systems (AIs) have been developed such as Libratus, OpenAI Five and AlphaStar, beating professional human players. In this paper, we survey recent successful game AIs, covering board game AIs, card game AIs, first-person shooting game AIs and real time strategy game AIs. Through this survey, we 1) compare the main difficulties among different kinds of games for the intelligent decision making field ; 2) illustrate the mainstream frameworks and techniques for developing professional level AIs; 3) raise the challenges or drawbacks in the current AIs for intelligent decision making; and 4) try to propose future trends in the games and intelligent decision making techniques. Finally, we hope this brief review can provide an introduction for beginners, inspire insights for researchers in the filed of AI in games.

agent, learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

2111.07631

Country:

North America > Canada > Alberta (0.14)
Asia > China > Beijing > Beijing (0.04)
North America > United States > Texas (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Leisure & Entertainment > Games > Chess (0.93)
Leisure & Entertainment > Games > Go (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

A Survey on AI Assurance

Batarseh, Feras A., Freeman, Laura

arXiv.org Artificial IntelligenceNov-14-2021

Artificial Intelligence (AI) algorithms are increasingly providing decision making and operational support across multiple domains. AI includes a wide library of algorithms for different problems. One important notion for the adoption of AI algorithms into operational decision process is the concept of assurance. The literature on assurance, unfortunately, conceals its outcomes within a tangled landscape of conflicting approaches, driven by contradicting motivations, assumptions, and intuitions. Accordingly, albeit a rising and novel area, this manuscript provides a systematic review of research works that are relevant to AI assurance, between years 1985 - 2021, and aims to provide a structured alternative to the landscape. A new AI assurance definition is adopted and presented and assurance methods are contrasted and tabulated. Additionally, a ten-metric scoring system is developed and introduced to evaluate and compare existing methods. Lastly, in this manuscript, we provide foundational insights, discussions, future directions, a roadmap, and applicable recommendations for the development and deployment of AI assurance.

artificial intelligence, assurance, intelligence, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1186/s40537-021-00445-7

2111.07505

Country:

North America > United States > Virginia > Arlington County > Arlington (0.04)
Europe > Finland > Central Finland > Jyväskylä (0.04)
North America > United States > Colorado (0.04)
(6 more...)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.45)
Instructional Material > Course Syllabus & Notes (0.45)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military (1.00)
Government > Regional Government > North America Government > United States Government (0.68)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
(11 more...)

Add feedback

Stefano Somenzi, Athics: On no-code AI and deploying conversational bots

#artificialintelligenceNov-13-2021, 23:20:17 GMT

No-code AI solutions are helping more businesses to get started on their AI journeys than ever. AI News caught up with Stefano Somenzi, CTO at Athics, to get his thoughts on no-code AI and the development of virtual agents. AI News: Do you think "no-code" will help more businesses to begin their AI journeys? Stefano Somenzi: The real advantage of "no code" is not just the reduced effort required for businesses to get things done, it is also centered around changing the role of the user who will build the AI solution. "No code" means that the AI solution is built not by a data scientist but by the process owner.

agent, athic, stefano somenzi, (12 more...)

#artificialintelligence

Country: Europe (0.06)

Genre: Personal > Interview (0.57)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.43)

Add feedback

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Chennakesavalu, Shriram, Rotskoff, Grant M.

arXiv.org Machine LearningNov-12-2021

Experimental advances enabling high-resolution external control create new opportunities to produce materials with exotic properties. In this work, we investigate how a multi-agent reinforcement learning approach can be used to design external control protocols for self-assembly. We find that a fully decentralized approach performs remarkably well even with a "coarse" level of external control. More importantly, we see that a partially decentralized approach, where we include information about the local environment allows us to better control our system towards some target distribution. We explain this by analyzing our approach as a partially-observed Markov decision process. With a partially decentralized approach, the agent is able to act more presciently, both by preventing the formation of undesirable structures and by better stabilizing target structures as compared to a fully decentralized approach.

information, protocol, reinforcement, (15 more...)

arXiv.org Machine Learning

2111.06875

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.05)
Europe > Sweden > Stockholm > Stockholm (0.05)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre: Research Report (0.50)

Industry: Energy (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.35)

Add feedback

Competing Models

Olea, Jose Luis Montiel, Ortoleva, Pietro, Pai, Mallesh M, Prat, Andrea

arXiv.org Artificial IntelligenceNov-11-2021

Different agents need to make a prediction. They observe identical data, but have different models: they predict using different explanatory variables. We study which agent believes they have the best predictive ability -- as measured by the smallest subjective posterior mean squared prediction error -- and show how it depends on the sample size. With small samples, we present results suggesting it is an agent using a low-dimensional model. With large samples, it is generally an agent with a high-dimensional model, possibly including irrelevant variables, but never excluding relevant ones. We apply our results to characterize the winning model in an auction of productive assets, to argue that entrepreneurs and investors with simple models will be over-represented in new sectors, and to understand the proliferation of "factors" that explain the cross-sectional variation of expected stock returns in the asset-pricing literature.

artificial intelligence, machine learning, modeling & simulation, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1093/qje/qjac015

1907.03809

Country:

North America > United States > Pennsylvania (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Government (1.00)
Banking & Finance > Trading (1.00)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback

Winning Solution of the AIcrowd SBB Flatland Challenge 2019-2020

Andreica, Mugurel-Ionut

arXiv.org Artificial IntelligenceNov-11-2021

This report describes the main ideas of the solution which won the AIcrowd SBB Flatland Challenge 2019-2020, with a score of 99% (meaning that, on average, 99% of the agents were routed to their destinations within the allotted time steps). The details of the task can be found on the competition's website. The solution consists of 2 major components: 1) A component which (re-)generates paths over a time-expanded graph for each agent 2) A component which updates the agent paths after a malfunction occurs, in order to try to preserve the same agent ordering of entering each cell as before the malfunction. The goal of this component is twofold: a) to (try to) avoid deadlocks b) to bring the system back to a consistent state (where each agent has a feasible path over the time-expanded graph) I am discussing both of these components, as well as a series of potentially promising, but unexplored ideas, below. The invariant for this component is that every agent always has an assigned path (where it will be located at each time step over the whole time horizon), and this component only tries to improve the overall path assignment). Initially, all the agents have a default path assigned which doesn't enter the environment at all (they always just stay at their initial location, outside the environment).

agent, algorithm, deadlock, (13 more...)

arXiv.org Artificial Intelligence

2111.07876

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback