AITopics | Agents

Agents

Trustworthiness and Safety for Intelligent Ethical Logical Agents via Interval Temporal Logic and Runtime Self-Checking

Costantini, Stefania | Gasperis, Giovanni De | Dyoub, Abeer | Pitoni, Valentina (all Università degli Studi dell'Aquila)

AAAI Conferences, Mar-21-2018

Implementing Machine Ethics in Intelligent Agents involves trustworthiness and safety, meaning that agents should do what they are expected to do (at least with respect to high-priority goals, even in case of malfunctioning of any kind) and should not behave in unexpected, potentially harmful ways. These topics are strongly related to "assurance", i.e., ensuring that system users can rely upon the system. This paper deals with assurance of logical agent systems via temporal-logic-based runtime self-monitoring and checking.

artificial intelligence, intelligent ethical logical agent, temporal logic and runtime self-checking, (2 more...)

AAAI Conferences

2018 AAAI Spring Symposium Series

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.73)

SUCAG: Stochastic Unbiased Curvature-aided Gradient Method for Distributed Optimization

Wai, Hoi-To, Freris, Nikolaos M., Nedic, Angelia, Scaglione, Anna

arXiv.org Machine Learning, Mar-21-2018

We propose and analyze a new stochastic gradient method, which we call Stochastic Unbiased Curvature-aided Gradient (SUCAG), for finite sum optimization problems. SUCAG constitutes an unbiased total gradient tracking technique that uses Hessian information to accelerate convergence. We analyze our method under the general asynchronous model of computation, in which functions are selected infinitely often, but with delays that can grow sublinearly. For strongly convex problems, we establish linear convergence for the SUCAG method. When the initialization point is sufficiently close to the optimal solution, the established convergence rate is only dependent on the condition number of the problem, making it strictly faster than the known rate for the SAGA method. Furthermore, we describe a Markov-driven approach of implementing the SUCAG method in a distributed asynchronous multi-agent setting, via gossiping along a random walk on the communication graph. We show that our analysis applies as long as the undirected graph is connected and, notably, establishes an asymptotic linear convergence rate that is robust to the graph topology. Numerical results demonstrate the merit of our algorithm over existing methods.

artificial intelligence, machine learning, optimization problem, (16 more...)

arXiv.org Machine Learning

1803.08198

Country: North America > United States (0.68)

Genre:

Instructional Material > Course Syllabus & Notes (0.68)
Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.35)

Swarm Optimization: Goodbye Gradients

@machinelearnbot, Mar-19-2018, 03:26:46 GMT

These combinations of real-time biological systems can blend knowledge, exploration, and exploitation to unify intelligence and solve problems more efficiently. These simple agents interact locally, within their environment, and new behaviors emerge from the group as a whole. In the world of evolutionary algorithms, one such nature-inspired method is particle swarm optimization (PSO). It is a swarm-intelligence-based computational technique that finds an approximate solution to a problem by iteratively moving candidate solutions (called particles) through the search space, with regard to a given measure of quality, toward a global optimum. The movements of the particles are guided by their own best known position in the search space as well as the entire swarm's best known position.
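The update rule described above can be sketched in a few lines of Python. This is a minimal illustration of the textbook PSO formulation, not code from the linked article: the function name `pso_minimize`, the coefficient values (`inertia=0.7`, `c_personal=1.5`, `c_global=1.5`), and the `sphere` test function are all assumptions chosen for the example.

```python
import random


def pso_minimize(f, bounds, n_particles=30, n_iters=200,
                 inertia=0.7, c_personal=1.5, c_global=1.5, seed=0):
    """Minimize f over a box given as [(low, high), ...] with a basic PSO.

    All parameter defaults are illustrative assumptions, not tuned values.
    """
    rng = random.Random(seed)
    dim = len(bounds)

    # Random initial positions inside the box, zero initial velocities.
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]

    # Each particle remembers its own best position and value.
    best_pos = [p[:] for p in pos]
    best_val = [f(p) for p in pos]

    # The swarm remembers the best position found by any particle.
    g = min(range(n_particles), key=lambda i: best_val[i])
    swarm_pos, swarm_val = best_pos[g][:], best_val[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity blends momentum, pull toward the particle's own
                # best position, and pull toward the swarm's best position.
                vel[i][d] = (inertia * vel[i][d]
                             + c_personal * r1 * (best_pos[i][d] - pos[i][d])
                             + c_global * r2 * (swarm_pos[d] - pos[i][d]))
                lo, hi = bounds[d]
                # Move, then clamp back into the search box.
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = f(pos[i])
            if val < best_val[i]:
                best_pos[i], best_val[i] = pos[i][:], val
                if val < swarm_val:
                    swarm_pos, swarm_val = pos[i][:], val
    return swarm_pos, swarm_val


def sphere(x):
    """Simple convex test function with its minimum of 0 at the origin."""
    return sum(v * v for v in x)


if __name__ == "__main__":
    best, value = pso_minimize(sphere, [(-5.0, 5.0)] * 2)
    print(best, value)
```

Note that no gradient of `f` is ever evaluated; the swarm only compares function values, which is the sense in which the method does without gradients.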

artificial intelligence, particle, upstream oil & gas, (16 more...)

@machinelearnbot

Industry: Energy > Oil & Gas > Upstream (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents

Machado, Marlos C., Bellemare, Marc G., Talvitie, Erik, Veness, Joel, Hausknecht, Matthew, Bowling, Michael

Journal of Artificial Intelligence Research, Mar-19-2018

The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge of building AI agents with general competency across dozens of Atari 2600 games. It supports a variety of different problem settings and it has been receiving increasing attention from the scientific community, leading to some high-profile success stories such as the much publicized Deep Q-Networks (DQN). In this article we take a big picture look at how the ALE is being used by the research community. We show how diverse the evaluation methodologies in the ALE have become with time, and highlight some key concerns when evaluating agents in the ALE. We use this discussion to present some methodological best practices and provide new benchmark results using these best practices. To further the progress in the field, we introduce a new version of the ALE that supports multiple game modes and provides a form of stochasticity we call sticky actions. We conclude this big picture look by revisiting challenges posed when the ALE was introduced, summarizing the state-of-the-art in various problems and highlighting problems that remain open.
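The abstract names sticky actions without defining them. As commonly described for the ALE, the idea is that at each step, with some fixed probability, the environment executes the previously executed action instead of the one the agent just chose. The wrapper below is a hedged sketch of that idea only: the class name `StickyActions`, the `step_fn` callable, and the default probability of 0.25 are assumptions for illustration, not the ALE's actual API.

```python
import random


class StickyActions:
    """Wrap a step function so that actions are sometimes repeated.

    `step_fn` is a hypothetical callable taking an action and returning
    whatever the underlying environment returns for one step.
    """

    def __init__(self, step_fn, repeat_prob=0.25, initial_action=0, seed=None):
        self._step_fn = step_fn
        self._repeat_prob = repeat_prob
        self._prev_action = initial_action
        self._rng = random.Random(seed)

    def step(self, action):
        # With probability repeat_prob, ignore the agent's choice and
        # execute the previously executed action again.
        if self._rng.random() < self._repeat_prob:
            action = self._prev_action
        # Remember the action that was actually executed, so stickiness
        # can persist across several consecutive steps.
        self._prev_action = action
        return self._step_fn(action)


if __name__ == "__main__":
    # Toy "environment" that simply echoes the executed action.
    env = StickyActions(lambda a: a, repeat_prob=0.25, seed=0)
    print([env.step(a) for a in [1, 2, 3, 4, 5]])
```

Because the agent cannot know in advance which of its actions will be replaced, it cannot succeed by memorizing a fixed action sequence in an otherwise deterministic game.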

agent, bellemare, learning, (16 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.5699

AI Access Foundation

11182

Country:

North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
Europe > Sweden > Skåne County > Malmö (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(3 more...)

Genre: Research Report > New Finding (0.92)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Leisure & Entertainment > Sports (0.93)
Education (0.85)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines

Wu, Cathy, Rajeswaran, Aravind, Duan, Yan, Kumar, Vikash, Bayen, Alexandre M, Kakade, Sham, Mordatch, Igor, Abbeel, Pieter

arXiv.org Machine Learning, Mar-19-2018

Policy gradient methods have enjoyed great success in deep reinforcement learning but suffer from high variance of gradient estimates. The high variance problem is particularly exacerbated in problems with long horizons or high-dimensional action spaces. To mitigate this issue, we derive a bias-free action-dependent baseline for variance reduction which fully exploits the structural form of the stochastic policy itself and does not make any additional assumptions about the MDP. We demonstrate and quantify the benefit of the action-dependent baseline through both theoretical analysis and numerical results, including an analysis of the suboptimality of the optimal state-dependent baseline. The result is a computationally efficient policy gradient algorithm, which scales to high-dimensional control problems, as demonstrated by a synthetic 2000-dimensional target matching task. Our experimental results indicate that action-dependent baselines allow for faster learning on standard reinforcement learning benchmarks and high-dimensional hand manipulation and synthetic tasks. Finally, we show that the general idea of including additional information in baselines for improved variance reduction can be extended to partially observed and multi-agent tasks.
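To make the role of a baseline concrete, the sketch below works through the simplest case: a single-state problem with a softmax policy and a constant baseline. It does not implement the paper's action-dependent factorized baseline; it only illustrates the underlying fact that subtracting a baseline leaves the expected score-function gradient unchanged while it can shrink the variance. The function names and the reward values are assumptions for the example, and expectations are computed exactly by enumerating actions rather than by sampling.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def estimator_stats(logits, rewards, baseline):
    """Exact mean and total variance of the score-function estimator
    grad log pi(a) * (r(a) - baseline), with a drawn from softmax(logits).

    The variance returned is the trace of the estimator's covariance.
    """
    probs = softmax(logits)
    n = len(probs)
    mean = [0.0] * n
    second_moment = 0.0
    for a in range(n):
        # Gradient of log pi(a) with respect to the logits: e_a - pi.
        score = [(1.0 if k == a else 0.0) - probs[k] for k in range(n)]
        sample = [s * (rewards[a] - baseline) for s in score]
        for k in range(n):
            mean[k] += probs[a] * sample[k]
        second_moment += probs[a] * sum(v * v for v in sample)
    variance = second_moment - sum(m * m for m in mean)
    return mean, variance


if __name__ == "__main__":
    logits = [0.0, 0.0, 0.0]
    rewards = [10.0, 11.0, 12.0]  # illustrative values with a large offset
    print(estimator_stats(logits, rewards, baseline=0.0))
    print(estimator_stats(logits, rewards, baseline=11.0))
```

The two calls return the same mean gradient, but the second has a much smaller variance, because the baseline removes the common offset shared by all rewards. The paper's contribution is a richer baseline that may also depend on parts of the action.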

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Machine Learning

1803.07246

Genre: Research Report (0.64)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Swarm AI: Shaping the Conscience of Tomorrow's Artificial Intelligence - 1redDrop

#artificialintelligence, Mar-17-2018, 19:11:10 GMT

Artificial intelligence might arguably be the newest frontier of human experience, but there's no denying that man has been fascinated with the concept for millennia. From the mythical stories of Hephaestus creating mechanical servants and brazen-footed bulls that puffed fire from their mouths, to the talking heads of the 13th century, to IBM Watson and modern forms of AI, the subject has been bubbling on the surface of human consciousness. The time is now here for AI to come of age; and, in many ways, it already has. But now there's a new problem, and it's not one of how AI can be implemented, as has been the major challenge in the past. AI has now sprouted into a plethora of forms, each rivaling the other in an attempt to showcase its superior capabilities.

evolutionary algorithm, machine learning, natural language, (17 more...)

#artificialintelligence

Country: North America > United States (0.15)

Industry:

Government (0.98)
Leisure & Entertainment > Games (0.97)

Technology:

Information Technology > Artificial Intelligence > Robots (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.45)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.45)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.35)

The Near Future: See How Healthcare Tech Will Transform Our Lives

#artificialintelligence, Mar-14-2018, 18:12:30 GMT

CableLabs just released a cool short film called The Near Future: A Better Place that explores how emerging technologies in healthcare will transform our daily lives. A substantial percentage of the population worldwide is over the age of 60, and it will dramatically increase in the next two decades. This really underscores the importance of healthcare advancements, and connectivity is the underlying component that will power the emerging technologies that can transform our daily lives, such as IoT, telemedicine, intelligent agents and new sensors. For example, Cookie, the little robot AI agent in the film, is an in-home companion that provides social interaction, around-the-clock monitoring, and a direct interface with the complex system of care at the hospital. With this short film, CableLabs wants to inspire you and the entire tech and healthcare industry to help make this vision a reality in the near future.

artificial intelligence, daily lives, lives, (2 more...)

#artificialintelligence

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.87)

DEPSO Algorithm: Project Portal – Xiao-Feng Xie, Ph.D.

#artificialintelligence, Mar-12-2018, 06:09:50 GMT

DEPSO [1], also called DEPS, is an algorithm for (constrained) numerical optimization problems (NOP). DEPSO combines the advantages of Particle Swarm Optimization (PSO) and Differential Evolution (DE). It is incorporated into the cooperative group optimization (CGO) system [2]. The DEPSO paper has been cited over 400 times with various applications. DEPSO was also implemented (by Sun Microsystems Inc.) in NLPSolver (Solver for Nonlinear Programming), an extension of Calc in Apache OpenOffice.

evolutionary algorithm, information, machine learning, (10 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.65)

New Ideas for Brain Modelling 4

Greer, Kieran

arXiv.org Artificial Intelligence, Mar-12-2018

This paper continues the research that considers a new cognitive model based strongly on the human brain. In particular, it considers the neural binding structure of an earlier paper. It also describes some new methods in the areas of image processing and behaviour simulation. The work is all based on earlier research by the author and the new additions are intended to fit in with the overall design. For image processing, a grid-like structure is used with 'full linking'. Each cell in the classifier grid stores a list of all other cells it gets associated with and this is used as the learned image that new input is compared to. For the behaviour metric, a new prediction equation is suggested, as part of a simulation, that uses feedback and history to dynamically determine its course of action. While the new methods are from widely different topics, both can be compared with the binary-analog type of interface that is the main focus of the paper. It is suggested that the simplest of linking between a tree and ensemble can explain neural binding and variable signal strengths.

artificial intelligence, machine learning, pattern recognition, (17 more...)

arXiv.org Artificial Intelligence

1708.04806

Country:

North America > United States (0.46)
Europe (0.46)

Genre: Research Report (0.41)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.94)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.93)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.68)

Video Friday: Human-Drone Interaction, Soft Robotics, and Basketball Robot

IEEE Spectrum Robotics, Mar-10-2018, 00:05:04 GMT

Video Friday is your weekly selection of awesome robotics videos, collected by your Automaton bloggers. We'll also be posting a weekly calendar of upcoming robotics events for the next few months; here's what we have so far (send us your events!): Let us know if you have suggestions for next week, and enjoy today's videos. We were at the 2018 Human Robot Interaction conference all this week, and on Wednesday, there was a special video session. The audience, which was provided with popcorn, voted by applause, and here are the top three videos.

artificial intelligence, robot, video friday, (10 more...)

IEEE Spectrum Robotics

Country: North America > United States > California (0.15)

Industry:

Health & Medicine (0.31)
Leisure & Entertainment > Sports > Basketball (0.30)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.73)
