AITopics

Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is beginning to show some successes in real-world scenarios. However, much of the research advances in RL are hard to leverage in real-world systems due to a series of assumptions that are rarely satisfied in practice. In this work, we identify and formalize a series of independent challenges that embody the difficulties that must be addressed for RL to be commonly deployed in real-world systems. For each challenge, we define it formally in the context of a Markov Decision Process, analyze the effects of the challenge on state-of-the-art learning algorithms, and present some existing attempts at tackling it. We believe that an approach that addresses our set of proposed challenges would be readily deployable in a large number of real world problems. Our proposed challenges are implemented in a suite of continuous control environments called realworldrl-suite which we propose an as an open-source benchmark.

agent, algorithm, constraint, (12 more...)

2003.11881

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.82)

Industry:

Transportation (0.67)
Leisure & Entertainment > Games (0.67)
Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

COEBA: A Coevolutionary Bat Algorithm for Discrete Evolutionary Multitasking

Osaba, Eneko, Del Ser, Javier, Yang, Xin-She, Iglesias, Andres, Galvez, Akemi

Multitasking optimization is an emerging research field which has attracted lot of attention in the scientific community. The main purpose of this paradigm is how to solve multiple optimization problems or tasks simultaneously by conducting a single search process. The main catalyst for reaching this objective is to exploit possible synergies and complementarities among the tasks to be optimized, helping each other by virtue of the transfer of knowledge among them (thereby being referred to as Transfer Optimization). In this context, Evolutionary Multitasking addresses Transfer Optimization problems by resorting to concepts from Evolutionary Computation for simultaneous solving the tasks at hand. This work contributes to this trend by proposing a novel algorithmic scheme for dealing with multitasking environments. The proposed approach, coined as Coevolutionary Bat Algorithm, finds its inspiration in concepts from both co-evolutionary strategies and the metaheuristic Bat Algorithm. We compare the performance of our proposed method with that of its Multifactorial Evolutionary Algorithm counterpart over 15 different multitasking setups, composed by eight reference instances of the discrete Traveling Salesman Problem. The experimentation and results stemming therefrom support the main hypothesis of this study: the proposed Coevolutionary Bat Algorithm is a promising meta-heuristic for solving Evolutionary Multitasking scenarios.

algorithm, coeba, mfea, (12 more...)

2003.11628

Country:

Europe > United Kingdom (0.14)
North America > United States > New York (0.04)
North America > United States > New Jersey (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning

Mousavi, Ali, Li, Lihong, Liu, Qiang, Zhou, Denny

Off-policy estimation for long-horizon problems is important in many real-life applications such as healthcare and robotics, where high-fidelity simulators may not be available and on-policy evaluation is expensive or impossible. Recently, \cite{liu18breaking} proposed an approach that avoids the \emph{curse of horizon} suffered by typical importance-sampling-based methods. While showing promising results, this approach is limited in practice as it requires data be drawn from the \emph{stationary distribution} of a \emph{known} behavior policy. In this work, we propose a novel approach that eliminates such limitations. In particular, we formulate the problem as solving for the fixed point of a certain operator. Using tools from Reproducing Kernel Hilbert Spaces (RKHSs), we develop a new estimator that computes importance ratios of stationary distributions, without knowledge of how the off-policy data are collected. We analyze its asymptotic consistency and finite-sample generalization. Experiments on benchmarks verify the effectiveness of our approach.

behavior policy, stationary distribution, trajectory, (14 more...)

2003.11126

Country:

North America > United States > New York (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Promising Solution (0.34)

Industry:

Health & Medicine (0.66)
Transportation > Air (0.42)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Bassani, Hansenclever F., Delgado, Renie A., Junior, Jose Nilton de O. Lima, Medeiros, Heitor R., Braga, Pedro H. M., Tapp, Alain

Learning to Play Soccer by Reinforcement and Applying Sim-to-Real to Compete in the Real World

This work presents an application of Reinforcement Learning (RL) for the complete control of real soccer robots of the IEEE Very Small Size Soccer (VSSS) [1], a traditional league in the Latin American Robotics Competition (LARC). In the VSSS league, two teams of three small robots play against each other. We propose a simulated environment in which continuous or discrete control policies can be trained, and a Sim-to-Real method to allow using the obtained policies to control a robot in the real world. The results show that the learned policies display a broad repertoire of behaviors which are difficult to specify by hand. This approach, called VSSS-RL, was able to beat the human-designed policy for the striker of the team ranked 3rd place in the 2018 LARC, in 1-vs-1 matches.

arxiv preprint arxiv, reinforcement learning, sim-to-real, (11 more...)

2003.11102

Country:

South America > Brazil > Pernambuco > Recife (0.07)
North America > Canada > Quebec > Montreal (0.05)
Europe > Portugal > Braga > Braga (0.05)

Genre: Research Report (0.70)

Industry: Leisure & Entertainment > Sports > Soccer (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.55)

Mulamba, Maxime, Mandi, Jayanta, Canoy, Rocsildes, Guns, Tias

Hybrid Classification and Reasoning for Image-based Constraint Solving

There is an increased interest in solving complex constrained problems where part of the input is not given as facts but received as raw sensor data such as images or speech. We will use "visual sudoku" as a prototype problem, where the given cell digits are handwritten and provided as an image thereof. In this case, one first has to train and use a classifier to label the images, so that the labels can be used for solving the problem. In this paper, we explore the hybridization of classifying the images with the reasoning of a constraint solver. We show that pure constraint reasoning on predictions does not give satisfactory results. Instead, we explore the possibilities of a tighter integration, by exposing the probabilistic estimates of the classifier to the constraint solver. This allows joint inference on these probabilistic estimates, where we use the solver to find the maximum likelihood solution. We explore the trade-off between the power of the classifier and the power of the constraint reasoning, as well as further integration through the additional use of structural knowledge. Furthermore, we investigate the effect of calibration of the probabilistic estimates on the reasoning. Our results show that such hybrid approaches vastly outperform a separate approach, which encourages a further integration of prediction (probabilities) and constraint solving.

constraint, prediction, reasoning, (17 more...)

2003.11001

Country: North America > United States (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Leisure & Entertainment > Games > Sudoku (0.31)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Riley, Connor, Van Hentenryck, Pascal, Yuan, Enpeng

Real-Time Dispatching of Large-Scale Ride-Sharing Systems: Integrating Optimization, Machine Learning, and Model Predictive Control

This paper considers the dispatching of large-scale real-time ride-sharing systems to address congestion issues faced by many cities. The goal is to serve all customers (service guarantees) with a small number of vehicles while minimizing waiting times under constraints on ride duration. This paper proposes an end-to-end approach that tightly integrates a state-of-the-art dispatching algorithm, a machine-learning model to predict zone-to-zone demand over time, and a model predictive control optimization to relocate idle vehicles. Experiments using historic taxi trips in New York City indicate that this integration decreases average waiting times by about 30% over all test cases and reaches close to 55% on the largest instances for high-demand zones.

optimization, relocation, vehicle, (14 more...)

2003.10942

Country:

North America > United States > New York (0.25)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (0.64)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.94)

Hirata, Daiki, Takahashi, Norikazu

Ensemble learning in CNN augmented with fully connected subnetworks

Convolutional Neural Networks (CNNs) [1] are attracting a great deal of attention because they show remarkable performance in general object recognition tasks. Various methods have been proposed so far for improving the performance of CNNs: pre-processing [2-4], dropout [5], batch normalization [6], ensemble learning [7, 8], and so on. In this paper, we propose a new model based on CNNs to further improve the performance in image recognitioin tasks. Our model consists of one base CNN and multiple Fully Connected SubNetworks (FCSNs). The base CNN generates a set of multi-channel feature-maps after each convolutional layer. The set of feature-maps generated by the last convolutional layer is divided along channels into disjoint subsets, and each subset is assigned to one of the FCSNs, which is trained independent of others so that it can predict the class label from the subset of the featuremaps assigned to it.

artificial intelligence, machine learning, subnetwork, (18 more...)

doi: 10.1587/transinf.2022EDL8098

2003.08562

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > Japan > Honshū > Chūgoku > Okayama Prefecture > Okayama (0.05)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

#artificialintelligenceMar-23-2020, 23:57:36 GMT

Machine Learning Algorithms

Editor's Note: This presentation was given by Lauren Shin at GraphConnect New York City in September 2018. Presentation Summary Lauren Shin is a developer relations intern with Neo4j and a student at UC Berkeley. In her presentation, Shin briefly introduces the concept of machine learning. To those who may be wary of a robot takeover, machine learning is an application of statistics so that machines are able to learn with data. The majority of this is data cleansing and a lot of math.

graph, machine learning, procedure, (12 more...)

#artificialintelligence

Country: North America > United States > New York (0.24)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligenceMar-23-2020, 23:56:36 GMT

Tech-Enabled A2J: From Text to Machine Learning, How Legal Aid Is Leveraging Technology to Increase Access to Justice Legal Executive Institute

In an new column, "Tech-Enabled A2J", we will take a look at how legal start-ups and legal technology innovations are impacting the push toward better Access to Justice for more citizens. Whereas LSOs have found past success in reaching clients through basic tools like texting, they are now moving to more advanced platforms like document automation to better streamline internal processes. Some are even going one step further by embarking on artificial intelligence (AI) and machine learning (ML) projects to determine how they can help address the 86% of civil legal problems reported by low-income Americans that aren't fully resolved. Access to justice starts with literal access: figuring out how clients best receive, digest, and act on legal information. On the lower-tech end, text messaging has proven to be a successful tool for reaching those in need.

classifier, justice legal executive institute, leveraging technology, (11 more...)

#artificialintelligence

Country:

North America > United States > Wyoming (0.05)
North America > United States > Oregon > Multnomah County > Portland (0.05)
North America > United States > New York (0.05)
(5 more...)

Industry:

Law (1.00)
Education > Educational Setting (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligenceMar-23-2020, 23:50:19 GMT

Twitter round-up: Andrej Karpathy on the Tesla Autopilot top tweet in February 2020

Verdict lists ten of the most popular tweets on artificial intelligence in February 2020, based on data from GlobalData's Influencer Platform. The top tweets were chosen from influencers as tracked by GlobalData's Influencer Platform, which is based on a scientific process that works on pre-defined parameters. Influencers are selected after a deep analysis of the influencer's relevance, network strength, engagement, and leading discussions on new and emerging trends. He shared a video featuring the advanced driver-assistance system, claiming that in no field AI expertise could be making so much of a difference. Help revolutionize the world with full self-driving by joining us at Tesla Autopilot: https://t.co/ekekjKDOZF

influencer, twitter handle, username, (11 more...)

#artificialintelligence

Country:

North America > United States (0.16)
Asia > China (0.05)

Genre: Research Report (0.30)

Industry:

Automobiles & Trucks > Manufacturer (0.72)
Transportation > Ground > Road (0.31)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)