Not enough data to create a plot.
Try a different view from the menu above.
Automatic Data Augmentation for Generalization in Reinforcement Learning
Deep reinforcement learning (RL) agents often fail to generalize beyond their training environments. To alleviate this problem, recent work has proposed the use of data augmentation. However, different tasks tend to benefit from different types of augmentations and selecting the right one typically requires expert knowledge. In this paper, we introduce three approaches for automatically finding an effective augmentation for any RL task. These are combined with two novel regularization terms for the policy and value function, required to make the use of data augmentation theoretically sound for actor-critic algorithms.
Causal Shapley Values: Exploiting Causal Knowledge to Explain Individual Predictions of Complex Models
Shapley values underlie one of the most popular model-agnostic methods within explainable artificial intelligence. These values are designed to attribute the difference between a model's prediction and an average baseline to the different features used as input to the model. Being based on solid game-theoretic principles, Shapley values uniquely satisfy several desirable properties, which is why they are increasingly used to explain the predictions of possibly complex and highly non-linear machine learning models. Shapley values are well calibrated to a user's intuition when features are independent, but may lead to undesirable, counterintuitive explanations when the independence assumption is violated. In this paper, we propose a novel framework for computing Shapley values that generalizes recent work that aims to circumvent the independence assumption. By employing Pearl's do-calculus, we show how these'causal' Shapley values can be derived for general causal graphs without sacrificing any of their desirable properties. Moreover, causal Shapley values enable us to separate the contribution of direct and indirect effects. We provide a practical implementation for computing causal Shapley values based on causal chain graphs when only partial information is available and illustrate their utility on a real-world example.
College students demolish world record for fastest Rubik's cube robot
Breakthroughs, discoveries, and DIY tips sent every weekday. Mitsubishi's bragging rights for designing the world's fastest Rubik's cube-solving robot have officially been stolen by a team of undergrads in Indiana. Earlier this month, Purdue University announced four collaborators in its Elmore Family School of Electrical and Computer Engineering (ECE) successfully designed and built a bot that not only set the new Guinness World Record--it absolutely demolished the multinational company's previous time. Meet Purdubik's Cube: a machine capable of completing a randomly shuffled Rubik's cube in just 0.103 seconds. At 1-2 times faster than the blink of a human eye, the feat is difficult to see, much less comprehend.
Urgent warning to Americans over 'dangerous' technology quietly rolled out in 80 airports
Within seconds, you've been scanned, stored, and tracked--before even reaching airport security. Without ever handing over your ID, the Transportation Security Administration (TSA) already knows exactly who you are. This is happening at 84 airports across the US. And chances are, you didn't even notice. Marketed as a tool to enhance security, TSA's facial recognition system is drawing criticism for its potential to track Americans from the terminal entrance to their final destination.