Not enough data to create a plot.
Try a different view from the menu above.
Stentz, Anthony
Learning to Manipulate Unknown Objects in Clutter by Reinforcement
Boularias, Abdeslam (Carnegie Mellon University) | Bagnell, James Andrew (Carnegie Mellon University) | Stentz, Anthony (Carnegie Mellon University)
We present a fully autonomous robotic system for grasping objects in dense clutter. The objects are unknown and have arbitrary shapes. Therefore, we cannot rely on prior models. Instead, the robot learns online, from scratch, to manipulate the objects by trial and error. Grasping objects in clutter is significantly harder than grasping isolated objects, because the robot needs to push and move objects around in order to create sufficient space for the fingers. These pre-grasping actions do not have an immediate utility, and may result in unnecessary delays. The utility of a pre-grasping action can be measured only by looking at the complete chain of consecutive actions and effects. This is a sequential decision-making problem that can be cast in the reinforcement learning framework. We solve this problem by learning the stochastic transitions between the observed states, using nonparametric density estimation. The learned transition function is used only for re-calculating the values of the executed actions in the observed states, with different policies. Values of new state-actions are obtained by regressing the values of the executed actions. The state of the system at a given time is a depth (3D) image of the scene. We use spectral clustering for detecting the different objects in the image. The performance of our system is assessed on a robot with real-world objects.
Toward Mobile Robots Reasoning Like Humans
Oh, Jean H (Carnegie Mellon University) | Suppé, Arne (Carnegie Mellon University) | Duvallet, Felix (Carnegie Mellon University) | Boularias, Abdeslam (Carnegie Mellon University) | Navarro-Serment, Luis (Carnegie Mellon University) | Hebert, Martial (Carnegie Mellon University) | Stentz, Anthony (Carnegie Mellon University) | Vinokurov, Jerry (Carnegie Mellon University) | Romero, Oscar (Carnegie Mellon University) | Lebiere, Christian (Carnegie Mellon University) | Dean, Robert (General Dynamics Robotic Systems)
Robots are increasingly becoming key players in human-robot teams. To become effective teammates, robots must possess profound understanding of an environment, be able to reason about the desired commands and goals within a specific context, and be able to communicate with human teammates in a clear and natural way. To address these challenges, we have developed an intelligence architecture that combines cognitive components to carry out high-level cognitive tasks, semantic perception to label regions in the world, and a natural language component to reason about the command and its relationship to the objects in the world. This paper describes recent developments using this architecture on a fielded mobile robot platform operating in unknown urban environments. We report a summary of extensive outdoor experiments; the results suggest that a multidisciplinary approach to robotics has the potential to create competent human-robot teams.
Efficient Optimization for Autonomous Robotic Manipulation of Natural Objects
Boularias, Abdeslam (Carnegie Mellon University) | Bagnell, James Andrew (Carnegie Mellon University) | Stentz, Anthony (Carnegie Mellon University)
Manipulating natural objects of irregular shapes, such as rocks, is an essential capability of robots operating in outdoor environments. Physics-based simulators are commonly used to plan stable grasps for man-made objects. However, planning is an expensive process that is based on simulating hand and object trajectories in different configurations, and evaluating the outcome of each trajectory. This problem is particularly concerning when the objects are irregular or cluttered, because the space of feasible grasps is significantly smaller, and more configurations need to be evaluated before finding a good one. In this paper, we first present a learning technique for fast detection of an initial set of potentially stable grasps in a cluttered scene. The best detected grasps are further optimized by fine-tuning the configuration of the hand in simulation. To reduce the computational burden of this last operation, we model the outcomes of the grasps as a Gaussian Process, and use an entropy-search method in order to focus the optimization on regions where the best grasp is most likely to be. This approach is tested on the task of clearing piles of real, unknown, rock debris with an autonomous robot. Empirical results show a clear advantage of the proposed approach when the time window for decision is short.
Using Expectations to Drive Cognitive Behavior
Kurup, Unmesh (Carnegie Mellon University) | Lebiere, Christian (Carnegie Mellon University) | Stentz, Anthony (Carnegie Mellon University) | Hebert, Martial (Carnegie Mellon University)
Generating future states of the world is an essential component of high-level cognitive tasks such as planning. We explore the notion that such future-state generation is more widespread and forms an integral part of cognition. We call these generated states expectations, and propose that cognitive systems constantly generate expectations, match them to observed behavior and react when a difference exists between the two. We describe an ACT-R model that performs expectation-driven cognition on two tasks – pedestrian tracking and behavior classification. The model generates expectations of pedestrian movements to track them. The model also uses differences in expectations to identify distinctive features that differentiate these tracks. During learning, the model learns the association between these features and the various behaviors. During testing, it classifies pedestrian tracks by recalling the behavior associated with the features of each track. We tested the model on both single and multiple behavior datasets and compared the results against a k-NN classifier. The k-NN classifier outperformed the model in correct classifications, but the model had fewer incorrect classifications in the multiple behavior case, and both systems had about equal incorrect classifications in the single behavior case.