Integrating Visual Foundation Models for Enhanced Robot Manipulation and Motion Planning: A Layered Approach

Sep-20-2023–arXiv.org Artificial Intelligence

This paper presents a novel layered framework that integrates visual foundation models to improve robot manipulation tasks and motion planning. The framework consists of five layers: Perception, Cognition, Planning, Execution, and Learning. Using visual foundation models, we enhance the robot's perception of its environment, enabling more efficient task understanding and accurate motion planning. This approach allows for real-time adjustments and continual learning, leading to significant improvements in task execution. Experimental results demonstrate the effectiveness of the proposed framework in various robot manipulation tasks and motion planning scenarios, highlighting its potential for practical deployment in dynamic environments.

enhanced robot manipulation, integrating visual foundation model, robot manipulation and motion planning, (1 more...)

arXiv.org Artificial Intelligence

Sep-20-2023

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.69)

Technology:
- Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found