Constrained Robotic Navigation on Preferred Terrains Using LLMs and Speech Instruction: Exploiting the Power of Adverbs

Lotfi, Faraz, Faraji, Farnoosh, Kakodkar, Nikhil, Manderson, Travis, Meger, David, Dudek, Gregory

Apr-2-2024–arXiv.org Artificial Intelligence

This paper explores leveraging large language models for map-free off-road navigation using generative AI, reducing the need for traditional data collection and annotation. We propose a method in which a robot receives verbal instructions that are converted to text by Whisper, and a large language model (LLM) extracts landmarks, preferred terrains, and crucial adverbs, which are translated into speed settings for constrained navigation. A language-driven semantic segmentation model generates text-based masks for identifying landmarks and terrain types in images. By translating 2D image points to the vehicle's motion plane using camera parameters, an MPC controller can guide the vehicle towards the desired terrain. This approach enhances adaptation to diverse environments and facilitates the use of high-level instructions for navigating complex and challenging terrains.

Keywords: Constrained map-free navigation, large language models, language-driven semantic segmentation, preferred terrains, speech instruction, adverbs.
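The step of translating 2D image points to the vehicle's motion plane can be illustrated with a minimal sketch. It assumes a pinhole camera with no lens distortion, a flat ground plane, and a camera mounted at a known height and downward pitch. The function name `pixel_to_ground`, its parameters, and the frame conventions (vehicle X forward, Y left) are assumptions made for illustration; the abstract does not specify the authors' formulation.

```python
import math
from typing import Optional, Tuple


def pixel_to_ground(
    u: float, v: float,
    fx: float, fy: float, cx: float, cy: float,
    cam_height: float, pitch_down: float,
) -> Optional[Tuple[float, float]]:
    """Project pixel (u, v) onto a flat ground plane (hypothetical helper).

    Camera frame: x right, y down, z forward (optical axis).
    Vehicle frame: X forward, Y left, Z up, with the ground at Z = 0.
    The camera sits at (0, 0, cam_height) and is pitched down by
    pitch_down radians. Returns (X, Y) in metres, or None when the
    pixel's ray does not hit the ground (at or above the horizon).
    """
    # Normalised image coordinates from the pinhole intrinsics.
    xn = (u - cx) / fx
    yn = (v - cy) / fy

    s, c = math.sin(pitch_down), math.cos(pitch_down)

    # Ray direction expressed in the vehicle frame.
    dx = c - yn * s
    dy = -xn
    dz = -(s + yn * c)

    # The ray must point downward to intersect the ground plane.
    if dz >= -1e-9:
        return None

    # Scale the ray so that it descends exactly cam_height.
    t = cam_height / -dz
    return (t * dx, t * dy)
```

With a level camera 1 m above the ground, a focal length of 500 px, and a principal point at (320, 240), the pixel (320, 490) maps to a point 2 m straight ahead. Points recovered this way from a terrain mask could serve as targets for a controller such as the MPC mentioned above.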

large language model, machine learning, natural language

Country:
- North America > Canada > Quebec > Montreal (0.14)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Energy > Oil & Gas (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.67)
  - Natural Language > Large Language Model (1.00)
