VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions

Oct-10-2025, 09:19:55 GMT–Neural Information Processing Systems

These skills are refined and updated through an iterative comparison strategy, enabling efficient adaptation to unseen environments.

arxiv preprint arxiv, human video, video, (14 more...)

Neural Information Processing Systems

Oct-10-2025, 09:19:55 GMT

Conferences PDF

Country:
- North America > Montserrat (0.04)
- Asia
  - Japan > Shikoku
    - Kagawa Prefecture > Takamatsu (0.04)
  - China
    - Hong Kong (0.04)
    - Beijing > Beijing (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Education (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Robots (1.00)
  - Representation & Reasoning (1.00)
  - Natural Language (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions

Similar Docs Excel Report more

Title	Similarity	Source
None found