VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions

Feb-16-2026, 13:52:04 GMT–Neural Information Processing Systems

These skills are refined and updated through an iterative comparison strategy, enabling efficient adaptation to unseen environments.

arxiv preprint arxiv, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Feb-16-2026, 13:52:04 GMT

Conferences PDF

Country:
- North America > Montserrat (0.04)
- Asia
  - Japan > Shikoku
    - Kagawa Prefecture > Takamatsu (0.04)
  - China
    - Hong Kong (0.04)
    - Beijing > Beijing (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Education (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Robots (1.00)
  - Representation & Reasoning (1.00)
  - Natural Language (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions

Similar Docs Excel Report more

Title	Similarity	Source
None found