Text2Traj2Text: Learning-by-Synthesis Framework for Contextual Captioning of Human Movement Trajectories

Asano, Hikaru, Yonetani, Ryo, Sekii, Taiki, Ouchi, Hiroki

Sep-19-2024–arXiv.org Artificial Intelligence

This paper presents Text2Traj2Text, a novel learning-by-synthesis framework for captioning possible contexts behind shopper's trajectory data in retail stores. Our work will impact various retail applications that need better customer understanding, such as targeted advertising and inventory management. The key idea is leveraging large language models to synthesize a diverse and realistic collection of contextual captions as well as the corresponding movement trajectories on a store map. Despite learned from fully synthesized data, the captioning model can generalize well to trajectories/captions created by real human subjects. Our systematic evaluation confirmed the effectiveness of the proposed framework over competitive approaches in terms of ROUGE and BERT Score metrics.

caption, customer, trajectory, (17 more...)

arXiv.org Artificial Intelligence

Sep-19-2024

arXiv.org PDF

Add feedback

Country:
- South America > Peru (0.04)
- Asia
  - China (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
  - Middle East
    - Republic of Türkiye (0.04)
    - Iran (0.04)
  - Japan > Honshū
    - Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)

Genre:
- Overview (0.68)
- Workflow (0.48)

Industry:
- Health & Medicine (0.68)
- Retail (0.67)
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis
  - Beverages (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Representation & Reasoning (1.00)
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found