Recall: Empowering Multimodal Embedding for Edge Devices

Cai, Dongqi, Wang, Shangguang, Peng, Chen, Zhang, Zeling, Xu, Mengwei

Sep-9-2024–arXiv.org Artificial Intelligence

Human memory is inherently prone to forgetting. Multimodal embedding models address this by transforming diverse real-world data into a unified embedding space. These embeddings can be retrieved efficiently, helping mobile users recall past information. However, as model complexity grows, so do resource demands, which reduces throughput and imposes heavy computational requirements that limit deployment on mobile devices. In this paper, we introduce RECALL, a novel on-device multimodal embedding system optimized for resource-limited mobile environments. RECALL achieves high-throughput, accurate retrieval by generating coarse-grained embeddings and leveraging query-based filtering for refined retrieval. Experimental results demonstrate that RECALL delivers high-quality embeddings with superior throughput while operating unobtrusively, with minimal memory and energy consumption.
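
The coarse-then-refine idea in the abstract can be illustrated with a small, hypothetical sketch. It is not RECALL's implementation: the abstract does not specify how coarse-grained embeddings are produced or how query-based filtering works. Here, truncating a vector to its first dimensions stands in for a cheap coarse embedding, and the names `FULL`, `coarse`, `build_index`, `retrieve`, and the `refine` callback are all invented for illustration.

```python
import math

# Hypothetical "fine" embeddings; in a real system these would be costly to compute.
FULL = {
    "photo_beach": [0.9, 0.1, 0.0, 0.3],
    "photo_city": [0.1, 0.9, 0.2, 0.0],
    "note_beach_trip": [0.8, 0.2, 0.1, 0.9],
    "voice_memo": [0.0, 0.1, 0.9, 0.1],
}


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def coarse(vec, dims=2):
    """Toy stand-in for a cheap coarse embedding: keep only the first `dims` values."""
    return vec[:dims]


def build_index(full, dims=2):
    """Store only coarse embeddings at ingest time."""
    return {item_id: coarse(vec, dims) for item_id, vec in full.items()}


def retrieve(query, index, refine, dims=2, shortlist=2, k=1):
    """Stage 1: shortlist items by coarse similarity to the query.
    Stage 2: call `refine` (assumed expensive) only on the shortlist and re-rank."""
    q_coarse = coarse(query, dims)
    candidates = sorted(
        index, key=lambda i: cosine(q_coarse, index[i]), reverse=True
    )[:shortlist]
    reranked = sorted(
        candidates, key=lambda i: cosine(query, refine(i)), reverse=True
    )
    return reranked[:k]


index = build_index(FULL)
query = [0.8, 0.2, 0.1, 0.9]
top = retrieve(query, index, refine=FULL.__getitem__)
```

In this sketch the expensive `refine` step runs only for the shortlisted items rather than for the whole collection, which is the general trade-off the abstract points to: cheap embeddings for everything, extra work only for candidates relevant to a query.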
