Recall: Empowering Multimodal Embedding for Edge Devices
Cai, Dongqi; Wang, Shangguang; Peng, Chen; Zhang, Zeling; Xu, Mengwei
arXiv.org Artificial Intelligence
Human memory is inherently prone to forgetting. To address this, multimodal embedding models have been introduced, which transform diverse real-world data into a unified embedding space. These embeddings can be retrieved efficiently, helping mobile users recall past information. However, as model complexity grows, so do resource demands, reducing throughput and imposing heavy computational requirements that limit deployment on mobile devices. In this paper, we introduce RECALL, a novel on-device multimodal embedding system optimized for resource-limited mobile environments. RECALL achieves high-throughput, accurate retrieval by generating coarse-grained embeddings and leveraging query-based filtering for refined retrieval. Experimental results demonstrate that RECALL delivers high-quality embeddings with superior throughput, all while operating unobtrusively with minimal memory and energy consumption.
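The abstract does not specify how RECALL implements its pipeline, so the following is only a minimal sketch of the general coarse-then-refine retrieval idea it mentions: rank every stored item using cheap coarse-grained embeddings, then compute refined embeddings only for the shortlisted candidates that match the query. All names (`two_stage_retrieve`, `coarse_index`, `refine_fn`, the toy vectors) are hypothetical and are not taken from the paper.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def two_stage_retrieve(query, coarse_index, refine_fn, k_coarse=2, k_final=1):
    """Generic two-stage retrieval (an illustrative assumption, not RECALL's code).

    Stage 1: rank all items by their cheap coarse embeddings, keep k_coarse.
    Stage 2: compute refined embeddings only for those candidates and re-rank.
    """
    candidates = sorted(
        coarse_index,
        key=lambda item: cosine(query, coarse_index[item]),
        reverse=True,
    )[:k_coarse]
    reranked = sorted(
        candidates,
        key=lambda item: cosine(query, refine_fn(item)),
        reverse=True,
    )
    return reranked[:k_final]

# Toy data: coarse embeddings are stored for every item up front.
coarse_index = {
    "beach": [1.0, 0.0, 0.0],
    "city": [0.9, 0.1, 0.0],
    "meeting": [0.0, 1.0, 0.0],
    "receipt": [0.0, 0.0, 1.0],
}

# Refined embeddings stand in for an expensive model call made on demand.
fine_embeddings = {
    "beach": [0.7, 0.0, 0.7],
    "city": [0.95, 0.05, 0.0],
    "meeting": [0.0, 1.0, 0.0],
    "receipt": [0.0, 0.0, 1.0],
}
refine_calls = []

def refine_fn(item):
    """Hypothetical expensive refinement; logs which items were refined."""
    refine_calls.append(item)
    return fine_embeddings[item]

query = [1.0, 0.0, 0.0]
result = two_stage_retrieve(query, coarse_index, refine_fn)
```

In this toy run, the expensive refinement is invoked only for the two shortlisted items rather than all four, which is the kind of saving that motivates a coarse-first design on resource-limited devices.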
Sep-9-2024
- Country:
- North America > United States
- District of Columbia > Washington (0.05)
- Alabama > Mobile County
- Mobile (0.04)
- Europe > Switzerland
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Information Technology
- Services (0.68)
- Security & Privacy (0.67)
- Technology:
- Information Technology
- Communications > Mobile (1.00)
- Cloud Computing (0.93)
- Hardware (0.90)
- Artificial Intelligence
- Vision (1.00)
- Representation & Reasoning (1.00)
- Natural Language (1.00)
- Machine Learning > Neural Networks
- Deep Learning (0.93)