Recall: Empowering Multimodal Embedding for Edge Devices
Cai, Dongqi, Wang, Shangguang, Peng, Chen, Zhang, Zeling, Xu, Mengwei
–arXiv.org Artificial Intelligence
Human memory is inherently prone to forgetting. To address this, multimodal embedding models have been introduced, which transform diverse real-world data into a unified embedding space. These embeddings can be retrieved efficiently, aiding mobile users in recalling past information. However, as model complexity grows, so do its resource demands, leading to reduced throughput and heavy computational requirements that limit mobile device implementation. In this paper, we introduce RECALL, a novel on-device multimodal embedding system optimized for resource-limited mobile environments. RECALL achieves high-throughput, accurate retrieval by generating coarse-grained embeddings and leveraging query-based filtering for refined retrieval. Experimental results demonstrate that RECALL delivers high-quality embeddings with superior throughput, all while operating unobtrusively with minimal memory and energy consumption.
arXiv.org Artificial Intelligence
Sep-9-2024
- Country:
- Europe > Switzerland
- North America > United States
- Alabama > Mobile County
- Mobile (0.04)
- District of Columbia > Washington (0.05)
- Alabama > Mobile County
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Information Technology
- Security & Privacy (0.67)
- Services (0.68)
- Information Technology
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (0.93)
- Natural Language (1.00)
- Representation & Reasoning (1.00)
- Vision (1.00)
- Machine Learning > Neural Networks
- Cloud Computing (0.93)
- Communications > Mobile (1.00)
- Hardware (0.90)
- Artificial Intelligence
- Information Technology