HotelMatch-LLM: Joint Multi-Task Training of Small and Large Language Models for Efficient Multimodal Hotel Retrieval
Askari, Arian, Stergiadis, Emmanouil, Gusev, Ilya, Beladev, Moran
arXiv.org Artificial Intelligence
We present HotelMatch-LLM, a multimodal dense retrieval model for the travel domain that enables natural language property search, addressing the limitations of traditional travel search engines, which require users to start with a destination and then edit search parameters. HotelMatch-LLM features three key innovations: (1) domain-specific multi-task optimization with three novel retrieval, visual, and language modeling objectives; (2) an asymmetrical dense retrieval architecture combining a small language model (SLM) for efficient online query processing and a large language model (LLM) for embedding hotel data; and (3) extensive image processing to handle all property image galleries. Experiments on four diverse test sets show that HotelMatch-LLM significantly outperforms state-of-the-art models, including VISTA and MARVEL. Specifically, on the test set with the main query type, HotelMatch-LLM achieves 0.681 compared to 0.603 for the most effective baseline, MARVEL. Our analysis highlights the impact of our multi-task optimization, the generalizability of HotelMatch-LLM across LLM architectures, and its scalability for processing large image galleries.
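The asymmetrical setup in item (2) can be illustrated with a minimal sketch. This is not the authors' implementation: both encoders are toy hashed bag-of-words stand-ins (hypothetical names `small_query_encoder` and `large_hotel_encoder`), the shared dimension `DIM` is an assumed value, and images are left out. It only shows the general pattern of embedding hotels offline with a heavier encoder and embedding queries online with a cheaper one, both into the same vector space.

```python
import hashlib
import math

DIM = 4096  # shared embedding size (assumed value, not from the paper)


def _bucket(feature):
    """Map a text feature to a stable index in the shared space."""
    digest = hashlib.md5(feature.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % DIM


def _normalise(vec):
    """Scale a vector to unit length so dot products are cosine scores."""
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def small_query_encoder(text):
    """Cheap online encoder (toy stand-in for the SLM): unigrams only."""
    vec = [0.0] * DIM
    for tok in text.lower().split():
        vec[_bucket(tok)] += 1.0
    return _normalise(vec)


def large_hotel_encoder(text):
    """Heavier offline encoder (toy stand-in for the LLM): unigrams plus bigrams."""
    toks = text.lower().split()
    vec = [0.0] * DIM
    for tok in toks:
        vec[_bucket(tok)] += 1.0
    for a, b in zip(toks, toks[1:]):
        vec[_bucket(a + " " + b)] += 0.5
    return _normalise(vec)


def build_index(hotels):
    """Offline step: embed every hotel once with the large encoder."""
    return {hid: large_hotel_encoder(desc) for hid, desc in hotels.items()}


def search(query, index, k=3):
    """Online step: embed the query with the small encoder, rank by dot product."""
    q = small_query_encoder(query)
    scored = [(hid, sum(a * b for a, b in zip(q, emb))) for hid, emb in index.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:k]


# Hypothetical example data.
hotels = {
    "h1": "beachfront resort with infinity pool and spa",
    "h2": "budget hostel near central train station",
}
index = build_index(hotels)
ranking = search("resort with pool", index)
```

In this sketch the two encoders agree only because they share a hashing scheme; in a trained system, alignment between the query and hotel embedding spaces would have to come from the training objectives.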
Jun-10-2025
- Country:
- Asia
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Netherlands > South Holland
- Leiden (0.04)
- Serbia > Central Serbia
- Belgrade (0.04)
- North America
- Mexico > Mexico City
- Mexico City (0.04)
- United States (0.04)
- Genre:
- Research Report
- Experimental Study (0.46)
- New Finding (0.46)