Personal Assistant Systems
Inference Computation Scaling for Feature Augmentation in Recommendation Systems
Liu, Weihao, Du, Zhaocheng, Zhao, Haiyuan, Zhang, Wenbo, Zhao, Xiaoyan, Wang, Gang, Dong, Zhenhua, Xu, Jun
Large language models have become a powerful method for feature augmentation in recommendation systems. However, existing approaches relying on quick inference often suffer from incomplete feature coverage and insufficient specificity in feature descriptions, limiting their ability to capture fine-grained user preferences and undermining overall performance. Motivated by the recent success of inference scaling in math and coding tasks, we explore whether scaling inference can address these limitations and enhance feature quality. Our experiments show that scaling inference leads to significant improvements in recommendation performance, with a 12% increase in NDCG@10. The gains can be attributed to two key factors: feature quantity and specificity. In particular, models using extended Chain-of-Thought (CoT) reasoning generate a greater number of detailed and precise features, offering deeper insights into user preferences and overcoming the limitations of quick inference. We further investigate the factors influencing feature quantity, revealing that model choice and search strategy play critical roles in generating a richer and more diverse feature set. This is the first work to apply inference scaling to feature augmentation in recommendation systems, bridging advances in reasoning tasks to enhance personalized recommendation.
Dynamic Knowledge Selector and Evaluator for recommendation with Knowledge Graph
In recent years recommendation systems typically employ the edge information provided by knowledge graphs combined with the advantages of high-order connectivity of graph networks in the recommendation field. However, this method is limited by the sparsity of labels, cannot learn the graph structure well, and a large number of noisy entities in the knowledge graph will affect the accuracy of the recommendation results. In order to alleviate the above problems, we propose a dynamic knowledge-selecting and evaluating method guided by collaborative signals to distill information in the knowledge graph. Specifically, we use a Chain Route Evaluator to evaluate the contributions of different neighborhoods for the recommendation task and employ a Knowledge Selector strategy to filter the less informative knowledge before evaluating. We conduct baseline model comparison and experimental ablation evaluations on three public datasets. The experiments demonstrate that our proposed model outperforms current state-of-the-art baseline models, and each modules effectiveness in our model is demonstrated through ablation experiments.
Bridging Domain Gaps between Pretrained Multimodal Models and Recommendations
Zhang, Wenyu, Luo, Jie, Zhang, Xinming, Fang, Yuan
With the explosive growth of multimodal content online, pre-trained visual-language models have shown great potential for multimodal recommendation. However, while these models achieve decent performance when applied in a frozen manner, surprisingly, due to significant domain gaps (e.g., feature distribution discrepancy and task objective misalignment) between pre-training and personalized recommendation, adopting a joint training approach instead leads to performance worse than baseline. Existing approaches either rely on simple feature extraction or require computationally expensive full model fine-tuning, struggling to balance effectiveness and efficiency. To tackle these challenges, we propose \textbf{P}arameter-efficient \textbf{T}uning for \textbf{M}ultimodal \textbf{Rec}ommendation (\textbf{PTMRec}), a novel framework that bridges the domain gap between pre-trained models and recommendation systems through a knowledge-guided dual-stage parameter-efficient training strategy. This framework not only eliminates the need for costly additional pre-training but also flexibly accommodates various parameter-efficient tuning methods.
The Dream of a Dating App That Doesn't Want Your Money
Spending time on dating apps, I know from experience, can make you a little paranoid. When you swipe and swipe and nothing's working out, it could be that you've had bad luck. It could be that you're too picky. It could be--oh God--that you simply don't pull like you thought you did. But sometimes, whether out of self-protection or righteous skepticism of corporate motives, you might think: Maybe the nameless faces who created this product are conspiring against me to turn a profit--meddling in my dating life so that I'll spend the rest of my days alone, paying for any feature that gives me a shred of hope.
Asking a Google speaker to play Amazon Music tunes just got easier
It's long been possible to say "Hey Google" to your Google smart speaker to request a playlist from, say, YouTube Music, Spotify, Pandora, and even Apple Music. But can you spot the major music service that's missing? Until now, Amazon Music had been conspicuously absent from the list of music streamers that Google Assistant could easily control on your Google Nest smart speaker or display. Recently, though, Google has begun changing its tune in regard to Amazon Music support on its Nest devices. As spotted by 9to5Google, Amazon Music can finally be set as a default music service on your Google smart speakers.
InstructAgent: Building User Controllable Recommender via LLM Agent
Xu, Wujiang, Shi, Yunxiao, Liang, Zujie, Ning, Xuying, Mei, Kai, Wang, Kun, Zhu, Xi, Xu, Min, Zhang, Yongfeng
Traditional recommender systems usually take the user-platform paradigm, where users are directly exposed under the control of the platform's recommendation algorithms. However, the defect of recommendation algorithms may put users in very vulnerable positions under this paradigm. First, many sophisticated models are often designed with commercial objectives in mind, focusing on the platform's benefits, which may hinder their ability to protect and capture users' true interests. Second, these models are typically optimized using data from all users, which may overlook individual user's preferences. Due to these shortcomings, users may experience several disadvantages under the traditional user-platform direct exposure paradigm, such as lack of control over the recommender system, potential manipulation by the platform, echo chamber effects, or lack of personalization for less active users due to the dominance of active users during collaborative learning. Therefore, there is an urgent need to develop a new paradigm to protect user interests and alleviate these issues. Recently, some researchers have introduced LLM agents to simulate user behaviors, these approaches primarily aim to optimize platform-side performance, leaving core issues in recommender systems unresolved. To address these limitations, we propose a new user-agent-platform paradigm, where agent serves as the protective shield between user and recommender system that enables indirect exposure. To this end, we first construct four recommendation datasets, denoted as $\dataset$, along with user instructions for each record.
LLM-based User Profile Management for Recommender System
Bang, Seunghwan, Song, Hwanjun
The rapid advancement of Large Language Models (LLMs) has opened new opportunities in recommender systems by enabling zero-shot recommendation without conventional training. Despite their potential, most existing works rely solely on users' purchase histories, leaving significant room for improvement by incorporating user-generated textual data, such as reviews and product descriptions. Addressing this gap, we propose PURE, a novel LLM-based recommendation framework that builds and maintains evolving user profiles by systematically extracting and summarizing key information from user reviews. PURE consists of three core components: a Review Extractor for identifying user preferences and key product features, a Profile Updater for refining and updating user profiles, and a Recommender for generating personalized recommendations using the most current profile. To evaluate PURE, we introduce a continuous sequential recommendation task that reflects real-world scenarios by adding reviews over time and updating predictions incrementally. Our experimental results on Amazon datasets demonstrate that PURE outperforms existing LLM-based methods, effectively leveraging long-term user information while managing token limitations.
Apple unveils souped up version of its cheapest iPhone
Apple has released a sleeker and more expensive version of its lowest-priced iPhone in an attempt to widen the audience for a bundle of artificial intelligence technology that the company has been hoping will revive demand for its most profitable product lineup. The iPhone 16e unveiled Wednesday is the fourth generation of a model that's sold at a dramatically lower price than the iPhone's standard and premium models. The previous bargain-bin models were called the iPhone SE, with the last version coming out in 2022. Like the higher-priced iPhone 16 lineup unveiled last September, the iPhone 16e includes the souped-up computer chip needed to process an array of AI features that automatically summarise text and audio and create on-the-fly emojis while smartening up the device's virtual assistant, Siri. It will also have a more powerful battery and camera.
PSCon: Toward Conversational Product Search
Zou, Jie, Aliannejadi, Mohammad, Kanoulas, Evangelos, Han, Shuxi, Ma, Heli, Wang, Zheng, Yang, Yang, Shen, Heng Tao
Conversational Product Search (CPS) is confined to simulated conversations due to the lack of real-world CPS datasets that reflect human-like language. Additionally, current conversational datasets are limited to support cross-market and multi-lingual usage. In this paper, we introduce a new CPS data collection protocol and present PSCon, a novel CPS dataset designed to assist product search via human-like conversations. The dataset is constructed using a coached human-to-human data collection protocol and supports two languages and dual markets. Also, the dataset enables thorough exploration of six subtasks of CPS: user intent detection, keyword extraction, system action prediction, question selection, item ranking, and response generation. Furthermore, we also offer an analysis of the dataset and propose a benchmark model on the proposed CPS dataset.
Enhancing LLM-Based Recommendations Through Personalized Reasoning
Liu, Jiahao, Yan, Xueshuo, Li, Dongsheng, Zhang, Guangping, Gu, Hansu, Zhang, Peng, Lu, Tun, Shang, Li, Gu, Ning
Current recommendation systems powered by large language models (LLMs) often underutilize their reasoning capabilities due to a lack of explicit logical structuring. To address this limitation, we introduce CoT-Rec, a framework that integrates Chain-of-Thought (CoT) reasoning into LLM-driven recommendations by incorporating two crucial processes: user preference analysis and item perception evaluation. CoT-Rec operates in two key phases: (1) personalized data extraction, where user preferences and item perceptions are identified, and (2) personalized data application, where this information is leveraged to refine recommendations. Our experimental analysis demonstrates that CoT-Rec improves recommendation accuracy by making better use of LLMs' reasoning potential. The implementation is publicly available at https://anonymous.4open.science/r/CoT-Rec.