Personal Assistant Systems
Air Gap: Protecting Privacy-Conscious Conversational Agents
Bagdasaryan, Eugene, Yi, Ren, Ghalebikesabi, Sahra, Kairouz, Peter, Gruteser, Marco, Oh, Sewoong, Balle, Borja, Ramage, Daniel
The growing use of large language model (LLM)-based conversational agents to manage sensitive user data raises significant privacy concerns. While these agents excel at understanding and acting on context, this capability can be exploited by malicious actors. We introduce a novel threat model where adversarial third-party apps manipulate the context of interaction to trick LLM-based agents into revealing private information not relevant to the task at hand. Grounded in the framework of contextual integrity, we introduce AirGapAgent, a privacy-conscious agent designed to prevent unintended data leakage by restricting the agent's access to only the data necessary for a specific task. Extensive experiments using Gemini, GPT, and Mistral models as agents validate our approach's effectiveness in mitigating this form of context hijacking while maintaining core agent functionality. For example, we show that a single-query context hijacking attack on a Gemini Ultra agent reduces its ability to protect user data from 94% to 45%, while an AirGapAgent achieves 97% protection, rendering the same attack ineffective.
Contrastive Learning Method for Sequential Recommendation based on Multi-Intention Disentanglement
Hu, Zeyu, Xiao, Yuzhi, Huang, Tao, Huo, Xuanrong
Sequential recommendation is one of the important branches of recommender system, aiming to achieve personalized recommended items for the future through the analysis and prediction of users' ordered historical interactive behaviors. However, along with the growth of the user volume and the increasingly rich behavioral information, how to understand and disentangle the user's interactive multi-intention effectively also poses challenges to behavior prediction and sequential recommendation. In light of these challenges, we propose a Contrastive Learning sequential recommendation method based on Multi-Intention Disentanglement (MIDCL). In our work, intentions are recognized as dynamic and diverse, and user behaviors are often driven by current multi-intentions, which means that the model needs to not only mine the most relevant implicit intention for each user, but also impair the influence from irrelevant intentions. Therefore, we choose Variational Auto-Encoder (VAE) to realize the disentanglement of users' multi-intentions. We propose two types of contrastive learning paradigms for finding the most relevant user's interactive intention, and maximizing the mutual information of positive sample pairs, respectively. Experimental results show that MIDCL not only has significant superiority over most existing baseline methods, but also brings a more interpretable case to the research about intention-based prediction and recommendation.
TikTok and ByteDance sue US to block law forcing sale of the app
TikTok and its parent company ByteDance have sued to block a law signed by Joe Biden just weeks ago that would force the sale of the short video app or ban it from the US. The companies filed a lawsuit on Tuesday against the US government in the court of appeals for the District of Columbia, arguing the law is unconstitutional and violates free speech protections. Signed by the president on 24 April as part of a broader foreign aid package, the law gives China's ByteDance until 19 January 2025 to sell TikTok to an approved buyer. If it does not, the US would prohibit app stores from offering TikTok and bar internet hosting services from supporting TikTok. The companies argue in the suit that the divestiture required by the bill "is simply not commercially, legally, or technically possible. "There is no question: the Act (law) will force a shutdown of TikTok by January 19, 2025, silencing the 170 million Americans who use the platform to communicate in ways that cannot be replicated elsewhere," the suit said. The suit confirmed previous reports that ByteDance would not sell TikTok without the powerful recommendation algorithm that has fueled the platform's success. The Chinese government "has made clear that it would not permit a divestment of the recommendation engine that is a key to the success of TikTok in the United States", the suit said. The potential for a ban of TikTok has been escalating since Donald Trump first unsuccessfully attempted to block it in 2020. Critics of TikTok have expressed worry that the platform's China-based parent company could collect sensitive user data and censor content that goes against the Chinese government โ claims TikTok denies. Our US morning briefing breaks down the key stories of the day, telling you what's happening and why it matters Amid the political fallout, TikTok spent more than 2bn to implement measures to protect the data of US users, according to the suit. The suit also highlighted additional commitments the company made in a 90-page draft National Security Agreement developed through negotiations with the Committee on Foreign Investment in the United States (CFIUS), an interagency committee, chaired by the US Treasury Department, that reviews foreign investments in American businesses that implicate national security concerns. CFIUS had been in talks with TikTok to find solutions, though the agreement included TikTok agreeing to a "shut-down option" that would give the US government the authority to suspend TikTok in the US if it violated some obligations", according to the suit.
SVD-AE: Simple Autoencoders for Collaborative Filtering
Hong, Seoyoung, Choi, Jeongwhan, Lee, Yeon-Chang, Kumar, Srijan, Park, Noseong
Collaborative filtering (CF) methods for recommendation systems have been extensively researched, ranging from matrix factorization and autoencoder-based to graph filtering-based methods. Recently, lightweight methods that require almost no training have been recently proposed to reduce overall computation. However, existing methods still have room to improve the trade-offs among accuracy, efficiency, and robustness. In particular, there are no well-designed closed-form studies for \emph{balanced} CF in terms of the aforementioned trade-offs. In this paper, we design SVD-AE, a simple yet effective singular vector decomposition (SVD)-based linear autoencoder, whose closed-form solution can be defined based on SVD for CF. SVD-AE does not require iterative training processes as its closed-form solution can be calculated at once. Furthermore, given the noisy nature of the rating matrix, we explore the robustness against such noisy interactions of existing CF methods and our SVD-AE. As a result, we demonstrate that our simple design choice based on truncated SVD can be used to strengthen the noise robustness of the recommendation while improving efficiency. Code is available at https://github.com/seoyoungh/svd-ae.
Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
Jia, Jian, Wang, Yipei, Li, Yan, Chen, Honggang, Bai, Xuehan, Liu, Zhaocheng, Liang, Jian, Chen, Quan, Li, Han, Jiang, Peng, Gai, Kun
Contemporary recommender systems predominantly rely on collaborative filtering techniques, employing ID-embedding to capture latent associations among users and items. However, this approach overlooks the wealth of semantic information embedded within textual descriptions of items, leading to suboptimal performance in cold-start scenarios and long-tail user recommendations. Leveraging the capabilities of Large Language Models (LLMs) pretrained on massive text corpus presents a promising avenue for enhancing recommender systems by integrating open-world domain knowledge. In this paper, we propose an Llm-driven knowlEdge Adaptive RecommeNdation (LEARN) framework that synergizes open-world knowledge with collaborative knowledge. We address computational complexity concerns by utilizing pretrained LLMs as item encoders and freezing LLM parameters to avoid catastrophic forgetting and preserve open-world knowledge. To bridge the gap between the open-world and collaborative domains, we design a twin-tower structure supervised by the recommendation task and tailored for practical industrial application. Through offline experiments on the large-scale industrial dataset and online experiments on A/B tests, we demonstrate the efficacy of our approach.
Leveraging Intelligent Recommender system as a first step resilience measure -- A data-driven supply chain disruption response framework
ABSTRACT In light of the Industry 4.0 era, the global pandemic, and wars, interest in deploying digital technologies to increase supply chain resilience (SCRes) is rising. The utilization of recommender systems as a supply chain (SC) resilience measure is neglected, although these systems can enhance SC resilience. To address this problem, this research proposed a data-driven supply chain disruption response framework based on intelligent recommender system techniques. A prototype implementation was conducted to validate the developed framework through a practical use case. Results show that the proposed framework can be implemented as an effective SC disruption mitigation measure in the SCRes response phase and help SC participants better react after the SC disruption. Keywords: Supply chain resilience, Disruption risk, Recommender System, Supply chain risk management, Decision Support System 1 INTRODUCTION Supply chains (SC) are becoming more sophisticated and complex with globalization, as well as more risks and uncertainty (Manners-Bell 2017).
An Off-Policy Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems
Liu, Peng, Xu, Cong, Zhao, Ming, Zhu, Jiawei, Wang, Bin, Ren, Yi
As the last critical stage of RSs, Multi-Task Fusion (MTF) is responsible for combining multiple scores outputted by Multi-Task Learning (MTL) into a final score to maximize user satisfaction, which determines the ultimate recommendation results. Recently, to optimize long-term user satisfaction within a recommendation session, Reinforcement Learning (RL) is used for MTF in the industry. However, the off-policy RL algorithms used for MTF so far have the following severe problems: 1) to avoid out-of-distribution (OOD) problem, their constraints are overly strict, which seriously damage their performance; 2) they are unaware of the exploration policy used for producing training data and never interact with real environment, so only suboptimal policy can be learned; 3) the traditional exploration policies are inefficient and hurt user experience. To solve the above problems, we propose a novel method named IntegratedRL-MTF customized for MTF in large-scale RSs. IntegratedRL-MTF integrates off-policy RL model with our online exploration policy to relax overstrict and complicated constraints, which significantly improves its performance. We also design an extremely efficient exploration policy, which eliminates low-value exploration space and focuses on exploring potential high-value state-action pairs. Moreover, we adopt progressive training mode to further enhance our model's performance with the help of our exploration policy. We conduct extensive offline and online experiments in the short video channel of Tencent News. The results demonstrate that our model outperforms other models remarkably. IntegratedRL-MTF has been fully deployed in our RS and other large-scale RSs in Tencent, which have achieved significant improvements.
Concept -- An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors
Huang, Chen, Qin, Peixin, Deng, Yang, Lei, Wenqiang, Lv, Jiancheng, Chua, Tat-Seng
The conversational recommendation system (CRS) has been criticized regarding its user experience in real-world scenarios, despite recent significant progress achieved in academia. Existing evaluation protocols for CRS may prioritize system-centric factors such as effectiveness and fluency in conversation while neglecting user-centric aspects. Thus, we propose a new and inclusive evaluation protocol, Concept, which integrates both system- and user-centric factors. We conceptualise three key characteristics in representing such factors and further divide them into six primary abilities. To implement Concept, we adopt a LLM-based user simulator and evaluator with scoring rubrics that are tailored for each primary ability. Our protocol, Concept, serves a dual purpose. First, it provides an overview of the pros and cons in current CRS models. Second, it pinpoints the problem of low usability in the "omnipotent" ChatGPT and offers a comprehensive reference guide for evaluating CRS, thereby setting the foundation for CRS improvement.
IA-GCN: Interactive Graph Convolutional Network for Recommendation
Zhang, Yinan, Wang, Pei, Liu, Congcong, Zhao, Xiwei, Qi, Hao, He, Jie, Jin, Junsheng, Peng, Changping, Lin, Zhangang, Shao, Jingping
Recently, Graph Convolutional Network (GCN) has become a novel state-of-art for Collaborative Filtering (CF) based Recommender Systems (RS). It is a common practice to learn informative user and item representations by performing embedding propagation on a user-item bipartite graph, and then provide the users with personalized item suggestions based on the representations. Despite effectiveness, existing algorithms neglect precious interactive features between user-item pairs in the embedding process. When predicting a user's preference for different items, they still aggregate the user tree in the same way, without emphasizing target-related information in the user neighborhood. Such a uniform aggregation scheme easily leads to suboptimal user and item representations, limiting the model expressiveness to some extent. In this work, we address this problem by building bilateral interactive guidance between each user-item pair and proposing a new model named IA-GCN (short for InterActive GCN). Specifically, when learning the user representation from its neighborhood, we assign higher attention weights to those neighbors similar to the target item. Correspondingly, when learning the item representation, we pay more attention to those neighbors resembling the target user. This leads to interactive and interpretable features, effectively distilling target-specific information through each graph convolutional operation. Our model is built on top of LightGCN, a state-of-the-art GCN model for CF, and can be combined with various GCN-based CF architectures in an end-to-end fashion. Extensive experiments on three benchmark datasets demonstrate the effectiveness and robustness of IA-GCN.
Thoughtful Things: Building Human-Centric Smart Devices with Small Language Models
King, Evan, Yu, Haoxiang, Vartak, Sahil, Jacob, Jenna, Lee, Sangsu, Julien, Christine
Everyday devices like light bulbs and kitchen appliances are now embedded with so many features and automated behaviors that they have become complicated to actually use. While such "smart" capabilities can better support users' goals, the task of learning the "ins and outs" of different devices is daunting. Voice assistants aim to solve this problem by providing a natural language interface to devices, yet such assistants cannot understand loosely-constrained commands, they lack the ability to reason about and explain devices' behaviors to users, and they rely on connectivity to intrusive cloud infrastructure. Toward addressing these issues, we propose thoughtful things: devices that leverage lightweight, on-device language models to take actions and explain their behaviors in response to unconstrained user commands. We propose an end-to-end framework that leverages formal modeling, automated training data synthesis, and generative language models to create devices that are both capable and thoughtful in the presence of unconstrained user goals and inquiries. Our framework requires no labeled data and can be deployed on-device, with no cloud dependency. We implement two thoughtful things (a lamp and a thermostat) and deploy them on real hardware, evaluating their practical performance.