Personal Assistant Systems
Big Help or Big Brother? Auditing Tracking, Profiling, and Personalization in Generative AI Assistants
Vekaria, Yash, Canino, Aurelio Loris, Levitsky, Jonathan, Ciechonski, Alex, Callejo, Patricia, Mandalari, Anna Maria, Shafiq, Zubair
Generative AI (GenAI) browser assistants integrate powerful capabilities of GenAI in web browsers to provide rich experiences such as question answering, content summarization, and agentic navigation. These assistants, available today as browser extensions, can not only track detailed browsing activity such as search and click data, but can also autonomously perform tasks such as filling forms, raising significant privacy concerns. It is crucial to understand the design and operation of GenAI browser extensions, including how they collect, store, process, and share user data. To this end, we study their ability to profile users and personalize their responses based on explicit or inferred demographic attributes and interests of users. We perform network traffic analysis and use a novel prompting framework to audit tracking, profiling, and personalization by the ten most popular GenAI browser assistant extensions. We find that instead of relying on local in-browser models, these assistants largely depend on server-side APIs, which can be auto-invoked without explicit user interaction. When invoked, they collect and share webpage content, often the full HTML DOM and sometimes even the user's form inputs, with their first-party servers. Some assistants also share identifiers and user prompts with third-party trackers such as Google Analytics. The collection and sharing continues even if a webpage contains sensitive information such as health or personal information such as name or SSN entered in a web form. We find that several GenAI browser assistants infer demographic attributes such as age, gender, income, and interests and use this profile--which carries across browsing contexts--to personalize responses. In summary, our work shows that GenAI browser assistants can and do collect personal and sensitive information for profiling and personalization with little to no safeguards.
Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models
Huang, Chengkai, Wu, Junda, Xia, Yu, Yu, Zixu, Wang, Ruhan, Yu, Tong, Zhang, Ruiyi, Rossi, Ryan A., Kveton, Branislav, Zhou, Dongruo, McAuley, Julian, Yao, Lina
Recent breakthroughs in Large Language Models (LLMs) have led to the emergence of agentic AI systems that extend beyond the capabilities of standalone models. By empowering LLMs to perceive external environments, integrate multimodal information, and interact with various tools, these agentic systems exhibit greater autonomy and adaptability across complex tasks. This evolution brings new opportunities to recommender systems (RS): LLM-based Agentic RS (LLM-ARS) can offer more interactive, context-aware, and proactive recommendations, potentially reshaping the user experience and broadening the application scope of RS. Despite promising early results, fundamental challenges remain, including how to effectively incorporate external knowledge, balance autonomy with controllability, and evaluate performance in dynamic, multimodal settings. In this perspective paper, we first present a systematic analysis of LLM-ARS: (1) clarifying core concepts and architectures; (2) highlighting how agentic capabilities -- such as planning, memory, and multimodal reasoning -- can enhance recommendation quality; and (3) outlining key research questions in areas such as safety, efficiency, and lifelong personalization. We also discuss open problems and future directions, arguing that LLM-ARS will drive the next wave of RS innovation. Ultimately, we foresee a paradigm shift toward intelligent, autonomous, and collaborative recommendation experiences that more closely align with users' evolving needs and complex decision-making processes.
ECLAIR: Enhanced Clarification for Interactive Responses
Murzaku, John, Liu, Zifan, Tanjim, Md Mehrab, Muppala, Vaishnavi, Chen, Xiang, Li, Yunyao
We present ECLAIR (Enhanced CLArification for Interactive Responses), a novel unified and end-to-end framework for interactive disambiguation in enterprise AI assistants. ECLAIR generates clarification questions for ambiguous user queries and resolves ambiguity based on the user's response.We introduce a generalized architecture capable of integrating ambiguity information from multiple downstream agents, enhancing context-awareness in resolving ambiguities and allowing enterprise specific definition of agents. We further define agents within our system that provide domain-specific grounding information. We conduct experiments comparing ECLAIR to few-shot prompting techniques and demonstrate ECLAIR's superior performance in clarification question generation and ambiguity resolution.
RAG-based User Profiling for Precision Planning in Mixed-precision Over-the-Air Federated Learning
Yuan, Jinsheng, Tang, Yun, Guo, Weisi
Mixed-precision computing, a widely applied technique in AI, offers a larger trade-off space between accuracy and efficiency. The recent purposed Mixed-Precision Over-the-Air Federated Learning (MP-OTA-FL) enables clients to operate at appropriate precision levels based on their heterogeneous hardware, taking advantages of the larger trade-off space while covering the quantization overheads in the mixed-precision modulation scheme for the OTA aggregation process. A key to further exploring the potential of the MP-OTA-FL framework is the optimization of client precision levels. The choice of precision level hinges on multifaceted factors including hardware capability, potential client contribution, and user satisfaction, among which factors can be difficult to define or quantify. In this paper, we propose a RAG-based User Profiling for precision planning framework that integrates retrieval-augmented LLMs and dynamic client profiling to optimize satisfaction and contributions. This includes a hybrid interface for gathering device/user insights and an RAG database storing historical quantization decisions with feedback. Experiments show that our method boosts satisfaction, energy savings, and global model accuracy in MP-OTA-FL systems.
Why is Elon Musk still CEO of Tesla?
In this week's edition: Elon Musk suffers the slings and arrows of outrageous fortune, Apple beats itself up over Siri, and Meta goes after one of its own over a tell-all book. The past 10 days have marked several of the most significant setbacks for Musk in months. Tesla, arguably his marquee company, continued to fall in value as investors worried about the threat of trade war and possible recession โ as well as declining profits. Escalating protests against the company over the billionaire's role in the government also grew in number and intensity across the US, coupled with rising cases of vandalism and social stigma against his cars. SpaceX has also struggled, with one of its rockets dramatically exploding in midflight last week and then an announcement that it was delaying a rescue mission to retrieve "stranded" astronauts. The company tried again two days later.
ShuffleGate: An Efficient and Self-Polarizing Feature Selection Method for Large-Scale Deep Models in Industry
Huang, Yihong, Chu, Chen, Zhang, Fan, Chen, Fei, Lin, Yu, Li, Ruiduan, Li, Zhihao
Deep models in industrial applications rely on thousands of features for accurate predictions, such as deep recommendation systems. While new features are introduced to capture evolving user behavior, outdated or redundant features often remain, significantly increasing storage and computational costs. To address this issue, feature selection methods are widely adopted to identify and remove less important features. However, existing approaches face two major challenges: (1) they often require complex hyperparameter (Hp) tuning, making them difficult to employ in practice, and (2) they fail to produce well-separated feature importance scores, which complicates straightforward feature removal. Moreover, the impact of removing unimportant features can only be evaluated through retraining the model, a time-consuming and resource-intensive process that severely hinders efficient feature selection. To solve these challenges, we propose a novel feature selection approach, ShuffleGate. In particular, it shuffles all feature values across instances simultaneously and uses a gating mechanism that allows the model to dynamically learn the weights for combining the original and shuffled inputs. Notably, it can generate well-separated feature importance scores and estimate the performance without retraining the model, while introducing only a single Hp. Experiments on four public datasets show that our approach outperforms state-of-the-art methods in feature selection for model retraining. Moreover, it has been successfully integrated into the daily iteration of Bilibili's search models across various scenarios, where it significantly reduces feature set size (up to 60%+) and computational resource usage (up to 20%+), while maintaining comparable performance.
Scaled Supervision is an Implicit Lipschitz Regularizer
Ouyang, Zhongyu, Zhang, Chunhui, Jia, Yaning, Vosoughi, Soroush
In modern social media, recommender systems (RecSys) rely on the click-through rate (CTR) as the standard metric to evaluate user engagement. CTR prediction is traditionally framed as a binary classification task to predict whether a user will interact with a given item. However, this approach overlooks the complexity of real-world social modeling, where the user, item, and their interactive features change dynamically in fast-paced online environments. This dynamic nature often leads to model instability, reflected in overfitting short-term fluctuations rather than higher-level interactive patterns. While overfitting calls for more scaled and refined supervisions, current solutions often rely on binary labels that overly simplify fine-grained user preferences through the thresholding process, which significantly reduces the richness of the supervision. Therefore, we aim to alleviate the overfitting problem by increasing the supervision bandwidth in CTR training. Specifically, (i) theoretically, we formulate the impact of fine-grained preferences on model stability as a Lipschitz constrain; (ii) empirically, we discover that scaling the supervision bandwidth can act as an implicit Lipschitz regularizer, stably optimizing existing CTR models to achieve better generalizability. Extensive experiments show that this scaled supervision significantly and consistently improves the optimization process and the performance of existing CTR models, even without the need for additional hyperparameter tuning.
Rolling Forward: Enhancing LightGCN with Causal Graph Convolution for Credit Bond Recommendation
Ghiye, Ashraf, Barreau, Baptiste, Carlier, Laurent, Vazirgiannis, Michalis
Graph Neural Networks have significantly advanced research in recommender systems over the past few years. These methods typically capture global interests using aggregated past interactions and rely on static embeddings of users and items over extended periods of time. While effective in some domains, these methods fall short in many real-world scenarios, especially in finance, where user interests and item popularity evolve rapidly over time. To address these challenges, we introduce a novel extension to Light Graph Convolutional Network (LightGCN) designed to learn temporal node embeddings that capture dynamic interests. Our approach employs causal convolution to maintain a forward-looking model architecture. By preserving the chronological order of user-item interactions and introducing a dynamic update mechanism for embeddings through a sliding window, the proposed model generates well-timed and contextually relevant recommendations. Extensive experiments on a real-world dataset from BNP Paribas demonstrate that our approach significantly enhances the performance of LightGCN while maintaining the simplicity and efficiency of its architecture. Our findings provide new insights into designing graph-based recommender systems in time-sensitive applications, particularly for financial product recommendations.
Alexa is about to send everything you tell it to Amazon
Amazon's Alexa service is rolling out on March 28, and with it supposedly comes a more personalized, intuitive, and powerful digital assistant thanks to its underlying generative AI technology. But for the new features to work, the company is asking a lot from its Echo and smart device users--whether or not they choose to use Alexa at all. Alexa is billed as a major upgrade that includes individual voice recognition through Alexa Voice ID, nuanced calendar scheduling, Ring home security system integrations, and product purchasing capabilities. It's Amazon's latest effort to generate a profit from Alexa, which lost 25 billion in revenue between 2007-2021 according to The Wall Street Journal last year. While Alexa will be added to all Prime subscriptions, users without Prime can enroll in the program for 19.99 per month.
Apple should focus on fixing Siri, not redesigning iOS again
Now that Apple's recent slew of hardware releases are behind us, we got some news on the software side last week. First, the company publicly announced that it was delaying the smarter, more personal version of Siri that'll be powered by Apple Intelligence. Then, rumors sprang up again that Apple was giving an extensive visual update to its software platforms, including iOS 19 and macOS 16 which are expected to be revealed at WWDC in June. The sources for this redesign rumor are solid. Jon Prosser dropped a video on his YouTube channel Front Page Tech back in January where he said that he had seen a redesigned Camera app for the next version of iOS that had a number of interface changes that made it feel more like a visionOS app. His thinking is that Apple wouldn't redesign a core app like Camera without bringing changes to some of the rest of the OS, as well.