Personal Assistant Systems
Decision-Oriented Dialogue for Human-AI Collaboration
Lin, Jessy, Tomlin, Nicholas, Andreas, Jacob, Eisner, Jason
We describe a class of tasks called decision-oriented dialogues, in which AI assistants must collaborate with one or more humans via natural language to help them make complex decisions. We formalize three domains in which users face everyday decisions: (1) choosing an assignment of reviewers to conference papers, (2) planning a multi-step itinerary in a city, and (3) negotiating travel plans for a group of friends. In each of these settings, AI assistants and users have disparate abilities that they must combine to arrive at the best decision: assistants can access and process large amounts of information, while users have preferences and constraints external to the system. For each task, we build a dialogue environment where agents receive a reward based on the quality of the final decision they reach. Using these environments, we collect human-human dialogues with humans playing the role of assistant. To compare how current AI assistants communicate in these settings, we present baselines using large language models in self-play. Finally, we highlight a number of challenges models face in decision-oriented dialogues, ranging from efficient communication to reasoning and optimization, and release our environments as a testbed for future modeling work.
Explaining Recommendation System Using Counterfactual Textual Explanations
Ranjbar, Niloofar, Momtazi, Saeedeh, Homayounpour, MohammadMehdi
Currently, there is a significant amount of research being conducted in the field of artificial intelligence to improve the explainability and interpretability of deep learning models. It is found that if end-users understand the reason for the production of some output, it is easier to trust the system. Recommender systems are one example of systems that great efforts have been conducted to make their output more explainable. One method for producing a more explainable output is using counterfactual reasoning, which involves altering minimal features to generate a counterfactual item that results in changing the output of the system. This process allows the identification of input features that have a significant impact on the desired output, leading to effective explanations. In this paper, we present a method for generating counterfactual explanations for both tabular and textual features. We evaluated the performance of our proposed method on three real-world datasets and demonstrated a +5\% improvement on finding effective features (based on model-based measures) compared to the baseline method.
The Darwinian Argument for Worrying About AI
A broad coalition of AI experts recently released a brief public statement warning of "the risk of extinction from AI." There are many different ways in which AIs might become serious dangers to humanity, and the exact nature of the risks is still debated, but imagine a CEO who acquires an AI assistant. They begin by giving it simple, low-level assignments, like drafting emails and suggesting purchases. As the AI improves over time, it progressively becomes much better at these things than their employees. So the AI gets "promoted."
Federal regulators fine Amazon $25 million over child privacy issues
The U.S. government alleges that Amazon violated the Children's Online Privacy Protection Act, a 1998 law that has recently been enforced against other popular tech companies including Fortnite-maker Epic Games and YouTube. More than 800,000 children under the age of 13 have their own Alexa profiles, according to the lawsuit filed by the Department of Justice on behalf of the Federal Trade Commission. About five years ago, the company began offering a number of products specifically aimed at children, including the "Echo Dot Kids Edition" smart speaker and parental controls called "FreeTime on Alexa."
The new Amazon Echo Pop is stylish, affordable, and offers good sound for small spaces
The Echo Pop is like a sliced-in-half Echo Dot smart speaker. The setup for the Echo Pop is in the Alexa app, available for iOS and Android devices, and takes about five minutes to complete. The Echo Pop has a small light bar on the top of the speaker that illuminates when the speaker is activated (blue), muted (red), or there are notifications to review (yellow). The top of the speaker has press buttons to turn the volume up/down, and mute the mic. The Echo Pop is built with Amazon's AZ2 Neural Edge processor, allowing for more local processing of voice commands.
Amazon Echo Pop Review (2023): Fun To Look At
Its newest player has arrived: the Echo Pop, a $40 smart speaker with a half-circle body instead of the fully rounded forms of Amazon's other Echo speakers. I've been enjoying the Echo Pop as a small desk companion. Its sound is fine enough for its price--though other, similarly priced Amazon speakers will be a better music experience--but the biggest appeal of the Echo Pop is easily the fun colors and interesting form factor it has. When it's next to the Echo Dot (5th Gen), the Echo Pop looks to be the same size--that is, if you're looking head-on at the Pop's flat, circular face. But when you take a look at the Echo Pop from the side, you can see it's only about two-thirds as deep. Where you really can see the size difference is on the bottom.
TransAct: Transformer-based Realtime User Action Model for Recommendation at Pinterest
Xia, Xue, Eksombatchai, Pong, Pancha, Nikil, Badani, Dhruvil Deven, Wang, Po-Wei, Gu, Neng, Joshi, Saurabh Vishwas, Farahpour, Nazanin, Zhang, Zhiyuan, Zhai, Andrew
Sequential models that encode user activity for next action prediction have become a popular design choice for building web-scale personalized recommendation systems. Traditional methods of sequential recommendation either utilize end-to-end learning on realtime user actions, or learn user representations separately in an offline batch-generated manner. This paper (1) presents Pinterest's ranking architecture for Homefeed, our personalized recommendation product and the largest engagement surface; (2) proposes TransAct, a sequential model that extracts users' short-term preferences from their realtime activities; (3) describes our hybrid approach to ranking, which combines end-to-end sequential modeling via TransAct with batch-generated user embeddings. The hybrid approach allows us to combine the advantages of responsiveness from learning directly on realtime user activity with the cost-effectiveness of batch user representations learned over a longer time period. We describe the results of ablation studies, the challenges we faced during productionization, and the outcome of an online A/B experiment, which validates the effectiveness of our hybrid ranking model. We further demonstrate the effectiveness of TransAct on other surfaces such as contextual recommendations and search. Our model has been deployed to production in Homefeed, Related Pins, Notifications, and Search at Pinterest.
A Survey of Graph Prompting Methods: Techniques, Applications, and Challenges
Wu, Xuansheng, Zhou, Kaixiong, Sun, Mingchen, Wang, Xin, Liu, Ninghao
The recent "pre-train, prompt, predict training" paradigm has gained popularity as a way to learn generalizable models with limited labeled data. The approach involves using a pre-trained model and a prompting function that applies a template to input samples, adding indicative context and reformulating target tasks as the pre-training task. However, the design of prompts could be a challenging and time-consuming process in complex tasks. The limitation can be addressed by using graph data, as graphs serve as structured knowledge repositories by explicitly modeling the interaction between entities. In this survey, we review prompting methods from the graph perspective, where prompting functions are augmented with graph knowledge. In particular, we introduce the basic concepts of graph prompt learning, organize the existing work of designing graph prompting functions, and describe their applications and future challenges. This survey will bridge the gap between graphs and prompt design to facilitate future methodology development.
Robust Reinforcement Learning Objectives for Sequential Recommender Systems
Mozifian, Melissa, Sylvain, Tristan, Evans, Dave, Meng, Lili
Attention-based sequential recommendation methods have demonstrated promising results by accurately capturing users' dynamic interests from historical interactions. In addition to generating superior user representations, recent studies have begun integrating reinforcement learning (RL) into these models. Framing sequential recommendation as an RL problem with reward signals, unlocks developing recommender systems (RS) that consider a vital aspect-incorporating direct user feedback in the form of rewards to deliver a more personalized experience. Nonetheless, employing RL algorithms presents challenges, including off-policy training, expansive combinatorial action spaces, and the scarcity of datasets with sufficient reward signals. Contemporary approaches have attempted to combine RL and sequential modeling, incorporating contrastive-based objectives and negative sampling strategies for training the RL component. In this study, we further emphasize the efficacy of contrastive-based objectives paired with augmentation to address datasets with extended horizons. Additionally, we recognize the potential instability issues that may arise during the application of negative sampling. These challenges primarily stem from the data imbalance prevalent in real-world datasets, which is a common issue in offline RL contexts. While our established baselines attempt to mitigate this through various techniques, instability remains an issue. Therefore, we introduce an enhanced methodology aimed at providing a more effective solution to these challenges.
Graph Exploration Matters: Improving both individual-level and system-level diversity in WeChat Feed Recommender
Yang, Shuai, Zhang, Lixin, Xia, Feng, Lin, Leyu
There are roughly three stages in real industrial recommendation systems, candidates generation (retrieval), ranking and reranking. Individual-level diversity and system-level diversity are both important for industrial recommender systems. The former focus on each single user's experience, while the latter focus on the difference among users. Graph-based retrieval strategies are inevitably hijacked by heavy users and popular items, leading to the convergence of candidates for users and the lack of system-level diversity. Meanwhile, in the reranking phase, Determinantal Point Process (DPP) is deployed to increase individual-level diverisity. Heavily relying on the semantic information of items, DPP suffers from clickbait and inaccurate attributes. Besides, most studies only focus on one of the two levels of diversity, and ignore the mutual influence among different stages in real recommender systems. We argue that individual-level diversity and system-level diversity should be viewed as an integrated problem, and we provide an efficient and deployable solution for web-scale recommenders. Generally, we propose to employ the retrieval graph information in diversity-based reranking, by which to weaken the hidden similarity of items exposed to users, and consequently gain more graph explorations to improve the system-level diveristy. Besides, we argue that users' propensity for diversity changes over time in content feed recommendation. Therefore, with the explored graph, we also propose to capture the user's real-time personalized propensity to the diversity. We implement and deploy the combined system in WeChat App's Top Stories used by hundreds of millions of users. Offline simulations and online A/B tests show our solution can effectively improve both user engagement and system revenue.