Goto

Collaborating Authors

 Personal Assistant Systems


kNN-Embed: Locally Smoothed Embedding Mixtures For Multi-interest Candidate Retrieval

arXiv.org Artificial Intelligence

Candidate retrieval is the first stage in recommendation systems, where a light-weight system is used to retrieve potentially relevant items for an input user. These candidate items are then ranked and pruned in later stages of recommender systems using a more complex ranking model. As the top of the recommendation funnel, it is important to retrieve a high-recall candidate set to feed into downstream ranking models. A common approach is to leverage approximate nearest neighbor (ANN) search from a single dense query embedding; however, this approach this can yield a low-diversity result set with many near duplicates. As users often have multiple interests, candidate retrieval should ideally return a diverse set of candidates reflective of the user's multiple interests. To this end, we introduce kNN-Embed, a general approach to improving diversity in dense ANN-based retrieval. kNN-Embed represents each user as a smoothed mixture over learned item clusters that represent distinct "interests" of the user. By querying each of a user's mixture component in proportion to their mixture weights, we retrieve a high-diversity set of candidates reflecting elements from each of a user's interests. We experimentally compare kNN-Embed to standard ANN candidate retrieval, and show significant improvements in overall recall and improved diversity across three datasets. Accompanying this work, we open source a large Twitter follow-graph dataset (https://huggingface.co/datasets/Twitter/TwitterFollowGraph), to spur further research in graph-mining and representation learning for recommender systems.


Having no luck on Tinder? Get a ROBOT to choose your photos: Dating app tests AI tool that selects users' best-looking photos for their profiles

Daily Mail - Science & tech

Struggle to pick the best picture for your dating profile? Maybe a robot can help. That's because Tinder has just started testing a new artificial intelligence (AI) tool that selects users' best-looking photos for their profiles. It studies a user's photo album and selects the five images that best represent them in the hope of enhancing the chances someone will swipe right. Bernard Kim, the chief executive of Tinder's owner, Match Group, said the aim of the feature was to remove the stress that comes with having to choose a profile picture.


AI influencer attracts men despite not being real; expert shares red flags on celebrity dating apps

FOX News

Celebrity matchmaker Alessandra Conti talks about how AI bots are getting onto celebrity dating application Raya. Virtual influencer Milla Sofia is garnering the attention of men on social media, posing in tiny bikinis, gorgeous gowns and even golf attire. There's only one catch: She's not real. The Finland-based influencer openly discloses on her platforms that she is an artificial intelligent bot, and on her website, Sofia is described as a "24 year old virtual influencer and fashion model." However, that has not curbed interest, with some social media users indicating they wish to meet her in-person.


Let's Give a Voice to Conversational Agents in Virtual Reality

arXiv.org Artificial Intelligence

The dialogue experience with conversational agents can be greatly enhanced with multimodal and immersive interactions in virtual reality. In this work, we present an open-source architecture with the goal of simplifying the development of conversational agents operating in virtual environments. The architecture offers the possibility of plugging in conversational agents of different domains and adding custom or cloud-based Speech-To-Text and Text-To-Speech models to make the interaction voice-based. Using this architecture, we present two conversational prototypes operating in the digital health domain developed in Unity for both non-immersive displays and VR headsets.


Microsoft begins pulling the plug on Cortana

PCWorld

Microsoft has begun following through on its promise to kill off Cortana, the AI assistant that debuted in Windows 10. Microsoft's recent Windows Insider build in the Dev channel turns off Cortana, which only appears as an app within the Microsoft Store. If you apply an available update to the Cortana app, that will essentially turn it off: You'll receive a message saying that Cortana has been deprecated -- programmer-speak for turning off a specific feature. Microsoft had made its intentions clear: In June, the company said that it would begin ending support for the Cortana app in August. That doesn't mean Cortana is entirely gone.


Here's a thought: Tinder tests AI tool to help users select best-looking photos

The Guardian

Beauty is now in the AI of the beholder. The dating app is testing an artificial intelligence tool that selects users' best-looking photos for their profiles, in the hope it will enhance the chances someone will swipe right. The tool will look at a user's photo album and select the five images that best represent them. Bernard Kim, the chief executive of Tinder's owner, Match Group, said AI could answer people's concerns about which picture best represents them and take the stress away from selection. "I really think AI can help our users build better profiles in a more efficient way that really do showcase their personalities," Kim said in a call with investors and analysts.


ADRNet: A Generalized Collaborative Filtering Framework Combining Clinical and Non-Clinical Data for Adverse Drug Reaction Prediction

arXiv.org Artificial Intelligence

Adverse drug reaction (ADR) prediction plays a crucial role in both health care and drug discovery for reducing patient mortality and enhancing drug safety. Recently, many studies have been devoted to effectively predict the drug-ADRs incidence rates. However, these methods either did not effectively utilize non-clinical data, i.e., physical, chemical, and biological information about the drug, or did little to establish a link between content-based and pure collaborative filtering during the training phase. In this paper, we first formulate the prediction of multi-label ADRs as a drug-ADR collaborative filtering problem, and to the best of our knowledge, this is the first work to provide extensive benchmark results of previous collaborative filtering methods on two large publicly available clinical datasets. Then, by exploiting the easy accessible drug characteristics from non-clinical data, we propose ADRNet, a generalized collaborative filtering framework combining clinical and non-clinical data for drug-ADR prediction. Specifically, ADRNet has a shallow collaborative filtering module and a deep drug representation module, which can exploit the high-dimensional drug descriptors to further guide the learning of low-dimensional ADR latent embeddings, which incorporates both the benefits of collaborative filtering and representation learning. Extensive experiments are conducted on two publicly available real-world drug-ADR clinical datasets and two non-clinical datasets to demonstrate the accuracy and efficiency of the proposed ADRNet. The code is available at https://github.com/haoxuanli-pku/ADRnet.


Weighted Multi-Level Feature Factorization for App ads CTR and installation prediction

arXiv.org Artificial Intelligence

This paper provides an overview of the approach we used as team ISISTANITOS for the ACM RecSys Challenge 2023. The competition was organized by ShareChat, and involved predicting the probability of a user clicking an app ad and/or installing an app, to improve deep funnel optimization and a special focus on user privacy. Our proposed method inferring the probabilities of clicking and installing as two different, but related tasks. Hence, the model engineers a specific set of features for each task and a set of shared features. Our model is called Weighted Multi-Level Feature Factorization because it considers the interaction of different order features, where the order is associated to the depth in a neural network. The prediction for a given task is generated by combining the task specific and shared features on the different levels. Our submission achieved the 11 rank and overall score of 55 in the competition academia-track final results. We release our source code at: https://github.com/knife982000/RecSys2023Challenge


Incorporating Recklessness to Collaborative Filtering based Recommender Systems

arXiv.org Artificial Intelligence

Recommender systems that include some reliability measure of their predictions tend to be more conservative in forecasting, due to their constraint to preserve reliability. This leads to a significant drop in the coverage and novelty that these systems can provide. In this paper, we propose the inclusion of a new term in the learning process of matrix factorization-based recommender systems, called recklessness, which enables the control of the risk level desired when making decisions about the reliability of a prediction. Experimental results demonstrate that recklessness not only allows for risk regulation but also improves the quantity and quality of predictions provided by the recommender system.


Athena 2.0: Discourse and User Modeling in Open Domain Dialogue

arXiv.org Artificial Intelligence

Conversational agents are consistently growing in popularity and many people interact with them every day. While many conversational agents act as personal assistants, they can have many different goals. Some are task-oriented, such as providing customer support for a bank or making a reservation. Others are designed to be empathetic and to form emotional connections with the user. The Alexa Prize Challenge aims to create a socialbot, which allows the user to engage in coherent conversations, on a range of popular topics that will interest the user. Here we describe Athena 2.0, UCSC's conversational agent for Amazon's Socialbot Grand Challenge 4. Athena 2.0 utilizes a novel knowledge-grounded discourse model that tracks the entity links that Athena introduces into the dialogue, and uses them to constrain named-entity recognition and linking, and coreference resolution. Athena 2.0 also relies on a user model to personalize topic selection and other aspects of the conversation to individual users.