AITopics | Personal Assistant Systems

Collaborating Authors

Personal Assistant Systems

News Overviews Instructional Materials AI-Alerts Classics

Incorporating Bias-aware Margins into Contrastive Loss for Collaborative Filtering An Zhang Wenchang Ma Xiang Wang T at-Seng Chua Sea-NExT Joint Lab National University of Singapore

Neural Information Processing SystemsAug-14-2025, 04:58:11 GMT

Xiang Wang is the corresponding author.

bc loss, recommendation, representation, (16 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.40)
Asia > China (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Regret minimization in Linear Bandits with offline data via extended D-optimal exploration

Vijayan, Sushant, Suggala, Arun, Shanmugam, Karthikeyan, Pal, Soumyabrata

arXiv.org Machine LearningAug-14-2025

We consider the problem of online regret minimization in linear bandits with access to prior observations (offline data) from the underlying bandit model. There are numerous applications where extensive offline data is often available, such as in recommendation systems, online advertising. Consequently, this problem has been studied intensively in recent literature. Our algorithm, Offline-Online Phased Elimination (OOPE), effectively incorporates the offline data to substantially reduce the online regret compared to prior work. To leverage offline information prudently, OOPE uses an extended D-optimal design within each exploration phase. OOPE achieves an online regret is $\tilde{O}(\sqrt{\deff T \log \left(|\mathcal{A}|T\right)}+d^2)$. $\deff \leq d)$ is the effective problem dimension which measures the number of poorly explored directions in offline data and depends on the eigen-spectrum $(λ_k)_{k \in [d]}$ of the Gram matrix of the offline data. The eigen-spectrum $(λ_k)_{k \in [d]}$ is a quantitative measure of the \emph{quality} of offline data. If the offline data is poorly explored ($\deff \approx d$), we recover the established regret bounds for purely online setting while, when offline data is abundant ($\Toff >> T$) and well-explored ($\deff = o(1) $), the online regret reduces substantially. Additionally, we provide the first known minimax regret lower bounds in this setting that depend explicitly on the quality of the offline data. These lower bounds establish the optimality of our algorithm in regimes where offline data is either well-explored or poorly explored. Finally, by using a Frank-Wolfe approximation to the extended optimal design we further improve the $O(d^{2})$ term to $O\left(\frac{d^{2}}{\deff} \min \{ \deff,1\} \right)$, which can be substantial in high dimensions with moderate quality of offline data $\deff = Ω(1)$.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Machine Learning

2508.0842

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > India > Maharashtra > Mumbai (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine (0.93)
Law > Statutes (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)
(2 more...)

Add feedback

SPARC: Soft Probabilistic Adaptive multi-interest Retrieval Model via Codebooks for recommender system

Shi, Jialiang, Dou, Yaguang, Qi, Tian

arXiv.org Artificial IntelligenceAug-14-2025

Modeling multi-interests has arisen as a core problem in real-world RS. Current multi-interest retrieval methods pose three major challenges: 1) Interests, typically extracted from predefined external knowledge, are invariant. Failed to dynamically evolve with users' real-time consumption preferences. 2) Online inference typically employs an over-exploited strategy, mainly matching users' existing interests, lacking proactive exploration and discovery of novel and long-tail interests. To address these challenges, we propose a novel retrieval framework named SPARC(Soft Probabilistic Adaptive Retrieval Model via Codebooks). Our contribution is two folds. First, the framework utilizes Residual Quantized Variational Autoencoder (RQ-VAE) to construct a discretized interest space. It achieves joint training of the RQ-VAE with the industrial large scale recommendation model, mining behavior-aware interests that can perceive user feedback and evolve dynamically. Secondly, a probabilistic interest module that predicts the probability distribution over the entire dynamic and discrete interest space. This facilitates an efficient "soft-search" strategy during online inference, revolutionizing the retrieval paradigm from "passive matching" to "proactive exploration" and thereby effectively promoting interest discovery. Online A/B tests on an industrial platform with tens of millions daily active users, have achieved substantial gains in business metrics: +0.9% increase in user view duration, +0.4% increase in user page views (PV), and a +22.7% improvement in PV500(new content reaching 500 PVs in 24 hours). Offline evaluations are conducted on open-source Amazon Product datasets. Metrics, such as Recall@K and Normalized Discounted Cumulative Gain@K(NDCG@K), also showed consistent improvement. Both online and offline experiments validate the efficacy and practical value of the proposed method.

artificial intelligence, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

2508.0909

Country: North America > United States > Idaho (0.16)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

VoteGCL: Enhancing Graph-based Recommendations with Majority-Voting LLM-Rerank Augmentation

Nguyen, Minh-Anh, Nguyen, Bao, T., Ha Lan N., Hoang, Tuan Anh, Le, Duc-Trong, Le, Dung D.

arXiv.org Artificial IntelligenceAug-14-2025

Recommendation systems often suffer from data sparsity, caused by limited user-item interactions, which degrades their performance and amplifies popularity bias in real-world scenarios. This paper proposes a novel data augmentation framework that leverages Large Language Models (LLMs) and item textual descriptions to enrich interaction data. By few-shot prompting LLMs multiple times to rerank items and aggregating the results via majority voting, we generate high-confidence synthetic user-item interactions, supported by theoretical guarantees based on the concentration of measure. To effectively leverage the augmented data in the context of a graph recommendation system, we integrate it into a graph contrastive learning framework to mitigate distributional shift and alleviate popularity bias. Extensive experiments show that our method improves accuracy and reduces popularity bias, outperforming strong baselines.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2507.21563

Genre: Research Report > New Finding (0.68)

Industry:

Media > Film (0.69)
Leisure & Entertainment (0.47)
Information Technology (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Deep Learning Model Acceleration and Optimization Strategies for Real-Time Recommendation Systems

Shao, Junli, Dong, Jing, Wang, Dingzhou, Shih, Kowei, Li, Dannier, Zhou, Chengrui

arXiv.org Artificial IntelligenceAug-14-2025

With the rapid growth of Internet services, recommendation systems play a central role in delivering personalized content. Faced with massive user requests and complex model architectures, the key challenge for real-time recommendation systems is how to reduce inference latency and increase system throughput without sacrificing recommendation quality. This paper addresses the high computational cost and resource bottlenecks of deep learning models in real-time settings by proposing a combined set of modeling- and system-level acceleration and optimization strategies. At the model level, we dramatically reduce parameter counts and compute requirements through lightweight network design, structured pruning, and weight quantization. At the system level, we integrate multiple heterogeneous compute platforms and high-performance inference libraries, and we design elastic inference scheduling and load-balancing mechanisms based on real-time load characteristics. Experiments show that, while maintaining the original recommendation accuracy, our methods cut latency to less than 30% of the baseline and more than double system throughput, offering a practical solution for deploying large-scale online recommendation services.

artificial intelligence, arxiv preprint arxiv, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2506.11421

Country: North America > United States > Nebraska (0.28)

Genre: Research Report (0.64)

Industry:

Health & Medicine (0.70)
Energy (0.49)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

On Negative-aware Preference Optimization for Recommendation

Ding, Chenlu, Liu, Daoxuan, Wu, Jiancan, Hu, Xingyu, Wu, Junkang, Wang, Haitao, Wang, Yongkang, Wang, Xingxing, Wang, Xiang

arXiv.org Artificial IntelligenceAug-14-2025

Recommendation systems leverage user interaction data to suggest relevant items while filtering out irrelevant (negative) ones. The rise of large language models (LLMs) has garnered increasing attention for their potential in recommendation tasks. However, existing methods for optimizing LLM-based recommenders face challenges in effectively utilizing negative samples. Simply integrating large numbers of negative samples can improve ranking accuracy and mitigate popularity bias but often leads to increased computational overhead and memory costs. Additionally, current approaches fail to account for the varying informativeness of negative samples, leading to suboptimal optimization performance. To address these issues, we propose NAPO (\textbf{N}egative-\textbf{A}ware \textbf{P}reference \textbf{O}ptimization), an enhanced framework for preference optimization in LLM-based recommendation. NAPO introduces two key innovations: (1) in-batch negative sharing, which expands the pool of negative samples without additional memory overhead, and (2) dynamic reward margin adjustment, which adapts model updates based on the confidence of negative samples. Extensive experiments on three public datasets demonstrate that NAPO outperforms existing methods in both recommendation accuracy and popularity bias reduction.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2508.09653

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

I tested a new Home Assistant adapter with Z-Wave superpowers

PCWorldAug-13-2025, 20:00:00 GMT

I recently described how a recent flurry of smart home failures made me turn to Home Assistant, the increasingly polished DIY smart home platform that you can host yourself without relying on the cloud. Starting today, Home Assistant users have an awesome new toy to play with. The Home Assistant Connect ZQA-2 ( 69) is a new smart home adapter with a very tall antenna. And before you ask, it's not for Matter, the latest and hottest new thing in smart home. Instead, the Connect ZQA-2 is all about Z-Wave, an older but widely used smart home technology that's getting renewed attention thanks to its new "Long Range" capability, which allows for connectivity with Z-Wave LR (Long Range) client devices up to a mile--yes, a mile--away.

antenna, connect zqa-2, zqa-2, (10 more...)

PCWorld

Industry: Information Technology > Smart Houses & Appliances (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)

Add feedback

Fight AI-powered online scams with Avast AI Assistant

PCWorldAug-13-2025, 12:22:49 GMT

The threat from AI-powered online scams is on the rise. You can be fooled by realistic-looking fake emails and messages pretending to be from your bank or other companies, that convince you to unwittingly install malware on your device. How can everyday people have a chance against these hugely sophisticated schemes? Avast has the answer – fight fire with fire. It has launched a new AI assistant that can answer security-related questions about communications you may suspect are fraudulent, as well as check URLs to see if websites really are what they report to be.

ai-powered online scam, avast assistant, online scam, (4 more...)

PCWorld

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.62)

Add feedback

Evaluating Podcast Recommendations with Profile-Aware LLM-as-a-Judge

Fabbri, Francesco, Penha, Gustavo, D'Amico, Edoardo, Wang, Alice, De Nadai, Marco, Doremus, Jackie, Gigioli, Paul, Damianou, Andreas, Stal, Oskar, Lalmas, Mounia

arXiv.org Artificial IntelligenceAug-13-2025

Evaluating personalized recommendations remains a central challenge, especially in long-form audio domains like podcasts, where traditional offline metrics suffer from exposure bias and online methods such as A/B testing are costly and operationally constrained. In this paper, we propose a novel framework that leverages Large Language Models (LLMs) as offline judges to assess the quality of podcast recommendations in a scalable and interpretable manner. Our two-stage profile-aware approach first constructs natural-language user profiles distilled from 90 days of listening history. These profiles summarize both topical interests and behavioral patterns, serving as compact, interpretable representations of user preferences. Rather than prompting the LLM with raw data, we use these profiles to provide high-level, semantically rich context-enabling the LLM to reason more effectively about alignment between a user's interests and recommended episodes. This reduces input complexity and improves interpretability. The LLM is then prompted to deliver fine-grained pointwise and pairwise judgments based on the profile-episode match. In a controlled study with 47 participants, our profile-aware judge matched human judgments with high fidelity and outperformed or matched a variant using raw listening histories. The framework enables efficient, profile-aware evaluation for iterative testing and model selection in recommender systems.

artificial intelligence, large language model, natural language, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3705328.3759305

2508.08777

Country:

Europe (1.00)
North America > United States (0.69)

Genre:

Research Report > Strength High (0.34)
Research Report > Experimental Study (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Using LLMs to Capture Users' Temporal Context for Recommendation

Sabouri, Milad, Mansoury, Masoud, Lin, Kun, Mobasher, Bamshad

arXiv.org Artificial IntelligenceAug-13-2025

Effective recommender systems demand dynamic user understanding, especially in complex, evolving environments. Traditional user profiling often fails to capture the nuanced, temporal contextual factors of user preferences, such as transient short-term interests and enduring long-term tastes. This paper presents an assessment of Large Language Models (LLMs) for generating semantically rich, time-aware user profiles. We do not propose a novel end-to-end recommendation architecture; instead, the core contribution is a systematic investigation into the degree of LLM effectiveness in capturing the dynamics of user context by disentangling short-term and long-term preferences. This approach, framing temporal preferences as dynamic user contexts for recommendations, adaptively fuses these distinct contextual components into comprehensive user embeddings. The evaluation across Movies&TV and Video Games domains suggests that while LLM-generated profiles offer semantic depth and temporal structure, their effectiveness for context-aware recommendations is notably contingent on the richness of user interaction histories. Significant gains are observed in dense domains (e.g., Movies&TV), whereas improvements are less pronounced in sparse environments (e.g., Video Games). This work highlights LLMs' nuanced potential in enhancing user profiling for adaptive, context-aware recommendations, emphasizing the critical role of dataset characteristics for practical applicability.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2508.08512

Country: North America > United States > Minnesota (0.28)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment (0.57)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback