AITopics | Personal Assistant Systems

62302a24b04589f9f9cdd5b02c344b6c-Paper-Conference.pdf

Neural Information Processing SystemsAug-15-2025, 06:48:34 GMT

cost network, dreamshard, placement, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas > Brazos County > College Station (0.04)
Europe > Portugal > Braga > Braga (0.04)

Genre:

Research Report (0.68)
Workflow (0.46)

Industry:

Information Technology > Services (1.00)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Data Science > Data Mining (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Confounding is a Pervasive Problem in Real World Recommender Systems

Merkov, Alexander, Rohde, David, Gilotte, Alexandre, Heymann, Benjamin

arXiv.org Machine LearningAug-15-2025

Unobserved confounding arises when an unmeasured feature influences both the treatment and the outcome, leading to biased causal effect estimates. This issue undermines observational studies in fields like economics, medicine, ecology or epidemiology. Recommender systems leveraging fully observed data seem not to be vulnerable to this problem. However many standard practices in recommender systems result in observed features being ignored, resulting in effectively the same problem. This paper will show that numerous common practices such as feature engineering, A/B testing and modularization can in fact introduce confounding into recommendation systems and hamper their performance. Several illustrations of the phenomena are provided, supported by simulation studies with practical suggestions about how practitioners may reduce or avoid the affects of confounding in real systems.

artificial intelligence, recommendation, recommender system, (15 more...)

arXiv.org Machine Learning

2508.10479

Country:

Europe > France (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)

Add feedback

Improved Personalized Headline Generation via Denoising Fake Interests from Implicit Feedback

Liu, Kejin, Lian, Junhong, Ao, Xiang, Wang, Ningtao, Fu, Xing, Cheng, Yu, Wang, Weiqiang, Liu, Xinyu

arXiv.org Artificial IntelligenceAug-15-2025

Accurate personalized headline generation hinges on precisely capturing user interests from historical behaviors. However, existing methods neglect personalized-irrelevant click noise in entire historical clickstreams, which may lead to hallucinated headlines that deviate from genuine user preferences. In this paper, we reveal the detrimental impact of click noise on personalized generation quality through rigorous analysis in both user and news dimensions. Based on these insights, we propose a novel Personalized Headline Generation framework via Denoising Fake Interests from Implicit Feedback (PHG-DIF). PHG-DIF first employs dual-stage filtering to effectively remove clickstream noise, identified by short dwell times and abnormal click bursts, and then leverages multi-level temporal fusion to dynamically model users' evolving and multi-faceted interests for precise profiling. Moreover, we release DT-PENS, a new benchmark dataset comprising the click behavior of 1,000 carefully curated users and nearly 10,000 annotated personalized headlines with historical dwell time annotations. Extensive experiments demonstrate that PHG-DIF substantially mitigates the adverse effects of click noise and significantly improves headline quality, achieving state-of-the-art (SOTA) results on DT-PENS. Our framework implementation and dataset are available at https://github.com/liukejin-up/PHG-DIF.

artificial intelligence, natural language, november10-14, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3746252.3761210

2508.07178

Country:

Asia > China (0.49)
North America > United States > Minnesota (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.48)

Add feedback

Federated Reconstruction: Partially Local Federated Learning

Neural Information Processing SystemsAug-14-2025, 17:48:00 GMT

To address this, we explore partially local federated learning.

arxiv preprint arxiv, federated learning, learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.95)

Add feedback

4ad4fc1528374422dd7a69dea9e72948-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsAug-14-2025, 16:14:02 GMT

dataset, please describe, please provide, (16 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts (0.04)

Industry: Law (0.93)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)
(3 more...)

Add feedback

Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems

Neural Information Processing SystemsAug-14-2025, 16:13:58 GMT

Tenrec has the potential to become a useful benchmark dataset for a majority of popular recommendation tasks.

dataset, proceedings, recommendation, (14 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
North America > United States > Massachusetts (0.04)

Genre: Research Report (0.46)

Industry: Information Technology (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Learning from Distributed Users in Contextual Linear Bandits Without Sharing the Context

Neural Information Processing SystemsAug-14-2025, 14:50:01 GMT

In this paper, we develop algorithms that support the deployment of contextual linear bandits in distributed settings.

agent, algorithm, central learner, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Industry:

Education (0.46)
Health & Medicine (0.46)

Technology:

Information Technology > Communications > Networks (0.93)
Information Technology > Data Science > Data Mining > Big Data (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.46)

Add feedback

334da4cbb76302f37bd2e9d86f558869-Supplemental-Conference.pdf

Neural Information Processing SystemsAug-14-2025, 04:58:15 GMT

bc loss, mean angle, representation, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.69)

Add feedback

Incorporating Bias-aware Margins into Contrastive Loss for Collaborative Filtering An Zhang Wenchang Ma Xiang Wang T at-Seng Chua Sea-NExT Joint Lab National University of Singapore

Neural Information Processing SystemsAug-14-2025, 04:58:11 GMT

Xiang Wang is the corresponding author.

bc loss, recommendation, representation, (16 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.40)
Asia > China (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Regret minimization in Linear Bandits with offline data via extended D-optimal exploration

Vijayan, Sushant, Suggala, Arun, Shanmugam, Karthikeyan, Pal, Soumyabrata

arXiv.org Machine LearningAug-14-2025

We consider the problem of online regret minimization in linear bandits with access to prior observations (offline data) from the underlying bandit model. There are numerous applications where extensive offline data is often available, such as in recommendation systems, online advertising. Consequently, this problem has been studied intensively in recent literature. Our algorithm, Offline-Online Phased Elimination (OOPE), effectively incorporates the offline data to substantially reduce the online regret compared to prior work. To leverage offline information prudently, OOPE uses an extended D-optimal design within each exploration phase. OOPE achieves an online regret is $\tilde{O}(\sqrt{\deff T \log \left(|\mathcal{A}|T\right)}+d^2)$. $\deff \leq d)$ is the effective problem dimension which measures the number of poorly explored directions in offline data and depends on the eigen-spectrum $(λ_k)_{k \in [d]}$ of the Gram matrix of the offline data. The eigen-spectrum $(λ_k)_{k \in [d]}$ is a quantitative measure of the \emph{quality} of offline data. If the offline data is poorly explored ($\deff \approx d$), we recover the established regret bounds for purely online setting while, when offline data is abundant ($\Toff >> T$) and well-explored ($\deff = o(1) $), the online regret reduces substantially. Additionally, we provide the first known minimax regret lower bounds in this setting that depend explicitly on the quality of the offline data. These lower bounds establish the optimality of our algorithm in regimes where offline data is either well-explored or poorly explored. Finally, by using a Frank-Wolfe approximation to the extended optimal design we further improve the $O(d^{2})$ term to $O\left(\frac{d^{2}}{\deff} \min \{ \deff,1\} \right)$, which can be substantial in high dimensions with moderate quality of offline data $\deff = Ω(1)$.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Machine Learning

2508.0842

Country: