AITopics

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing Systems Nov-21-2025, 15:13:43 GMT

A Meta-Learning Perspective on Cold-Start Recommendations for Items

artificial intelligence, machine learning, meta-learning perspective, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Manasi Vartak, Arvind Thiagarajan, Conrado Miranda, Jeshua Bratman, Hugo Larochelle

A Meta-Learning Perspective on Cold-Start Recommendations for Items

Neural Information Processing Systems Nov-21-2025, 08:56:57 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, recommendation, (18 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Industry: Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)

arXiv.org Artificial Intelligence May-21-2025

LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis

Zhang, Weiming, Fu, Lingyue, Li, Qingyao, Du, Kounianhua, Lin, Jianghao, Yu, Jingwei, Xia, Wei, Zhang, Weinan, Tang, Ruiming, Yu, Yong

Cognitive diagnosis (CD) plays a crucial role in intelligent education, evaluating students' comprehension of knowledge concepts based on their test histories. However, current CD methods often model students, exercises, and knowledge concepts solely on their ID relationships, neglecting the abundant semantic relationships present within the educational data space. Furthermore, contemporary intelligent tutoring systems (ITS) frequently involve the addition of new students and exercises, a situation that ID-based methods find challenging to manage effectively. The advent of large language models (LLMs) offers the potential for overcoming this challenge with open-world knowledge. In this paper, we propose LLM4CD, which Leverages Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis. Our method utilizes the open-world knowledge of LLMs to construct cognitively expressive textual representations, which are then encoded to introduce rich semantic information into the CD task. Additionally, we propose an innovative bi-level encoder framework that models students' test histories through two levels of encoders: a macro-level cognitive text encoder and a micro-level knowledge state encoder. This approach substitutes traditional ID embeddings with semantic representations, enabling the model to accommodate new students and exercises with open-world knowledge and address the cold-start problem. Extensive experimental results demonstrate that our proposed method consistently outperforms previous CD models on multiple real-world datasets, validating the effectiveness of leveraging LLMs to introduce rich semantic information into the CD task.

large language model, machine learning, natural language, (17 more...)

2505.13492

Country:

Asia > China (0.31)
North America > United States (0.30)

Genre: Research Report > New Finding (0.88)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

arXiv.org Artificial Intelligence Apr-10-2025

Addressing Cold-start Problem in Click-Through Rate Prediction via Supervised Diffusion Modeling

Zhu, Wenqiao, Wang, Lulu, Wu, Jun

Predicting Click-Through Rates is a crucial function within recommendation and advertising platforms, as the output of CTR prediction determines the order of items shown to users. The Embedding & MLP paradigm has become a standard approach for industrial recommendation systems and has been widely deployed. However, this paradigm suffers from cold-start problems, where there is either no or only limited user action data available, leading to poorly learned ID embeddings. The cold-start problem hampers the performance of new items. To address this problem, we designed a novel diffusion model to generate a warmed-up embedding for new items. Specifically, we define a novel diffusion process between the ID embedding space and the side information space. In addition, we can derive a sub-sequence from the diffusion steps to expedite training, given that our diffusion model is non-Markovian. Our diffusion model is supervised by both the variational inference and binary cross-entropy objectives, enabling it to generate warmed-up embeddings for items in both the cold-start and warm-up phases. Additionally, we have conducted extensive experiments on three recommendation datasets. The results confirmed the effectiveness of our approach.

artificial intelligence, diffusion model, machine learning, (14 more...)

2504.0627

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Artificial Intelligence Jan-6-2025

Personalized Fashion Recommendation with Image Attributes and Aesthetics Assessment

Chen, Chongxian, Mo, Fan, Fan, Xin, Yamana, Hayato

Personalized fashion recommendation is a difficult task because 1) the decisions are highly correlated with users' aesthetic appetite, which previous work frequently overlooks, and 2) many new items are constantly rolling out that cause strict cold-start problems in the popular identity (ID)-based recommendation methods. These new items are critical to recommend because of trend-driven consumerism. In this work, we aim to provide more accurate personalized fashion recommendations and solve the cold-start problem by converting available information, especially images, into two attribute graphs focusing on optimized image utilization and noise-reducing user modeling. Compared with previous methods that separate image and text as two components, the proposed method combines image and text information to create a richer attributes graph. Capitalizing on the advancement of large language and vision models, we experiment with extracting fine-grained attributes efficiently and as desired using two different prompts. Preliminary experiments on the IQON3000 dataset have shown that the proposed method achieves competitive accuracy compared with baselines.

artificial intelligence, machine learning, recommendation, (12 more...)

2501.03085

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.15)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.32)

Manasi Vartak, Arvind Thiagarajan, Conrado Miranda, Jeshua Bratman, Hugo Larochelle

A Meta-Learning Perspective on Cold-Start Recommendations for Items

Neural Information Processing Systems Oct-3-2024, 10:11:04 GMT

Neural Information Processing Systems http://nips.cc/

architecture, item history, recommendation, (15 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Industry: Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)

arXiv.org Artificial Intelligence Apr-19-2024

Large Language Models for Next Point-of-Interest Recommendation

Li, Peibo, de Rijke, Maarten, Xue, Hao, Ao, Shuang, Song, Yang, Salim, Flora D.

The next Point of Interest (POI) recommendation task is to predict users' immediate next POI visit given their historical data. Location-Based Social Network (LBSN) data, which is often used for the next POI recommendation task, comes with challenges. One frequently disregarded challenge is how to effectively use the abundant contextual information present in LBSN data. Previous methods are limited by their numerical nature and fail to address this challenge. In this paper, we propose a framework that uses pretrained Large Language Models (LLMs) to tackle this challenge. Our framework allows us to preserve heterogeneous LBSN data in its original format, hence avoiding the loss of contextual information. Furthermore, our framework is capable of comprehending the inherent meaning of contextual information due to the inclusion of commonsense knowledge. In experiments, we test our framework on three real-world LBSN datasets. Our results show that the proposed framework outperforms the state-of-the-art models in all three datasets. Our analysis demonstrates the effectiveness of the proposed framework in using contextual information as well as alleviating the commonly encountered cold-start and short trajectory problems.

contextual information, information, trajectory, (14 more...)

doi: 10.1145/3626772.3657840

2404.17591

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > District of Columbia > Washington (0.05)
Oceania > Australia > New South Wales > Sydney (0.05)
(7 more...)

Genre: Research Report > New Finding (0.86)

Industry:

Retail (0.67)
Information Technology (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Buffelli, Davide, Gupta, Ashish, Strzalka, Agnieszka, Plachouras, Vassilis

Is Meta-Learning the Right Approach for the Cold-Start Problem in Recommender Systems?

arXiv.org Artificial Intelligence Aug-16-2023

Recommender systems have become fundamental building blocks of modern online products and services, and have a substantial impact on user experience. In the past few years, deep learning methods have attracted a lot of research, and are now heavily used in modern real-world recommender systems. Nevertheless, dealing with recommendations in the cold-start setting, e.g., when a user has had only limited interactions with the system, is a problem that remains far from solved. Meta-learning techniques, and in particular optimization-based meta-learning, have recently become the most popular approaches in the academic research literature for tackling the cold-start problem in deep learning models for recommender systems. However, current meta-learning approaches are not practical for real-world recommender systems, which have billions of users and items, and strict latency requirements. In this paper we show that it is possible to obtain similar, or higher, performance on commonly used benchmarks for the cold-start problem without using meta-learning techniques. In more detail, we show that, when tuned correctly, standard and widely adopted deep learning models perform just as well as newer meta-learning models. We further show that an extremely simple modular approach using common representation learning techniques can perform comparably to meta-learning techniques specifically designed for the cold-start setting while being much more easily deployable in real-world applications.

artificial intelligence, information, machine learning, (15 more...)

2308.08354

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.05)
Europe > United Kingdom > England > Greater London > London (0.04)
(17 more...)

Genre:

Overview (0.93)
Research Report > New Finding (0.46)

Industry:

Media (0.68)
Information Technology > Services (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial Intelligence Aug-14-2023

AutoAssign+: Automatic Shared Embedding Assignment in Streaming Recommendation

Liu, Ziru, Chen, Kecheng, Song, Fengyi, Chen, Bo, Zhao, Xiangyu, Guo, Huifeng, Tang, Ruiming

With the rapid growth of personalized online applications, recommender systems have been widely implemented by various online businesses, including e-commerce websites, news platforms, and online advertising [1, 2]. Among them, streaming recommendation [3, 4] is one of the common forms of recommender systems, where streaming data constantly flow into the recommendation models for training, thus better modeling the user's current preferences. In addition, streaming recommendations are particularly important for time-sensitive items, such as news, as they allow for rapid identification and distribution of relevant content to interested users, which is critical for commercial information retrieval systems. Because they can effectively capture the highly nonlinear relationship between users and items end-to-end, neural network-based models are rapidly becoming the mainstream of recommender systems. As shown in Figure 1, existing deep recommendation models typically follow the "Embedding & Feature Interaction" paradigm [5]. The embedding layer serves as the encoder to represent sparse features in dense latent space, while the feature interaction layers serve to capture interactive signals among these features. In a streaming recommender system, new items and users are continually added to the data corpus, creating a highly dynamic streaming environment that presents several challenges, which can be summarized as:

Cold-start: The streaming recommender system is confronted with a constant influx of new users, many of whom can be classified as visitor-type users and possess extremely limited behavior information. Furthermore, the system is constantly updated with new items, yet there has not been enough interaction with these items to generate an adequate level of training data. The consequence of employing insufficiently trained new user/item embeddings is a significant decline in the performance of the recommendation model.

artificial intelligence, machine learning, recommender system, (16 more...)

doi: 10.1007/s10115-023-01951-1

2308.06965

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > China > Hong Kong > Kowloon (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Services > e-Commerce Services (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)