Dataset-Agnostic Recommender Systems

Wijaya, Tri Kurniawan, D'Amico, Edoardo, Shao, Xinyang

arXiv.org Artificial Intelligence 

To this end, we introduce a novel paradigm: Dataset-Agnostic Recommender Systems (DAReS) that aims to enable a single codebase to autonomously adapt to various datasets without the need for fine-tuning, for a given recommender system task. Central to this approach is the Dataset Description Language (DsDL), a structured format that provides metadata about the dataset's features and labels, and allow the system to understand dataset's characteristics, allowing it to autonomously manage processes like feature selection, missing values imputation, noise removal, and hyperparameter optimization. By reducing the need for domainspecific expertise and manual adjustments, DAReS offers a more efficient and scalable solution for building recommender systems across diverse application domains. It addresses critical challenges in the field, such as reusability, reproducibility, and accessibility for non-expert users or entry-level researchers. With DAReS, we hope to spark community's attention in making recommender systems more adaptable, reproducible, and usable, with little to no configuration required from (possibly nonexpert or entry-level) users.