ORBIT - Open Recommendation Benchmark for Reproducible Research with Hidden Tests

Jun-22-2026, 23:47:24 GMT – Neural Information Processing Systems

Recommender systems are among the most impactful AI applications, interacting with billions of users every day, guiding them to relevant products, services, or information tailored to their preferences. However, the research and development of recommender systems are hindered by existing datasets that fail to capture realistic user behaviors and inconsistent evaluation settings that lead to ambiguous conclusions. This paper introduces the Open Recommendation Benchmark for Reproducible Research with HIdden Tests (ORBIT), a unified benchmark for consistent and realistic evaluation of recommendation models. ORBIT offers a standardized evaluation framework of public datasets with reproducible splits and transparent settings for its public leaderboard. Additionally, ORBIT introduces a new webpage recommendation task, ClueWeb-Reco, featuring web browsing sequences from 87 million public, high-quality webpages. ClueWeb-Reco is a synthetic dataset derived from real, user-consented, and privacy-guaranteed browsing data.

large language model, machine learning, natural language

Country:
- North America > United States (0.68)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Media (1.00)
- Leisure & Entertainment (1.00)
- Education (0.92)
- Information Technology
  - Security & Privacy (0.93)
  - Services (0.93)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Personal Assistant Systems (1.00)
    - Natural Language
      - Large Language Model (1.00)
      - Chatbot (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)
