What Data is Really Necessary? A Feasibility Study of Inference Data Minimization for Recommender Systems
Leysen, Jens, Favier, Marco, Goethals, Bart
arXiv.org Artificial Intelligence
Data minimization is a legal principle requiring personal data processing to be limited to what is necessary for a specified purpose. Operationalizing this principle for recommender systems, which rely on extensive personal data, remains a significant challenge. This paper conducts a feasibility study on minimizing implicit feedback inference data for such systems. We propose a novel problem formulation, analyze various minimization techniques, and investigate key factors influencing their effectiveness. We demonstrate that substantial inference data reduction is technically feasible without significant performance loss. However, its practicality is critically determined by two factors: the technical setting (e.g., performance targets, choice of model) and user characteristics (e.g., history size, preference complexity). Thus, while we establish its technical feasibility, we conclude that data minimization remains practically challenging and its dependence on the technical and user context makes a universal standard for data 'necessity' difficult to implement.
Sep-1-2025
- Country:
  - Africa
    - Ethiopia > Addis Ababa
      - Addis Ababa (0.04)
    - South Africa (0.04)
  - Asia
    - China > Heilongjiang Province
      - Daqing (0.04)
    - India > West Bengal
      - Kharagpur (0.04)
    - Singapore > Central Region
      - Singapore (0.04)
    - South Korea > Seoul
      - Seoul (0.04)
  - Europe
    - Belgium > Flanders
      - Antwerp Province > Antwerp (0.05)
    - Denmark > Capital Region
      - Copenhagen (0.04)
    - France > Auvergne-Rhône-Alpes
    - Greece > Attica
      - Athens (0.04)
    - Italy > Tuscany
      - Pisa Province > Pisa (0.04)
    - United Kingdom (0.04)
  - North America
    - Canada > Quebec
      - Montreal (0.04)
    - United States
      - California
        - San Francisco County > San Francisco (0.14)
        - Santa Clara County > Palo Alto (0.04)
      - Florida > Miami-Dade County
        - Miami (0.04)
      - Idaho > Ada County
        - Boise (0.04)
      - Massachusetts > Suffolk County
        - Boston (0.04)
      - New York > New York County
        - New York City (0.05)
      - Texas > Harris County
        - Houston (0.04)
      - Virginia > Arlington County
        - Arlington (0.04)
      - Washington > King County
        - Seattle (0.04)
  - Oceania > Australia
  - South America > Brazil (0.14)
- Genre:
  - Research Report > New Finding (1.00)
- Industry:
  - Government (1.00)
  - Information Technology > Security & Privacy (1.00)
  - Law > Statutes (1.00)
- Technology: