Differential Privacy in Machine Learning: From Symbolic AI to LLMs
Aguilera-Martínez, Francisco, Berzal, Fernando
–arXiv.org Artificial Intelligence
Machine learning models should not reveal particular information that is not otherwise accessible. Differential privacy provides a formal framework to mitigate privacy risks by ensuring that the inclusion or exclusion of any single data point does not significantly alter the output of an algorithm, thus limiting the exposure of private information. This survey paper explores the foundational definitions of differential privacy, reviews its original formulations and tracing its evolution through key research contributions. It then provides an in-depth examination of how DP has been integrated into machine learning models, analyzing existing proposals and methods to preserve privacy when training ML models. Finally, it describes how DP-based ML techniques can be evaluated in practice. %Finally, it discusses the broader implications of DP, highlighting its potential for public benefit, its real-world applications, and the challenges it faces, including vulnerabilities to adversarial attacks. By offering a comprehensive overview of differential privacy in machine learning, this work aims to contribute to the ongoing development of secure and responsible AI systems.
arXiv.org Artificial Intelligence
Jun-16-2025
- Country:
- North America
- United States
- District of Columbia > Washington (0.04)
- Florida > Orange County
- Orlando (0.04)
- North Carolina > Wake County
- Raleigh (0.04)
- Indiana > Marion County
- Indianapolis (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Colorado > Denver County
- Denver (0.04)
- Arizona > Maricopa County
- Scottsdale (0.04)
- Rhode Island > Providence County
- Providence (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland
- Montgomery County > Bethesda (0.04)
- Baltimore (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- California
- San Francisco County > San Francisco (0.14)
- San Diego County > San Diego (0.04)
- Santa Barbara County > Santa Barbara (0.04)
- Alameda County > Oakland (0.04)
- Santa Clara County
- Santa Clara (0.04)
- San Jose (0.04)
- Los Angeles County
- Los Angeles (0.14)
- Long Beach (0.04)
- New York > New York County
- New York City (0.15)
- Canada
- Ontario > Toronto (0.14)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- Austria > Vienna (0.14)
- Switzerland (0.04)
- Italy (0.04)
- United Kingdom
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- England > Greater London
- London (0.04)
- Scotland > City of Edinburgh
- Spain > Andalusia
- Granada Province > Granada (0.04)
- Middle East > Cyprus
- Greece > Attica
- Athens (0.04)
- France > Hauts-de-France
- Asia
- Middle East > Jordan (0.04)
- Singapore > Central Region
- Singapore (0.04)
- China
- Beijing > Beijing (0.04)
- Ningxia Hui Autonomous Region > Yinchuan (0.04)
- Africa > Rwanda
- North America
- Genre:
- Overview (1.00)
- Research Report
- Experimental Study (0.92)
- New Finding (0.67)
- Industry:
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Government (1.00)
- Education (0.92)
- Health & Medicine (0.92)
- Technology:
- Information Technology > Artificial Intelligence
- Representation & Reasoning
- Uncertainty > Bayesian Inference (1.00)
- Optimization (0.92)
- Machine Learning
- Performance Analysis > Accuracy (1.00)
- Neural Networks > Deep Learning (1.00)
- Decision Tree Learning (0.93)
- Ensemble Learning (0.67)
- Statistical Learning
- Regression (0.68)
- Clustering (0.67)
- Learning Graphical Models > Directed Networks
- Bayesian Learning (1.00)
- Representation & Reasoning
- Information Technology > Artificial Intelligence