Accuracy Improvement in Differentially Private Logistic Regression: A Pre-training Approach

Hoseinpour, Mohammad, Hoseinpour, Milad, Aghagolzadeh, Ali

Dec-4-2023–arXiv.org Artificial Intelligence

Machine learning (ML) models can memorize training datasets. As a result, training ML models over private datasets can lead to the violation of individuals' privacy. Differential privacy (DP) is a rigorous privacy notion to preserve the privacy of underlying training datasets. Yet, training ML models in a DP framework usually degrades the accuracy of ML models. This paper aims to boost the accuracy of a DP logistic regression (LR) via a pre-training module. In more detail, we initially pre-train our LR model on a public training dataset that there is no privacy concern about it. Then, we fine-tune our DP-LR model with the private dataset. In the numerical results, we show that adding a pre-training module significantly improves the accuracy of the DP-LR model.

accuracy, dataset, dp-lr model, (11 more...)

arXiv.org Artificial Intelligence

Dec-4-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > New York County > New York City (0.04)
- Asia > Middle East
  - Iran > Tehran Province > Tehran (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (0.76)

Industry:
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.91)