AITopics | Singhal, Vikrant

Collaborating Authors

Singhal, Vikrant

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Not All Learnable Distribution Classes are Privately Learnable

Bun, Mark, Kamath, Gautam, Mouzakis, Argyris, Singhal, Vikrant

arXiv.org Machine LearningFeb-5-2024

This problem, known as distribution learning or density estimation, has enjoyed significant study by a number of communities, including Computer Science, Statistics, and Information Theory (see, e.g., [DL01, KMR

artificial intelligence, estimation, machine learning, (14 more...)

arXiv.org Machine Learning

2402.00267

Country: North America > United States (0.48)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

A Polynomial Time, Pure Differentially Private Estimator for Binary Product Distributions

Singhal, Vikrant

arXiv.org Machine LearningSep-23-2023

We present the first $\varepsilon$-differentially private, computationally efficient algorithm that estimates the means of product distributions over $\{0,1\}^d$ accurately in total-variation distance, whilst attaining the optimal sample complexity to within polylogarithmic factors. The prior work had either solved this problem efficiently and optimally under weaker notions of privacy, or had solved it optimally while having exponential running times.

artificial intelligence, machine learning, proceedings, (14 more...)

arXiv.org Machine Learning

2304.06787

Country: North America > United States (0.49)

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.96)
Information Technology > Security & Privacy (0.93)

Add feedback

Private Distribution Learning with Public Data: The View from Sample Compression

Ben-David, Shai, Bie, Alex, Canonne, Clément L., Kamath, Gautam, Singhal, Vikrant

arXiv.org Artificial IntelligenceAug-14-2023

We study the problem of private distribution learning with access to public data. In this setup, which we refer to as public-private learning, the learner is given public and private samples drawn from an unknown distribution $p$ belonging to a class $\mathcal Q$, with the goal of outputting an estimate of $p$ while adhering to privacy constraints (here, pure differential privacy) only with respect to the private samples. We show that the public-private learnability of a class $\mathcal Q$ is connected to the existence of a sample compression scheme for $\mathcal Q$, as well as to an intermediate notion we refer to as list learning. Leveraging this connection: (1) approximately recovers previous results on Gaussians over $\mathbb R^d$; and (2) leads to new ones, including sample complexity upper bounds for arbitrary $k$-mixtures of Gaussians over $\mathbb R^d$, results for agnostic and distribution-shift resistant learners, as well as closure properties for public-private learnability under taking mixtures and products of distributions. Finally, via the connection to list learning, we show that for Gaussians in $\mathbb R^d$, at least $d$ public samples are necessary for private learnability, which is close to the known upper bound of $d+1$ public samples.

artificial intelligence, gaussian, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2308.06239

Country: North America > United States > New York (0.14)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Private Estimation with Public Data

Bie, Alex, Kamath, Gautam, Singhal, Vikrant

arXiv.org Artificial IntelligenceApr-5-2023

We initiate the study of differentially private (DP) estimation with access to a small amount of public data. For private estimation of d-dimensional Gaussians, we assume that the public data comes from a Gaussian that may have vanishing similarity in total variation distance with the underlying Gaussian of the private data. We show that under the constraints of pure or concentrated DP, d+1 public data samples are sufficient to remove any dependence on the range parameters of the private data distribution from the private sample complexity, which is known to be otherwise necessary without public data. For separated Gaussian mixtures, we assume that the underlying public and private distributions are the same, and we consider two settings: (1) when given a dimension-independent amount of public data, the private sample complexity can be improved polynomially in terms of the number of mixture components, and any dependence on the range parameters of the distribution can be removed in the approximate DP case; (2) when given an amount of public data linear in the dimension, the private sample complexity can be made independent of range parameters even under concentrated DP, and additional improvements can be made to the overall sample complexity.

artificial intelligence, gaussian, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2208.07984

Country: North America > United States (0.46)

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

New Lower Bounds for Private Estimation and a Generalized Fingerprinting Lemma

Kamath, Gautam, Mouzakis, Argyris, Singhal, Vikrant

arXiv.org Machine LearningMar-28-2023

We prove new lower bounds for statistical estimation tasks under the constraint of $(\varepsilon, \delta)$-differential privacy. First, we provide tight lower bounds for private covariance estimation of Gaussian distributions. We show that estimating the covariance matrix in Frobenius norm requires $\Omega(d^2)$ samples, and in spectral norm requires $\Omega(d^{3/2})$ samples, both matching upper bounds up to logarithmic factors. The latter bound verifies the existence of a conjectured statistical gap between the private and the non-private sample complexities for spectral estimation of Gaussian covariances. We prove these bounds via our main technical contribution, a broad generalization of the fingerprinting method to exponential families. Additionally, using the private Assouad method of Acharya, Sun, and Zhang, we show a tight $\Omega(d/(\alpha^2 \varepsilon))$ lower bound for estimating the mean of a distribution with bounded covariance to $\alpha$-error in $\ell_2$-distance. Prior known lower bounds for all these problems were either polynomially weaker or held under the stricter condition of $(\varepsilon, 0)$-differential privacy.

artificial intelligence, estimation, machine learning, (17 more...)

arXiv.org Machine Learning

2205.08532

Country: North America > United States > New York (0.14)

Genre: Research Report (0.64)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.45)

Add feedback

A Bias-Variance-Privacy Trilemma for Statistical Estimation

Kamath, Gautam, Mouzakis, Argyris, Regehr, Matthew, Singhal, Vikrant, Steinke, Thomas, Ullman, Jonathan

arXiv.org Machine LearningFeb-28-2023

The canonical algorithm for differentially private mean estimation is to first clip the samples to a bounded range and then add noise to their empirical mean. Clipping controls the sensitivity and, hence, the variance of the noise that we add for privacy. But clipping also introduces statistical bias. We prove that this tradeoff is inherent: no algorithm can simultaneously have low bias, low variance, and low privacy loss for arbitrary distributions. On the positive side, we show that unbiased mean estimation is possible under approximate differential privacy if we assume that the distribution is symmetric. Furthermore, we show that, even if we assume that the data is sampled from a Gaussian, unbiased mean estimation is impossible under pure or concentrated differential privacy.

algorithm, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

2301.13334

Country: North America > United States > New York (0.14)

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Security & Privacy (0.93)

Add feedback

A Private and Computationally-Efficient Estimator for Unbounded Gaussians

Kamath, Gautam, Mouzakis, Argyris, Singhal, Vikrant, Steinke, Thomas, Ullman, Jonathan

arXiv.org Machine LearningNov-8-2021

We give the first polynomial-time, polynomial-sample, differentially private estimator for the mean and covariance of an arbitrary Gaussian distribution $\mathcal{N}(\mu,\Sigma)$ in $\mathbb{R}^d$. All previous estimators are either nonconstructive, with unbounded running time, or require the user to specify a priori bounds on the parameters $\mu$ and $\Sigma$. The primary new technical tool in our algorithm is a new differentially private preconditioner that takes samples from an arbitrary Gaussian $\mathcal{N}(0,\Sigma)$ and returns a matrix $A$ such that $A \Sigma A^T$ has constant condition number.

algorithm, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

2111.04609

Country: North America > United States (0.94)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Differentially Private Algorithms for Learning Mixtures of Separated Gaussians

Kamath, Gautam, Sheffet, Or, Singhal, Vikrant, Ullman, Jonathan

arXiv.org Machine LearningSep-9-2019

Learning the parameters of a Gaussian mixtures models is a fundamental and widely studied problem with numerous applications. In this work, we give new algorithms for learning the parameters of a high-dimensional, well separated, Gaussian mixture model subject to the strong constraint of differential privacy. In particular, we give a differentially private analogue of the algorithm of Achlioptas and McSherry. Our algorithm has two key properties not achieved by prior work: (1) The algorithm's sample complexity matches that of the corresponding non-private algorithm up to lower order terms in a wide range of parameters. (2) The algorithm does not require strong a priori bounds on the parameters of the mixture components.

algorithm, artificial intelligence, survey article, (18 more...)

arXiv.org Machine Learning

1909.03951

Country:

North America > United States (0.68)
North America > Canada > Alberta (0.14)

Genre: Workflow (0.67)

Industry: Information Technology > Security & Privacy (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Security & Privacy (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.67)

Add feedback

Privately Learning High-Dimensional Distributions

Kamath, Gautam, Li, Jerry, Singhal, Vikrant, Ullman, Jonathan

arXiv.org Machine LearningMay-1-2018

We design nearly optimal differentially private algorithms for learning two fundamental families of high-dimensional distributions in total variation distance: multivariate Gaussians in $\mathbb{R}^{d}$ and product distributions on the hypercube. The sample complexity of both our algorithms approaches the sample complexity of non-private learners up to a small multiplicative factor and an additional additive term that is lower order for a wide range of parameters, showing that privacy comes essentially for free for these problems. Our algorithms use a novel technical approach to reducing the sensitivity of the estimation procedure that we call recursive private preconditioning and may find additional applications.

algorithm, artificial intelligence, health & medicine, (19 more...)

arXiv.org Machine Learning

1805.00216

Country: North America > United States (0.46)

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning in High Dimensional Spaces (0.40)

Add feedback