Oblivious sketching for logistic regression
Munteanu, Alexander, Omlor, Simon, Woodruff, David
What guarantees are possible for solving logistic regression in one pass over a data stream? To answer this question, we present the first data oblivious sketch for logistic regression. Our sketch can be computed in input sparsity time over a turnstile data stream and reduces the size of a $d$-dimensional data set from $n$ to only $\operatorname{poly}(\mu d\log n)$ weighted points, where $\mu$ is a useful parameter which captures the complexity of compressing the data. Solving (weighted) logistic regression on the sketch gives an $O(\log n)$-approximation to the original problem on the full data set. We also show how to obtain an $O(1)$-approximation with slight modifications. Our sketches are fast, simple, easy to implement, and our experiments demonstrate their practicality.
Jul-14-2021
- Country:
- Asia
- Afghanistan > Parwan Province
- Charikar (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Afghanistan > Parwan Province
- Europe
- Germany > North Rhine-Westphalia
- Arnsberg Region > Dortmund (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Germany > North Rhine-Westphalia
- North America > United States
- Florida > Palm Beach County
- Boca Raton (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.14)
- Florida > Palm Beach County
- Asia
- Genre:
- Research Report > New Finding (0.89)
- Technology: