vaidya
Characterizing the Accuracy-Communication-Privacy Trade-off in Distributed Stochastic Convex Optimization
Salgia, Sudeep, Pavlovic, Nikola, Chi, Yuejie, Zhao, Qing
Here x X denotes the decision variable where X is a convex, compact set and l(x;z) denotes the loss at point x using the datum z. We study this problem under the additional constraint of ensuring differential privacy [Dwork et al., 2006] of the local datasets at each client. This problem arises in numerous settings and represents a typical scenario for Federated Learning (FL) [McMahan et al., 2017], which has emerged as the de facto approach for collaboratively training machine learning models using a large number of devices coordinated through a central server [Kairouz et al., 2021, Wang et al., 2021]. Designing efficient algorithms for differentially private distributed stochastic convex optimization, also referred to as distributed DP-SCO, requires striking a careful balance between the primary objective of minimizing the optimization error and two competing desiderata -- communication cost and privacy.
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Asia > Middle East > Jordan (0.04)
- Asia > China (0.04)
PolytopeWalk: Sparse MCMC Sampling over Polytopes
High dimensional sampling is an important computational tool in statistics and other computational disciplines, with applications ranging from Bayesian statistical uncertainty quantification, metabolic modeling in systems biology to volume computation. We present $\textsf{PolytopeWalk}$, a new scalable Python library designed for uniform sampling over polytopes. The library provides an end-to-end solution, which includes preprocessing algorithms such as facial reduction and initialization methods. Six state-of-the-art MCMC algorithms on polytopes are implemented, including the Dikin, Vaidya, and John Walk. Additionally, we introduce novel sparse constrained formulations of these algorithms, enabling efficient sampling from sparse polytopes of the form $K_2 = \{x \in \mathbb{R}^d \ | \ Ax = b, x \succeq_k 0\}$. This implementation maintains sparsity in $A$, ensuring scalability to high dimensional settings $(d > 10^5)$. We demonstrate the improved sampling efficiency and per-iteration cost on both Netlib datasets and structured polytopes. $\textsf{PolytopeWalk}$ is available at github.com/ethz-randomwalk/polytopewalk with documentation at polytopewalk.readthedocs.io .
- Europe > Switzerland > Zürich > Zürich (0.04)
- North America > United States > North Carolina > Durham County > Durham (0.04)
- Europe > Spain > Aragón (0.04)
Digital Twins Meet the Koopman Operator: Data-Driven Learning for Robust Autonomy
Samak, Chinmay Vilas, Samak, Tanmay Vilas, Joglekar, Ajinkya, Vaidya, Umesh, Krovi, Venkat
Contrary to on-road autonomous navigation, off-road autonomy is complicated by various factors ranging from sensing challenges to terrain variability. In such a milieu, data-driven approaches have been commonly employed to capture intricate vehicle-environment interactions effectively. However, the success of data-driven methods depends crucially on the quality and quantity of data, which can be compromised by large variability in off-road environments. To address these concerns, we present a novel workflow to recreate the exact vehicle and its target operating conditions digitally for domain-specific data generation. This enables us to effectively model off-road vehicle dynamics from simulation data using the Koopman operator theory, and employ the obtained models for local motion planning and optimal vehicle control. The capabilities of the proposed methodology are demonstrated through an autonomous navigation problem of a 1:5 scale vehicle, where a terrain-informed planner is employed for global mission planning. Results indicate a substantial improvement in off-road navigation performance with the proposed algorithm (5.84x) and underscore the efficacy of digital twinning in terms of improving the sample efficiency (3.2x) and reducing the sim2real gap (5.2%).
- North America > United States > South Carolina > Greenville County > Greenville (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- Automobiles & Trucks (1.00)
- Transportation > Ground > Road (0.49)
- Government > Military (0.35)
Asynchronous Approximate Byzantine Consensus: A Multi-hop Relay Method and Tight Graph Conditions
We study a multi-agent resilient consensus problem, where some agents are of the Byzantine type and try to prevent the normal ones from reaching consensus. In our setting, normal agents communicate with each other asynchronously over multi-hop relay channels with delays. To solve this asynchronous Byzantine consensus problem, we develop the multi-hop weighted mean subsequence reduced (MW-MSR) algorithm. The main contribution is that we characterize a tight graph condition for our algorithm to achieve Byzantine consensus, which is expressed in the novel notion of strictly robust graphs. We show that the multi-hop communication is effective for enhancing the network's resilience against Byzantine agents. As a result, we also obtain novel conditions for resilient consensus under the malicious attack model, which are tighter than those known in the literature. Furthermore, the proposed algorithm can be viewed as a generalization of the conventional flooding-based algorithms, with less computational complexity. Lastly, we provide numerical examples to show the effectiveness of the proposed algorithm.
- Asia > China (0.04)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- (2 more...)
Memory-Constrained Algorithms for Convex Optimization via Recursive Cutting-Planes
Blanchard, Moïse, Zhang, Junhui, Jaillet, Patrick
We propose a family of recursive cutting-plane algorithms to solve feasibility problems with constrained memory, which can also be used for first-order convex optimization. Precisely, in order to find a point within a ball of radius $\epsilon$ with a separation oracle in dimension $d$ -- or to minimize $1$-Lipschitz convex functions to accuracy $\epsilon$ over the unit ball -- our algorithms use $\mathcal O(\frac{d^2}{p}\ln \frac{1}{\epsilon})$ bits of memory, and make $\mathcal O((C\frac{d}{p}\ln \frac{1}{\epsilon})^p)$ oracle calls, for some universal constant $C \geq 1$. The family is parametrized by $p\in[d]$ and provides an oracle-complexity/memory trade-off in the sub-polynomial regime $\ln\frac{1}{\epsilon}\gg\ln d$. While several works gave lower-bound trade-offs (impossibility results) -- we explicit here their dependence with $\ln\frac{1}{\epsilon}$, showing that these also hold in any sub-polynomial regime -- to the best of our knowledge this is the first class of algorithms that provides a positive trade-off between gradient descent and cutting-plane methods in any regime with $\epsilon\leq 1/\sqrt d$. The algorithms divide the $d$ variables into $p$ blocks and optimize over blocks sequentially, with approximate separation vectors constructed using a variant of Vaidya's method. In the regime $\epsilon \leq d^{-\Omega(d)}$, our algorithm with $p=d$ achieves the information-theoretic optimal memory usage and improves the oracle-complexity of gradient descent.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
- North America > United States > Virginia (0.04)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- (2 more...)
Off-Road Navigation of Legged Robots Using Linear Transfer Operators
Moyalan, Joseph, Zheng, Andrew, Narayanan, Sriram S. K. S, Vaidya, Umesh
This paper presents the implementation of off-road navigation on legged robots using convex optimization through linear transfer operators. Given a traversability measure that captures the off-road environment, we lift the navigation problem into the density space using the Perron-Frobenius (P-F) operator. This allows the problem formulation to be represented as a convex optimization. Due to the operator acting on an infinite-dimensional density space, we use data collected from the terrain to get a finite-dimension approximation of the convex optimization. Results of the optimal trajectory for off-road navigation are compared with a standard iterative planner, where we show how our convex optimization generates a more traversable path for the legged robot compared to the suboptimal iterative planner.
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
- North America > United States > Iowa (0.04)
Algorithm for Constrained Markov Decision Process with Linear Convergence
Gladin, Egor, Lavrik-Karmazin, Maksim, Zainullina, Karina, Rudenko, Varvara, Gasnikov, Alexander, Takáč, Martin
The problem of constrained Markov decision process is considered. An agent aims to maximize the expected accumulated discounted reward subject to multiple constraints on its costs (the number of constraints is relatively small). A new dual approach is proposed with the integration of two ingredients: entropy regularized policy optimizer and Vaidya's dual optimizer, both of which are critical to achieve faster convergence. The finite-time error bound of the proposed approach is provided. Despite the challenge of the nonconcave objective subject to nonconcave constraints, the proposed approach is shown to converge (with linear rate) to the global optimum. The complexity expressed in terms of the optimality gap and the constraint violation significantly improves upon the existing primal-dual approaches.
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > New York (0.04)
- (2 more...)
Unlocking the value of artificial intelligence and machine learning
In an era of accelerated digitalisation, artificial intelligence (AI) and machine learning (ML) have fast become part of the IT infrastructure of many businesses. Consequently, how these technologies are being used to derive meaningful insights from vast quantities of data is maturing rapidly. "Early on, when organisations didn't have access to the computing power and zettabytes of data that they have today, AI was only springing up in pockets," says Vaidya JR, SVP and global head of data and AI at IT transformation specialist Hexaware Technologies, "The approach then was to see what AI could do for a company, without truly identifying a well-defined problem. Data science solutions were just a shot in the dark. "Organisations were struggling to put their data to effective use, which led to limited value generated and ineffectual business results," he adds. "You can crunch any amount of data, and create numerous models; it only adds value if there is a significant impact on the business.
Pune Startup Mantra: This personal Gyde makes your company's software easy to use
A less-talked about change in work behaviour during the "work-from-home" phase of our existence, is that of, "call IT", while using or installing an application or software. You can still call IT but the process is now a business opportunity with AI (artificial intelligence) and ML (machine learning) now primed to solve your issues. Cure the "paper-clip feature in Office applications, which provides, based on most user feedback, the kind of help that is needed. Identifying this need for handholding end-users, Gyde – a Pune-based startup – has created a software assistance platform and is on a mission to democratise software guidance and reduce the go-to-market time for companies. Founded by Prasanna Vaidya and Shubham Deshmukh in January 2018, Gyde uses a set of AI-based tools for educating software application users to drive actions for better on-boarding, adoption, engagement and customer success. After completing his BE Mechanical from Mumbai University in 2006, Vaidya changed course and started working in software development. After a short stint in USA, he returned to India with good exposure to artificial Intelligence (AI) and natural language processing (NLP) and machine learning (ML). His co-founder Deshmukh, is a computer engineering graduate from Pune and worked on a team led by Vaidya in the USA. The duo shared ideas and brainstormed about creating a B2C application which would be used by millions. After a lot of pivots, pilots and failed attempts, since 2015, Vaidya and Deshmukh finally realised that they both were from an engineering background and did not have the marketing nous for a B2C product. Says Deshmukh, "We had created a platform for creating chatbots at a hackathon, which we eventually won.
- North America > United States (0.45)
- Asia > India > Maharashtra > Mumbai (0.25)
- North America > Canada (0.05)
- Health & Medicine > Therapeutic Area (0.36)
- Information Technology (0.30)
An Improved Cutting Plane Method for Convex Optimization, Convex-Concave Games and its Applications
Jiang, Haotian, Lee, Yin Tat, Song, Zhao, Wong, Sam Chiu-wai
Given a separation oracle for a convex set $K \subset \mathbb{R}^n$ that is contained in a box of radius $R$, the goal is to either compute a point in $K$ or prove that $K$ does not contain a ball of radius $\epsilon$. We propose a new cutting plane algorithm that uses an optimal $O(n \log (\kappa))$ evaluations of the oracle and an additional $O(n^2)$ time per evaluation, where $\kappa = nR/\epsilon$. $\bullet$ This improves upon Vaidya's $O( \text{SO} \cdot n \log (\kappa) + n^{\omega+1} \log (\kappa))$ time algorithm [Vaidya, FOCS 1989a] in terms of polynomial dependence on $n$, where $\omega < 2.373$ is the exponent of matrix multiplication and $\text{SO}$ is the time for oracle evaluation. $\bullet$ This improves upon Lee-Sidford-Wong's $O( \text{SO} \cdot n \log (\kappa) + n^3 \log^{O(1)} (\kappa))$ time algorithm [Lee, Sidford and Wong, FOCS 2015] in terms of dependence on $\kappa$. For many important applications in economics, $\kappa = \Omega(\exp(n))$ and this leads to a significant difference between $\log(\kappa)$ and $\mathrm{poly}(\log (\kappa))$. We also provide evidence that the $n^2$ time per evaluation cannot be improved and thus our running time is optimal. A bottleneck of previous cutting plane methods is to compute leverage scores, a measure of the relative importance of past constraints. Our result is achieved by a novel multi-layered data structure for leverage score maintenance, which is a sophisticated combination of diverse techniques such as random projection, batched low-rank update, inverse maintenance, polynomial interpolation, and fast rectangular matrix multiplication. Interestingly, our method requires a combination of different fast rectangular matrix multiplication algorithms.
- Asia > Russia (0.04)
- North America > United States > Massachusetts (0.04)
- North America > United States > Virginia (0.04)
- (4 more...)