Goto

Collaborating Authors

 cpa


Large Language Model-based Decision-making for COLREGs and the Control of Autonomous Surface Vehicles

Agyei, Klinsmann, Sarhadi, Pouria, Naeem, Wasif

arXiv.org Artificial Intelligence

In the field of autonomous surface vehicles (ASVs), devising decision-making and obstacle avoidance solutions that address maritime COLREGs (Collision Regulations), primarily defined for human operators, has long been a pressing challenge. Recent advancements in explainable Artificial Intelligence (AI) and machine learning have shown promise in enabling human-like decision-making. Notably, significant developments have occurred in the application of Large Language Models (LLMs) to the decision-making of complex systems, such as self-driving cars. The textual and somewhat ambiguous nature of COLREGs (from an algorithmic perspective), however, poses challenges that align well with the capabilities of LLMs, suggesting that LLMs may become increasingly suitable for this application soon. This paper presents and demonstrates the first application of LLM-based decision-making and control for ASVs. The proposed method establishes a high-level decision-maker that uses online collision risk indices and key measurements to make decisions for safe manoeuvres. A tailored design and runtime structure is developed to support training and real-time action generation on a realistic ASV model. Local planning and control algorithms are integrated to execute the commands for waypoint following and collision avoidance at a lower level. To the authors' knowledge, this study represents the first attempt to apply explainable AI to the dynamic control problem of maritime systems recognising the COLREGs rules, opening new avenues for research in this challenging area. Results obtained across multiple test scenarios demonstrate the system's ability to maintain online COLREGs compliance, accurate waypoint tracking, and feasible control, while providing human-interpretable reasoning for each decision.


PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis

Wu, Yan, Wershof, Esther, Schmon, Sebastian M, Nassar, Marcel, Osiński, Błażej, Eksi, Ridvan, Zhang, Kun, Graepel, Thore

arXiv.org Machine Learning

We present a comprehensive framework for predicting the effects of perturbations in single cells, designed to standardize benchmarking in this rapidly evolving field. Our framework, PerturBench, includes a user-friendly platform, diverse datasets, metrics for fair model comparison, and detailed performance analysis. Extensive evaluations of published and baseline models reveal limitations like mode or posterior collapse, and underscore the importance of rank metrics that assess the ordering of perturbations alongside traditional measures like RMSE. Our findings show that simple models can outperform more complex approaches. This benchmarking exercise sets new standards for model evaluation, supports robust model development, and advances the potential of these models to use high-throughput and high-content genetic and chemical screens for disease target discovery.


Limits of Theory of Mind Modelling in Dialogue-Based Collaborative Plan Acquisition

Bortoletto, Matteo, Ruhdorfer, Constantin, Abdessaied, Adnen, Shi, Lei, Bulling, Andreas

arXiv.org Artificial Intelligence

Recent work on dialogue-based collaborative plan acquisition (CPA) has suggested that Theory of Mind (ToM) modelling can improve missing knowledge prediction in settings with asymmetric skill-sets and knowledge. Although ToM was claimed to be important for effective collaboration, its real impact on this novel task remains under-explored. By representing plans as graphs and by exploiting task-specific constraints we show that, as performance on CPA nearly doubles when predicting one's own missing knowledge, the improvements due to ToM modelling diminish. This phenomenon persists even when evaluating existing baseline methods. To better understand the relevance of ToM for CPA, we report a principled performance comparison of models with and without ToM features. Results across different models and ablations consistently suggest that learned ToM features are indeed more likely to reflect latent patterns in the data with no perceivable link to ToM. This finding calls for a deeper understanding of the role of ToM in CPA and beyond, as well as new methods for modelling and evaluating mental states in computational collaborative agents.


SNOW-SCA: ML-assisted Side-Channel Attack on SNOW-V

Saurabh, Harshit, Golder, Anupam, Titti, Samarth Shivakumar, Kundu, Suparna, Li, Chaoyun, Karmakar, Angshuman, Das, Debayan

arXiv.org Artificial Intelligence

This paper presents SNOW-SCA, the first power side-channel analysis (SCA) attack of a 5G mobile communication security standard candidate, SNOW-V, running on a 32-bit ARM Cortex-M4 microcontroller. First, we perform a generic known-key correlation (KKC) analysis to identify the leakage points. Next, a correlation power analysis (CPA) attack is performed, which reduces the attack complexity to two key guesses for each key byte. The correct secret key is then uniquely identified utilizing linear discriminant analysis (LDA). The profiled SCA attack with LDA achieves 100% accuracy after training with $<200$ traces, which means the attack succeeds with just a single trace. Overall, using the \textit{combined CPA and LDA attack} model, the correct secret key byte is recovered with <50 traces collected using the ChipWhisperer platform. The entire 256-bit secret key of SNOW-V can be recovered incrementally using the proposed SCA attack. Finally, we suggest low-overhead countermeasures that can be used to prevent these SCA attacks.


Convergence of the Chambolle-Pock Algorithm in the Absence of Monotonicity

Evens, Brecht, Latafat, Puya, Patrinos, Panagiotis

arXiv.org Artificial Intelligence

The Chambolle-Pock algorithm (CPA), also known as the primal-dual hybrid gradient method (PDHG), has surged in popularity in the last decade due to its success in solving convex/monotone structured problems. This work provides convergence results for problems with varying degrees of (non)monotonicity, quantified through a so-called oblique weak Minty condition on the associated primal-dual operator. Our results reveal novel stepsize and relaxation parameter ranges which do not only depend on the norm of the linear mapping, but also on its other singular values. In particular, in nonmonotone settings, in addition to the classical stepsize conditions for CPA, extra bounds on the stepsizes and relaxation parameters are required. On the other hand, in the strongly monotone setting, the relaxation parameter is allowed to exceed the classical upper bound of two. Moreover, sufficient convergence conditions are obtained when the individual operators belong to the recently introduced class of semimonotone operators. Since this class of operators encompasses many traditional operator classes including (hypo)- and co(hypo)monotone operators, this analysis recovers and extends existing results for CPA. Several examples are provided for the aforementioned problem classes to demonstrate and establish tightness of the proposed stepsize ranges.


Two-step dynamic obstacle avoidance

Hart, Fabian, Waltz, Martin, Okhrin, Ostap

arXiv.org Artificial Intelligence

Dynamic obstacle avoidance (DOA) is a fundamental challenge for any autonomous vehicle, independent of whether it operates in sea, air, or land. This paper proposes a two-step architecture for handling DOA tasks by combining supervised and reinforcement learning (RL). In the first step, we introduce a data-driven approach to estimate the collision risk of an obstacle using a recurrent neural network, which is trained in a supervised fashion and offers robustness to non-linear obstacle movements. In the second step, we include these collision risk estimates into the observation space of an RL agent to increase its situational awareness.~We illustrate the power of our two-step approach by training different RL agents in a challenging environment that requires to navigate amid multiple obstacles. The non-linear movements of obstacles are exemplarily modeled based on stochastic processes and periodic patterns, although our architecture is suitable for any obstacle dynamics. The experiments reveal that integrating our collision risk metrics into the observation space doubles the performance in terms of reward, which is equivalent to halving the number of collisions in the considered environment. Furthermore, we show that the architecture's performance improvement is independent of the applied RL algorithm.


Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning using Independent Component Analysis

Kariyappa, Sanjay, Guo, Chuan, Maeng, Kiwan, Xiong, Wenjie, Suh, G. Edward, Qureshi, Moinuddin K, Lee, Hsien-Hsin S.

arXiv.org Artificial Intelligence

Federated learning (FL) aims to perform privacy-preserving machine learning on distributed data held by multiple data owners. To this end, FL requires the data owners to perform training locally and share the gradient updates (instead of the private inputs) with the central server, which are then securely aggregated over multiple data owners. Although aggregation by itself does not provably offer privacy protection, prior work showed that it may suffice if the batch size is sufficiently large. In this paper, we propose the Cocktail Party Attack (CPA) that, contrary to prior belief, is able to recover the private inputs from gradients aggregated over a very large batch size. CPA leverages the crucial insight that aggregate gradients from a fully connected layer is a linear combination of its inputs, which leads us to frame gradient inversion as a blind source separation (BSS) problem (informally called the cocktail party problem). We adapt independent component analysis (ICA)--a classic solution to the BSS problem--to recover private inputs for fully-connected and convolutional networks, and show that CPA significantly outperforms prior gradient inversion attacks, scales to ImageNet-sized inputs, and works on large batch sizes of up to 1024.


AI-Powered Mobile Tax App; Interview with Jaideep Singh, CEO and Co-Founder of FlyFin

#artificialintelligence

FlyFin is a new AI-powered mobile tax app for freelancers, creators, gig workers, and the self-employed. This fintech company plans to disrupt the market for individual tax preparation and filing, saving people thousands and eliminating nightmare scenarios on tax day. Jaideep Singh, CEO and Co-Founder of FlyFin will be sharing more information with us in this exclusive interview with TechBullion. Prior to FlyFin, I was an early adopter of AI, as I built Spock, the industry's first and largest people search engine, indexing over 1 billion people representing 1.5 trillion data records. As both a VC and entrepreneur, I've focused on finding disruptive industry startups to invest in, creating more than $3B in value for companies.


AI predicts effective drug combinations to fight complex diseases faster

#artificialintelligence

Finding new ways to repurpose or combine existing drugs has proved to be a powerful tool to treat complex diseases. Drugs used to treat one type of cancer, for instance, have effectively strengthened treatments for other cancer cells. Complex malignant tumors often require a combination of drugs, or "drug cocktails," to formulate a concerted attack on multiple cell types. Drug cocktails can not only help stave off drug resistance but also minimize harmful side effects. But finding an effective combination of existing drugs at the right dose is extremely challenging, partly because there are near-infinite possibilities.


Facebook unveils AI model to mix up cancer-curing cocktails with existing drugs

#artificialintelligence

No longer content to simply provide a platform for keeping up with long-lost cousins and spreading conspiracy theories, Facebook has branched out into developing artificial intelligence to help treat complex diseases. The social media giant's AI research department and the Helmholtz Zentrum München, a research center in Germany focused on environmental health, unveiled an open-source AI model designed to determine the viability of repurposing existing drugs into new pharmaceutical cocktails. Researchers and biologists now have free access to the Compositional Perturbation Autoencoder, or CPA, which evaluates the effects of drug combinations in varying dosages--a complicated task, as the number of possibilities can accelerate exponentially into the billions as more medicines are thrown into the mix. The model predicts not only how the drugs interact with one another, but also how they might work together to attack specific cell types and interrupt diseases. The researchers trained the machine learning model on single-cell RNA sequencing data, to help it gauge the effects of drug cocktails on individual cells without requiring drug- or cell-specific programming.