Agents
Novel Pigeon-inspired 3D Obstacle Detection and Avoidance Maneuver for Multi-UAV Systems
Ahmadvand, Reza, Sharif, Sarah Safura, Banad, Yaser Mike
-- Recent advances in multi - agent systems manipulation have demonstrated a rising demand for the implementation of multi - UAV systems in urban areas, which are always subjected to the presence of static and dynamic obstacles. Inspired by the collective behavior of tilapia fish and pigeons, the focus of the presented research is on the introduction of a nature - inspired collision - free formation control for a multi - UAV system, considering the obstacle avoidance maneuvers. The developed framework in this study utilizes a semi - distributed control approach, in which, based on a probabilistic Lloyd's algorithm, a centralized guidance algorithm works for optimal positioning of the UAVs, while a distributed control approach has been used for the intervehicle collision and obstacle avoidance. Further, the presented framework has been extended to the 3D space with a novel definition of 3D maneuvers. Collision Avoidance, Centroidal Voronoi Tessellation, Distributed Control, Formation Control, Multi - Agent System, Obstacle Avoidance . From an engineering perspective, swarm intelligence shows how decentralized systems, composed of numerous simple agents, can achieve complex collective behaviors.
SwarmFusion: Revolutionizing Disaster Response with Swarm Intelligence and Deep Learning
Disaster response requires rapid, adaptive decision-making in chaotic environments. SwarmFusion, a novel hybrid framework, integrates particle swarm optimization with convolutional neural networks to optimize real-time resource allocation and path planning. By processing live satellite, drone, and sensor data, SwarmFusion enhances situational awareness and operational efficiency in flood and wildfire scenarios. Simulations using the DisasterSim2025 dataset demonstrate up to 40 percentage faster response times and 90 percentage survivor coverage compared to baseline methods. This scalable, data-driven approach offers a transformative solution for time-critical disaster management, with potential applications across diverse crisis scenarios.
The Singapore Consensus on Global AI Safety Research Priorities
Bengio, Yoshua, Maharaj, Tegan, Ong, Luke, Russell, Stuart, Song, Dawn, Tegmark, Max, Xue, Lan, Zhang, Ya-Qin, Casper, Stephen, Lee, Wan Sie, Mindermann, Sรถren, Wilfred, Vanessa, Balachandran, Vidhisha, Barez, Fazl, Belinsky, Michael, Bello, Imane, Bourgon, Malo, Brakel, Mark, Campos, Simรฉon, Cass-Beggs, Duncan, Chen, Jiahao, Chowdhury, Rumman, Seah, Kuan Chua, Clune, Jeff, Dai, Juntao, Delaborde, Agnes, Dziri, Nouha, Eiras, Francisco, Engels, Joshua, Fan, Jinyu, Gleave, Adam, Goodman, Noah, Heide, Fynn, Heidecke, Johannes, Hendrycks, Dan, Hodes, Cyrus, Hsiang, Bryan Low Kian, Huang, Minlie, Jawhar, Sami, Jingyu, Wang, Kalai, Adam Tauman, Kamphuis, Meindert, Kankanhalli, Mohan, Kantamneni, Subhash, Kirk, Mathias Bonde, Kwa, Thomas, Ladish, Jeffrey, Lam, Kwok-Yan, Sie, Wan Lee, Lee, Taewhi, Li, Xiaojian, Liu, Jiajun, Lu, Chaochao, Mai, Yifan, Mallah, Richard, Michael, Julian, Moรซs, Nick, Mรถller, Simon, Nam, Kihyuk, Ng, Kwan Yee, Nitzberg, Mark, Nushi, Besmira, hรigeartaigh, Seรกn O, Ortega, Alejandro, Peignรฉ, Pierre, Petrie, James, Prud'Homme, Benjamin, Rabbany, Reihaneh, Sanchez-Pi, Nayat, Schwettmann, Sarah, Shlegeris, Buck, Siddiqui, Saad, Sinha, Aradhana, Soto, Martรญn, Tan, Cheston, Ting, Dong, Tjhi, William, Trager, Robert, Tse, Brian, H., Anthony Tung K., Wilfred, Vanessa, Willes, John, Wong, Denise, Xu, Wei, Xu, Rongwu, Zeng, Yi, Zhang, HongJiang, ลฝikeliฤ, Djordje
Rapidly improving AI capabilities and autonomy hold significant promise of transformation, but are also driving vigorous debate on how to ensure that AI is safe, i.e., trustworthy, reliable, and secure. Building a trusted ecosystem is therefore essential -- it helps people embrace AI with confidence and gives maximal space for innovation while avoiding backlash. The "2025 Singapore Conference on AI (SCAI): International Scientific Exchange on AI Safety" aimed to support research in this space by bringing together AI scientists across geographies to identify and synthesise research priorities in AI safety. This resulting report builds on the International AI Safety Report chaired by Yoshua Bengio and backed by 33 governments. By adopting a defence-in-depth model, this report organises AI safety research domains into three types: challenges with creating trustworthy AI systems (Development), challenges with evaluating their risks (Assessment), and challenges with monitoring and intervening after deployment (Control).
Service Placement in Small Cell Networks Using Distributed Best Arm Identification in Linear Bandits
Yahya, Mariam, Sezgin, Aydin, Maghsudi, Setareh
As users in small cell networks increasingly rely on computation-intensive services, cloud-based access often results in high latency. Multi-access edge computing (MEC) mitigates this by bringing computational resources closer to end users, with small base stations (SBSs) serving as edge servers to enable low-latency service delivery. However, limited edge capacity makes it challenging to decide which services to deploy locally versus in the cloud, especially under unknown service demand and dynamic network conditions. To tackle this problem, we model service demand as a linear function of service attributes and formulate the service placement task as a linear bandit problem, where SBSs act as agents and services as arms. The goal is to identify the service that, when placed at the edge, offers the greatest reduction in total user delay compared to cloud deployment. We propose a distributed and adaptive multi-agent best-arm identification (BAI) algorithm under a fixed-confidence setting, where SBSs collaborate to accelerate learning. Simulations show that our algorithm identifies the optimal service with the desired confidence and achieves near-optimal speedup, as the number of learning rounds decreases proportionally with the number of SBSs. We also provide theoretical analysis of the algorithm's sample complexity and communication overhead.
Resilient-Native and Intelligent Next-Generation Wireless Systems: Key Enablers, Foundations, and Applications
Bennis, Mehdi, Samarakoon, Sumudu, Alshammari, Tamara, Weeraddana, Chathuranga, Tian, Zhoujun, Issaid, Chaouki Ben
Just like power, water, and transportation systems, wireless networks are a crucial societal infrastructure. As natural and human-induced disruptions continue to grow, wireless networks must be resilient. This requires them to withstand and recover from unexpected adverse conditions, shocks, unmodeled disturbances and cascading failures. Unlike robustness and reliability, resilience is based on the understanding that disruptions will inevitably happen. Resilience, as elasticity, focuses on the ability to bounce back to favorable states, while resilience as plasticity involves agents and networks that can flexibly expand their states and hypotheses through real-time adaptation and reconfiguration. This situational awareness and active preparedness, adapting world models and counterfactually reasoning about potential system failures and the best responses, is a core aspect of resilience. This article will first disambiguate resilience from reliability and robustness, before delving into key mathematical foundations of resilience grounded in abstraction, compositionality and emergence. Subsequently, we focus our attention on a plethora of techniques and methodologies pertaining to the unique characteristics of resilience, as well as their applications through a comprehensive set of use cases. Ultimately, the goal of this paper is to establish a unified foundation for understanding, modeling, and engineering resilience in wireless communication systems, while laying a roadmap for the next-generation of resilient-native and intelligent wireless systems.
Knowledge-Guided Multi-Agent Framework for Automated Requirements Development: A Vision
Huang, Jiangping, Jin, Dongming, Sun, Weisong, Liu, Yang, Jin, Zhi
This paper envisions a knowledge-guided multi-agent framework named KGMAF for automated requirements development. KGMAF aims to address gaps in current automation systems for SE, which prioritize code development and overlook the complexities of requirements tasks. KGMAF is composed of six specialized agents and an artifact pool to improve efficiency and accuracy. Specifically, KGMAF outlines the functionality, actions, and knowledge of each agent and provides the conceptual design of the artifact pool. Our case study highlights the potential of KGMAF in real-world scenarios. Finally, we outline several research opportunities for implementing and enhancing automated requirements development using multi-agent systems. We believe that KGMAF will play a pivotal role in shaping the future of automated requirements development in the era of LLMs.
Learning Truthful Mechanisms without Discretization
Ma, Yunxuan, Wang, Siqiang, Duan, Zhijian, Cheng, Yukun, Deng, Xiaotie
This paper introduces TEDI (Truthful, Expressive, and Dimension-Insensitive approach), a discretization-free algorithm to learn truthful and utility-maximizing mechanisms. Existing learning-based approaches often rely on discretization of outcome spaces to ensure truthfulness, which leads to inefficiency with increasing problem size. To address this limitation, we formalize the concept of pricing rules, defined as functions that map outcomes to prices. Based on this concept, we propose a novel menu mechanism, which can be equivalent to a truthful direct mechanism under specific conditions. The core idea of TEDI lies in its parameterization of pricing rules using Partial GroupMax Network, a new network architecture designed to universally approximate partial convex functions. To learn optimal pricing rules, we develop novel training techniques, including covariance trick and continuous sampling, to derive unbiased gradient estimators compatible with first-order optimization. Theoretical analysis establishes that TEDI guarantees truthfulness, full expressiveness, and dimension-insensitivity. Experimental evaluation in the studied auction setting demonstrates that TEDI achieves strong performance, competitive with or exceeding state-of-the-art methods. This work presents the first approaches to learn truthful mechanisms without outcome discretization, thereby enhancing algorithmic efficiency. The proposed concepts, network architecture, and learning techniques might offer potential value and provide new insights for automated mechanism design and differentiable economics.
FairMarket-RL: LLM-Guided Fairness Shaping for Multi-Agent Reinforcement Learning in Peer-to-Peer Markets
Jadhav, Shrenik, Sevak, Birva, Das, Srijita, Hussain, Akhtar, Su, Wencong, Bui, Van-Hai
Peer-to-peer (P2P) trading is increasingly recognized as a key mechanism for decentralized market regulation, yet existing approaches often lack robust frameworks to ensure fairness. This paper presents FairMarket-RL, a novel hybrid framework that combines Large Language Models (LLMs) with Reinforcement Learning (RL) to enable fairness-aware trading agents. In a simulated P2P microgrid with multiple sellers and buyers, the LLM acts as a real-time fairness critic, evaluating each trading episode using two metrics: Fairness-To-Buyer (FTB) and Fairness-Between-Sellers (FBS). These fairness scores are integrated into agent rewards through scheduled ฮป-coefficients, forming an adaptive LLM-guided reward shaping loop that replaces brittle, rule-based fairness constraints. Agents are trained using Independent Proximal Policy Optimization (IPPO) and achieve equitable outcomes, fulfilling over 90% of buyer demand, maintaining fair seller margins, and consistently reaching FTB and FBS scores above 0.80. The training process demonstrates that fairness feedback improves convergence, reduces buyer shortfalls, and narrows profit disparities between sellers. With its language-based critic, the framework scales naturally, and its extension to a large power distribution system with household prosumers illustrates its practical applicability. FairMarket-RL thus offers a scalable, equity-driven solution for autonomous trading in decentralized energy systems.
Do LLMs Dream of Discrete Algorithms?
Coelho, Claudionor Jr, Li, Yanen, Tee, Philip
Large Language Models (LLMs) have rapidly transformed the landscape of artificial intelligence, enabling natural language interfaces and dynamic orchestration of software components. However, their reliance on probabilistic inference limits their effectiveness in domains requiring strict logical reasoning, discrete decision-making, and robust interpretability. This paper investigates these limitations and proposes a neurosymbolic approach that augments LLMs with logic-based reasoning modules, particularly leveraging Prolog predicates and composable toolsets. By integrating first-order logic and explicit rule systems, our framework enables LLMs to decompose complex queries into verifiable sub-tasks, orchestrate reliable solutions, and mitigate common failure modes such as hallucination and incorrect step decomposition. We demonstrate the practical benefits of this hybrid architecture through experiments on the DABStep benchmark, showing improved precision, coverage, and system documentation in multi-step reasoning tasks. Our results indicate that combining LLMs with modular logic reasoning restores engineering rigor, enhances system reliability, and offers a scalable path toward trustworthy, interpretable AI agents across complex domains.
Evaluating Multi-Agent Defences Against Jailbreaking Attacks on Large Language Models
Wit, Maria Carolina Cornelia, Pang, Jun
Recent advances in large language models (LLMs) have raised concerns about jailbreaking attacks, i.e., prompts that bypass safety mechanisms. This paper investigates the use of multi-agent LLM systems as a defence against such attacks. We evaluate three jailbreaking strategies, including the original AutoDefense attack and two from Deepleaps: BetterDan and JB. Reproducing the AutoDefense framework, we compare single-agent setups with two- and three-agent configurations. Our results show that multi-agent systems enhance resistance to jailbreaks, especially by reducing false negatives. However, its effectiveness varies by attack type, and it introduces trade-offs such as increased false positives and computational overhead. These findings point to the limitations of current automated defences and suggest directions for improving alignment robustness in future LLM systems.