Optimization
AGI Enabled Solutions For IoX Layers Bottlenecks In Cyber-Physical-Social-Thinking Space
Khelloufi, Amar, Ning, Huansheng, Dhelim, Sahraoui, Ding, Jianguo
The integration of the Internet of Everything (IoX) and Artificial General Intelligence (AGI) has given rise to a transformative paradigm aimed at addressing critical bottlenecks across sensing, network, and application layers in Cyber-Physical-Social Thinking (CPST) ecosystems. In this survey, we provide a systematic and comprehensive review of AGI-enhanced IoX research, focusing on three key components: sensing-layer data management, network-layer protocol optimization, and application-layer decision-making frameworks. Specifically, this survey explores how AGI can mitigate IoX bottlenecks challenges by leveraging adaptive sensor fusion, edge preprocessing, and selective attention mechanisms at the sensing layer, while resolving network-layer issues such as protocol heterogeneity and dynamic spectrum management, neuro-symbolic reasoning, active inference, and causal reasoning, Furthermore, the survey examines AGI-enabled frameworks for managing identity and relationship explosion. Key findings suggest that AGI-driven strategies, such as adaptive sensor fusion, edge preprocessing, and semantic modeling, offer novel solutions to sensing-layer data overload, network-layer protocol heterogeneity, and application-layer identity explosion. The survey underscores the importance of cross-layer integration, quantum-enabled communication, and ethical governance frameworks for future AGI-enabled IoX systems. Finally, the survey identifies unresolved challenges, such as computational requirements, scalability, and real-world validation, calling for further research to fully realize AGI's potential in addressing IoX bottlenecks. we believe AGI-enhanced IoX is emerging as a critical research field at the intersection of interconnected systems and advanced AI.
Quantum computing and artificial intelligence: status and perspectives
Acampora, Giovanni, Ambainis, Andris, Ares, Natalia, Banchi, Leonardo, Bhardwaj, Pallavi, Binosi, Daniele, Briggs, G. Andrew D., Calarco, Tommaso, Dunjko, Vedran, Eisert, Jens, Ezratty, Olivier, Erker, Paul, Fedele, Federico, Gil-Fuster, Elies, Gärttner, Martin, Granath, Mats, Heyl, Markus, Kerenidis, Iordanis, Klusch, Matthias, Kockum, Anton Frisk, Kueng, Richard, Krenn, Mario, Lässig, Jörg, Macaluso, Antonio, Maniscalco, Sabrina, Marquardt, Florian, Michielsen, Kristel, Muñoz-Gil, Gorka, Müssig, Daniel, Nautrup, Hendrik Poulsen, Neubauer, Sophie A., van Nieuwenburg, Evert, Orus, Roman, Schmiedmayer, Jörg, Schmitt, Markus, Slusallek, Philipp, Vicentini, Filippo, Weitenberg, Christof, Wilhelm, Frank K.
This white paper discusses and explores the various points of intersection between quantum computing and artificial intelligence (AI). It describes how quantum computing could support the development of innovative AI solutions. It also examines use cases of classical AI that can empower research and development in quantum technologies, with a focus on quantum computing and quantum sensing. The purpose of this white paper is to provide a long-term research agenda aimed at addressing foundational questions about how AI and quantum computing interact and benefit one another. It concludes with a set of recommendations and challenges, including how to orchestrate the proposed theoretical work, align quantum AI developments with quantum hardware roadmaps, estimate both classical and quantum resources - especially with the goal of mitigating and optimizing energy consumption - advance this emerging hybrid software engineering discipline, and enhance European industrial competitiveness while considering societal implications.
A comprehensive control architecture for semi-autonomous dual-arm robots in agriculture settings
Palmieri, Jozsef, Di Lillo, Paolo, Chiaverini, Stefano, Marino, Alessandro
The adoption of mobile robotic platforms in complex environments, such as agricultural settings, requires these systems to exhibit a flexible yet effective architecture that integrates perception and control. In such scenarios, several tasks need to be accomplished simultaneously, ranging from managing robot limits to performing operational tasks and handling human inputs. The purpose of this paper is to present a comprehensive control architecture for achieving complex tasks such as robotized harvesting in vineyards within the framework of the European project CANOPIES. In detail, a 16-DOF dual-arm mobile robot is employed, controlled via a Hierarchical Quadratic Programming (HQP) approach capable of handling both equality and inequality constraints at various priorities to harvest grape bunches selected by the perception system developed within the project. Furthermore, given the complexity of the scenario and the uncertainty in the perception system, which could potentially lead to collisions with the environment, the handling of interaction forces is necessary. Remarkably, this was achieved using the same HQP framework. This feature is further leveraged to enable semi-autonomous operations, allowing a human operator to assist the robotic counterpart in completing harvesting tasks. Finally, the obtained results are validated through extensive testing conducted first in a laboratory environment to prove individual functionalities, then in a real vineyard, encompassing both autonomous and semi-autonomous grape harvesting operations.
Stabilization of industrial processes with time series machine learning
Anoshin, Matvei, Tsurkan, Olga, Lopatkin, Vadim, Fedichkin, Leonid
An application of machine learning to the industrial processes stabilization is an open problem which promises a huge potential benefit to the such critical industries as metals and energy development if solved. Classical optimization methods, such as finite-horizon markov decision processes [1], non-linear programming reformulation of control [2] and point-wise optimization [3] are frequently employed in order to achieve better stability of time series process, successfully improving production quality, minimizing expenses and manufacturing devices deficiency with near-future planing or real-time optimization. Machine learning, known for its prominent results in solution of enterprise problems [4], became widely applied to the time series prediction and generation after recent advances in such fields as natural language processing, due to the similarity aforementioned tasks in their time dependent recurrent nature [5]. Thus, contemporary time series modeling is performed with long short-term memory (LSTM) models [6] and Transformers [7], incorporating different attention strategies. Currently, state-of-the-art approaches to ML-driven optimization include an application of reinforcement learning, but for time series problems, the usual focus stays on approximation of the industrial process as a dynamic system on the basis of recurrent neural network (RNN), with such methods as recurrent stabilization control [8, 9].
Unsupervised Learning-Based Joint Resource Allocation and Beamforming Design for RIS-Assisted MISO-OFDMA Systems
Ma, Yu, Zhou, Xingyu, Li, Xiao, Liang, Le, Jin, Shi
--Reconfigurable intelligent surface (RIS) is regarded as one of the pivotal technologies for sixth-generation wireless communication systems. This paper investigates the downlink transmission of an RIS-assisted multiple-input single-output (MISO) orthogonal frequency division multiple access (OFDMA) communication systems. T o achieve a high system sum rate with low computational complexity, we develop a two-stage unsupervised learning based approach with customized loss function for the RIS reflection phase shift design, active beamforming at base station (BS) and time-frequency resource block (RB) allocation. The proposed approach consists of two neural networks: BeamNet, which takes channel state information (CSI) as input to predict the RIS reflection phase shift, and AllocationNet, which generates RB allocation decisions based on the equivalent CSI from the BS to the users, where the equivalent CSI is obtained by combining the original CSI with the RIS reflection phase shifts predicted by BeamNet. The active beamforming is implemented using the maximum ratio transmission and water-filling algorithm. In order to incorporate the discrete constraints of RIS reflection phase shift and RB allocation decisions into the network while maintaining network differentiability, we introduce a quantization function and the Gumbel softmax trick into BeamNet and AllocationNet, respectively. Furthermore, a customized loss function and phased training strategy are devised to enhance training efficiency and address quality-of-service constraints. Simulation results demonstrate that the proposed approach achieves 99.93% of the system sum rate of the successive convex approximation (SCA) method while requiring only 0.036% of its runtime. Additionally, the method's effectiveness and robustness are validated under different delay tap numbers, user distributions, and Rician factors, demonstrating its strong adaptability to different communication environments. OW ADA YS, with the large-scale deployment of fifth-generation wireless communication systems (5G), the focus of research has gradually shifted to sixth-generation wireless communication systems (6G). Y u Ma, Xingyu Zhou, Xiao Li, and Shi Jin are with the National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China (e-mail: yuma@seu.edu.cn;
Scalable Hypergraph Structure Learning with Diverse Smoothness Priors
Brown, Benjamin T., Zhang, Haoxiang, Lau, Daniel L., Arce, Gonzalo R.
-- In graph signal processing, learning the weighted connections between nodes from a set of sample signals is a fundamental task when the underlying relationships are not known a priori. This task is typically addressed by finding a graph Laplacian on which the observed signals are smooth. With the extension of graphs to hypergraphs - where edges can connect more than two nodes - graph learning methods have similarly been generalized to hypergraphs. However, the absence of a unified framework for calculating total variation has led to divergent definitions of smoothness and, consequently, differing approaches to hyperedge recovery. We confront this challenge through generalization of several previously proposed hypergraph total variations, subsequently allowing ease of substitution into a vector based optimization. T o this end, we propose a novel hypergraph learning method that recovers a hypergraph topology from time-series signals based on a smoothness prior . Our approach, designated as Hypergraph Structure Learning with Smoothness (HSLS), addresses key limitations in prior works, such as hyperedge selection and convergence issues, by formulating the problem as a convex optimization solved via a forward-backward-forward algorithm, ensuring guaranteed convergence. Additionally, we introduce a process that simultaneously limits the span of the hyperedge search and maintains a valid hyperedge selection set. In doing so, our method becomes scalable in increasingly complex network structures. The experimental results demonstrate improved performance, in terms of accuracy, over other state-of-the-art hypergraph inference methods; furthermore, we empirically show our method to be robust to total variation terms, biased towards global smoothness, and scalable to larger hypergraphs. YPERGRAPHS are considered as generalized graphs that capture higher order relationships [1]. While graphs encode pairwise relationships between nodes through edges, the higher order nature of hypergraphs extends node relations to allow an arbitrary number of nodes to be connected by a hyperedge. Figure 1 contains a sample hypergraph displaying these higher order connections where nodes are considered workers and hyperedges connect workers who are collaborating on a project. B. T. Brown, H. Zhang, and D. L. Lau are with the Department of Electrical and Computer Engineering, University of Kentucky, Lexington, KY 40506, USA. G. R. Arce is with the Department of Electrical and Computer Engineering, University of Delaware, Newark, DE 19716, USA. This work was partially supported by the National Science Foundation under grants 1815992 and 1816003 and the AFOSR award FA9550-22-1-0362.
RoomCraft: Controllable and Complete 3D Indoor Scene Generation
Zhou, Mengqi, Wang, Xipeng, Wang, Yuxi, Zhang, Zhaoxiang
Generating realistic 3D indoor scenes from user inputs remains a challenging problem in computer vision and graphics, requiring careful balance of geometric consistency, spatial relationships, and visual realism. While neural generation methods often produce repetitive elements due to limited global spatial reasoning, procedural approaches can leverage constraints for controllable generation but struggle with multi-constraint scenarios. When constraints become numerous, object collisions frequently occur, forcing the removal of furniture items and compromising layout completeness. To address these limitations, we propose RoomCraft, a multi-stage pipeline that converts real images, sketches, or text descriptions into coherent 3D indoor scenes. Our approach combines a scene generation pipeline with a constraint-driven optimization framework. The pipeline first extracts high-level scene information from user inputs and organizes it into a structured format containing room type, furniture items, and spatial relations. It then constructs a spatial relationship network to represent furniture arrangements and generates an optimized placement sequence using a heuristic-based depth-first search (HDFS) algorithm to ensure layout coherence. To handle complex multi-constraint scenarios, we introduce a unified constraint representation that processes both formal specifications and natural language inputs, enabling flexible constraint-oriented adjustments through a comprehensive action space design. Additionally, we propose a Conflict-Aware Positioning Strategy (CAPS) that dynamically adjusts placement weights to minimize furniture collisions and ensure layout completeness. Extensive experiments demonstrate that RoomCraft significantly outperforms existing methods in generating realistic, semantically coherent, and visually appealing room layouts across diverse input modalities.
Inverse Design of Diffractive Metasurfaces Using Diffusion Models
Hen, Liav, Yosef, Erez, Raviv, Dan, Giryes, Raja, Scheuer, Jacob
Metasurfaces are ultra-thin optical elements composed of engineered sub-wavelength structures that enable precise control of light. Their inverse design - determining a geometry that yields a desired optical response - is challenging due to the complex, nonlinear relationship between structure and optical properties. This often requires expert tuning, is prone to local minima, and involves significant computational overhead. In this work, we address these challenges by integrating the generative capabilities of diffusion models into computational design workflows. Using an RCWA simulator, we generate training data consisting of metasurface geometries and their corresponding far-field scattering patterns. We then train a conditional diffusion model to predict meta-atom geometry and height from a target spatial power distribution at a specified wavelength, sampled from a continuous supported band. Once trained, the model can generate metasurfaces with low error, either directly using RCWA-guided posterior sampling or by serving as an initializer for traditional optimization methods. We demonstrate our approach on the design of a spatially uniform intensity splitter and a polarization beam splitter, both produced with low error in under 30 minutes. To support further research in data-driven metasurface design, we publicly release our code and datasets.
Interactive Multi-Objective Probabilistic Preference Learning with Soft and Hard Bounds
Chen, Edward, Truong, Sang T., Dullerud, Natalie, Koyejo, Sanmi, Guestrin, Carlos
High-stakes decision-making involves navigating multiple competing objectives with expensive evaluations. For instance, in brachytherapy, clinicians must balance maximizing tumor coverage (e.g., an aspirational target or soft bound of >95% coverage) against strict organ dose limits (e.g., a non-negotiable hard bound of <601 cGy to the bladder), with each plan evaluation being resource-intensive. Selecting Pareto-optimal solutions that match implicit preferences is challenging, as exhaustive Pareto frontier exploration is computationally and cognitively prohibitive, necessitating interactive frameworks to guide users. While decision-makers (DMs) often possess domain knowledge to narrow the search via such soft-hard bounds, current methods often lack systematic approaches to iteratively refine these multi-faceted preference structures. Critically, DMs must trust their final decision, confident they haven't missed superior alternatives; this trust is paramount in high-consequence scenarios. We present Active-MoSH, an interactive local-global framework designed for this process. Its local component integrates soft-hard bounds with probabilistic preference learning, maintaining distributions over DM preferences and bounds for adaptive Pareto subset refinement. This is guided by an active sampling strategy optimizing exploration-exploitation while minimizing cognitive burden. To build DM trust, Active-MoSH's global component, T-MoSH, leverages multi-objective sensitivity analysis to identify potentially overlooked, high-value points beyond immediate feedback. We demonstrate Active-MoSH's performance benefits through diverse synthetic and real-world applications. A user study on AI-generated image selection further validates our hypotheses regarding the framework's ability to improve convergence, enhance DM trust, and provide expressive preference articulation, enabling more effective DMs.
Joint Task Offloading and Resource Allocation in Low-Altitude MEC via Graph Attention Diffusion
Xue, Yifan, Liang, Ruihuai, Yang, Bo, Cao, Xuelin, Yu, Zhiwen, Debbah, Mérouane, Yuen, Chau
With the rapid development of the low-altitude economy, air-ground integrated multi-access edge computing (MEC) systems are facing increasing demands for real-time and intelligent task scheduling. In such systems, task offloading and resource allocation encounter multiple challenges, including node heterogeneity, unstable communication links, and dynamic task variations. To address these issues, this paper constructs a three-layer heterogeneous MEC system architecture for low-altitude economic networks, encompassing aerial and ground users as well as edge servers. The system is systematically modeled from the perspectives of communication channels, computational costs, and constraint conditions, and the joint optimization problem of offloading decisions and resource allocation is uniformly abstracted into a graph-structured modeling task. On this basis, we propose a graph attention diffusion-based solution generator (GADSG). This method integrates the contextual awareness of graph attention networks with the solution distribution learning capability of diffusion models, enabling joint modeling and optimization of discrete offloading variables and continuous resource allocation variables within a high-dimensional latent space. We construct multiple simulation datasets with varying scales and topologies. Extensive experiments demonstrate that the proposed GADSG model significantly outperforms existing baseline methods in terms of optimization performance, robustness, and generalization across task structures, showing strong potential for efficient task scheduling in dynamic and complex low-altitude economic network environments.