Energy
A Hybrid Multi-Well Hopfield-CNN with Feature Extraction and K-Means for MNIST Classification
This study presents a hybrid model for classifying handwritten digits in the MNIST dataset, combining convolutional neural networks (CNNs) with a multi-well Hopfield network. The approach employs a CNN to extract high-dimensional features from input images, which are then clustered into class-specific prototypes using k-means clustering. These prototypes serve as attractors in a multi-well energy landscape, where a Hopfield network performs classification by minimizing an energy function that balances feature similarity and class assignment.The model's design enables robust handling of intraclass variability, such as diverse handwriting styles, while providing an interpretable framework through its energy-based decision process. Through systematic optimization of the CNN architecture and the number of wells, the model achieves a high test accuracy of 99.2% on 10,000 MNIST images, demonstrating its effectiveness for image classification tasks. The findings highlight the critical role of deep feature extraction and sufficient prototype coverage in achieving high performance, with potential for broader applications in pattern recognition.
Safe Deep Reinforcement Learning for Resource Allocation with Peak Age of Information Violation Guarantees
Reyhan, Berire Gunes, Coleri, Sinem
In Wireless Networked Control Systems (WNCSs), control and communication systems must be co-designed due to their strong interdependence. This paper presents a novel optimization theory-based safe deep reinforcement learning (DRL) framework for ultra-reliable WNCSs, ensuring constraint satisfaction while optimizing performance, for the first time in the literature. The approach minimizes power consumption under key constraints, including Peak Age of Information (PAoI) violation probability, transmit power, and schedulability in the finite blocklength regime. PAoI violation probability is uniquely derived by combining stochastic maximum allowable transfer interval (MATI) and maximum allowable packet delay (MAD) constraints in a multi-sensor network. The framework consists of two stages: optimization theory and safe DRL. The first stage derives optimality conditions to establish mathematical relationships among variables, simplifying and decomposing the problem. The second stage employs a safe DRL model where a teacher-student framework guides the DRL agent (student). The control mechanism (teacher) evaluates compliance with system constraints and suggests the nearest feasible action when needed. Extensive simulations show that the proposed framework outperforms rule-based and other optimization theory based DRL benchmarks, achieving faster convergence, higher rewards, and greater stability.
Towards AI-Native RAN: An Operator's Perspective of 6G Day 1 Standardization
Li, Nan, Sun, Qi, Wang, Lehan, Xu, Xiaofei, Huang, Jinri, Liu, Chunhui, Gao, Jing, Huang, Yuhong, I, Chih-Lin
Artificial Intelligence/Machine Learning (AI/ML) has become the most certain and prominent feature of 6G mobile networks. Unlike 5G, where AI/ML was not natively integrated but rather an add-on feature over existing architecture, 6G shall incorporate AI from the onset to address its complexity and support ubiquitous AI applications. Based on our extensive mobile network operation and standardization experience from 2G to 5G, this paper explores the design and standardization principles of AI-Native radio access networks (RAN) for 6G, with a particular focus on its critical Day 1 architecture, functionalities and capabilities. We investigate the framework of AI-Native RAN and present its three essential capabilities to shed some light on the standardization direction; namely, AI-driven RAN processing/optimization/automation, reliable AI lifecycle management (LCM), and AI-as-a-Service (AIaaS) provisioning. The standardization of AI-Native RAN, in particular the Day 1 features, including an AI-Native 6G RAN architecture, were proposed. For validation, a large-scale field trial with over 5000 5G-A base stations have been built and delivered significant improvements in average air interface latency, root cause identification, and network energy consumption with the proposed architecture and the supporting AI functions. This paper aims to provide a Day 1 framework for 6G AI-Native RAN standardization design, balancing technical innovation with practical deployment.
PanMatch: Unleashing the Potential of Large Vision Models for Unified Matching Models
Zhang, Yongjian, Wang, Longguang, Li, Kunhong, Zhang, Ye, Wang, Yun, Lin, Liang, Guo, Yulan
Abstract--This work presents PanMatch, a versatile foundation model for robust correspondence matching. Unlike previous methods that rely on task-specific architectures and domain-specific fine-tuning to support tasks like stereo matching, optical flow or feature matching, our key insight is that any two-frame correspondence matching task can be addressed within a 2D displacement estimation framework using the same model weights. Such a formulation eliminates the need for designing specialized unified architectures or task-specific ensemble models. Instead, it achieves multi-task integration by endowing displacement estimation algorithms with unprecedented generalization capabilities. T o this end, we highlight the importance of a robust feature extractor applicable across multiple domains and tasks, and propose the feature transformation pipeline that leverage all-purpose features from Large Vision Models to endow matching baselines with zero-shot cross-view matching capabilities. Furthermore, we assemble a cross-domain dataset with near 1.8 million samples from stereo matching, optical flow, and feature matching domains to pretrain PanMatch. We demonstrate the versatility of PanMatch across a wide range of domains and downstream tasks using the same model weights . Our model outperforms UniMatch and Flow-Anything on cross-task evaluations, and achieves comparable performance to most state-of-the-art task-specific algorithms on task-oriented benchmarks. Additionally, PanMatch presents unprecedented zero-shot performance in abnormal scenarios, such as rainy day and satellite imagery, where most existing robust algorithms fail to yield meaningful results. This technique serves as the foundation for various real-world applications, including stereo matching for driving and navigation, optical flow for video editing and action recognition, and feature matching for 3D reconstruction. Previous research developed specialized architectures and model weights for specific correspondence tasks due to significant difference in task settings, as outlined in T able 1. For instance, stereo matching operates on a pair of synchronized, rectified images and identifies correspondences along horizontal epipolar lines. Feature matching focus on finding reliable correspondence of rigid scenes from varying camera poses and times. Optical flow estimates pixel-wise displacements in dynamic scenes over consecutive frames. Using task-specific priors to construct models simplifies network design and enhances inference efficiency . However, these individual pipelines inherently limits the adaptability of the algorithms across tasks, resulting in numerous specialized architectures and weights for different scenarios, which complicates real-world deployment.
Intelligent Control of Spacecraft Reaction Wheel Attitude Using Deep Reinforcement Learning
El-Dalahmeh, Ghaith, Jabbarpour, Mohammad Reza, Vo, Bao Quoc, Kowalczyk, Ryszard
Reliable satellite attitude control is essential for the success of space missions, particularly as satellites increasingly operate autonomously in dynamic and uncertain environments. Reaction wheels (RWs) play a pivotal role in attitude control, and maintaining control resilience during RW faults is critical to preserving mission objectives and system stability. However, traditional Proportional Derivative (PD) controllers and existing deep reinforcement learning (DRL) algorithms such as TD3, PPO, and A2C often fall short in providing the real time adaptability and fault tolerance required for autonomous satellite operations. This study introduces a DRL-based control strategy designed to improve satellite resilience and adaptability under fault conditions. Specifically, the proposed method integrates Twin Delayed Deep Deterministic Policy Gradient (TD3) with Hindsight Experience Replay (HER) and Dimension Wise Clipping (DWC) referred to as TD3-HD to enhance learning in sparse reward environments and maintain satellite stability during RW failures. The proposed approach is benchmarked against PD control and leading DRL algorithms. Experimental results show that TD3-HD achieves significantly lower attitude error, improved angular velocity regulation, and enhanced stability under fault conditions. These findings underscore the proposed method potential as a powerful, fault tolerant, onboard AI solution for autonomous satellite attitude control.
Audit, Alignment, and Optimization of LM-Powered Subroutines with Application to Public Comment Processing
Raab, Reilly, Parker, Mike, Nally, Dan, Montgomery, Sadie, Bernat, Anastasia, Munikoti, Sai, Horawalavithana, Sameera
Contemporary organizations have shown great interest in integrating language models (LMs) into workflows traditionally performed by human subject matter experts (SMEs), such as in medical diagnostics (Artsi et al., 2025), legal assistance (Padiu et al., 2024), financial risk analysis (AI21 labs, 2025), and governmental permitting or regulatory reviews (Phan et al., 2024). Despite this interest, however, the use of LMs (e.g., via a standard conversational interface) in high-stakes contexts is constrained by the need for decision-making reliability, objectivity, transparency, and accountability that SMEs currently provide (Mori, 2024). Effective reconciliation between LMs and SMEs thus represents a critical frontier in real-world deployments of artificial intelligence. LMs have demonstrated remarkable capabilities in extracting information from large volumes of multi-modal, multi-domain data; synthesizing multi-document concepts; and performing tasks associated with basic reasoning. Nonetheless, LMs are susceptible to "hallucinations" (i.e., inaccurate generation) (Ji et al., 2023), difficulty in handling nuanced, domain-specific requirements (Ashqar, 2025), historical biases inherited from training data (Ranjan et al., 2024), and opaque reasoning in decision-making (Machot et al., 2024). Notably, these weaknesses are often precisely the strengths of SMEs, who are conversely burdened with the inefficient and labor-intensive tasks of cross-document, multi-modal search and information extraction. We can see the need to delineate and integrate the often low-stakes or tedious work that can be performed by LMs with the discerning, high-stakes decision-making tasks performed by SMEs in the real world: The challenge is to harness the time efficiency and broad knowledge capabilities of LMs while preserving the domain expertise, contextual judgment, oversight, and accountability of SMEs. Moreover, we must do so without creating additional burdens for SMEs to work with LMs (e.g., "prompt-engineering" or manual review of all LM tasks), and we wish to minimize the introduction of new risks (e.g., a loss of clarity regarding where or how LMs may be used by each SME, or, in the case of governmental work, the erosion of public trust). In this work, we propose a novel auditable and interactive refinement framework for the effective integration of LMs with SMEs for decision-making workflows.
A Hybrid Multilayer Extreme Learning Machine for Image Classification with an Application to Quadcopters
Hernandez-Hernandez, Rolando A., Rubio-Solis, Adrian
Multilayer Extreme Learning Machine (ML-ELM) and its variants have proven to be an effective technique for the classification of different natural signals such as audio, video, acoustic and images. In this paper, a Hybrid Multilayer Extreme Learning Machine (HML-ELM) that is based on ELM-based autoencoder (ELM-AE) and an Interval Type-2 fuzzy Logic theory is suggested for active image classification and applied to Unmanned Aerial Vehicles (UAVs). The proposed methodology is a hierarchical ELM learning framework that consists of two main phases: 1) self-taught feature extraction and 2) supervised feature classification. First, unsupervised multilayer feature encoding is achieved by stacking a number of ELM-AEs, in which input data is projected into a number of high-level representations. At the second phase, the final features are classified using a novel Simplified Interval Type-2 Fuzzy ELM (SIT2-FELM) with a fast output reduction layer based on the SC algorithm; an improved version of the algorithm Center of Sets Type Reducer without Sorting Requirement (COSTRWSR). To validate the efficiency of the HML-ELM, two types of experiments for the classification of images are suggested. First, the HML-ELM is applied to solve a number of benchmark problems for image classification. Secondly, a number of real experiments to the active classification and transport of four different objects between two predefined locations using a UAV is implemented. Experiments demonstrate that the proposed HML-ELM delivers a superior efficiency compared to other similar methodologies such as ML-ELM, Multilayer Fuzzy Extreme Learning Machine (ML-FELM) and ELM.
Learning-aided Bigraph Matching Approach to Multi-Crew Restoration of Damaged Power Networks Coupled with Road Transportation Networks
Maurer, Nathan, Kaushik, Harshal, Jacob, Roshni Anna, Zhang, Jie, Chowdhury, Souma
The resilience of critical infrastructure networks (CINs) after disruptions, such as those caused by natural hazards, depends on both the speed of restoration and the extent to which operational functionality can be regained. Allocating resources for restoration is a combinatorial optimal planning problem that involves determining which crews will repair specific network nodes and in what order. This paper presents a novel graph-based formulation that merges two interconnected graphs, representing crew and transportation nodes and power grid nodes, into a single heterogeneous graph. To enable efficient planning, graph reinforcement learning (GRL) is integrated with bigraph matching. GRL is utilized to design the incentive function for assigning crews to repair tasks based on the graph-abstracted state of the environment, ensuring generalization across damage scenarios. Two learning techniques are employed: a graph neural network trained using Proximal Policy Optimization and another trained via Neuroevolution. The learned incentive functions inform a bipartite graph that links crews to repair tasks, enabling weighted maximum matching for crew-to-task allocations. An efficient simulation environment that pre-computes optimal node-to-node path plans is used to train the proposed restoration planning methods. An IEEE 8500-bus power distribution test network coupled with a 21 square km transportation network is used as the case study, with scenarios varying in terms of numbers of damaged nodes, depots, and crews. Results demonstrate the approach's generalizability and scalability across scenarios, with learned policies providing 3-fold better performance than random policies, while also outperforming optimization-based solutions in both computation time (by several orders of magnitude) and power restored.
TS-SNN: Temporal Shift Module for Spiking Neural Networks
Yu, Kairong, Zhang, Tianqing, Xu, Qi, Pan, Gang, Wang, Hongwei
Spiking Neural Networks (SNNs) are increasingly recognized for their biological plausibility and energy efficiency, positioning them as strong alternatives to Artificial Neural Networks (ANNs) in neuromorphic computing applications. SNNs inherently process temporal information by leveraging the precise timing of spikes, but balancing temporal feature utilization with low energy consumption remains a challenge. In this work, we introduce Temporal Shift module for Spiking Neural Networks (TS-SNN), which incorporates a novel Temporal Shift (TS) module to integrate past, present, and future spike features within a single timestep via a simple yet effective shift operation. A residual combination method prevents information loss by integrating shifted and original features. The TS module is lightweight, requiring only one additional learnable parameter, and can be seamlessly integrated into existing architectures with minimal additional computational cost. TS-SNN achieves state-of-the-art performance on benchmarks like CIFAR-10 (96.72\%), CIFAR-100 (80.28\%), and ImageNet (70.61\%) with fewer timesteps, while maintaining low energy consumption. This work marks a significant step forward in developing efficient and accurate SNN architectures.
DRAN: A Distribution and Relation Adaptive Network for Spatio-temporal Forecasting
Zou, Xiaobei, Xiong, Luolin, Zhang, Kexuan, Alippi, Cesare, Tang, Yang
--Accurate predictions of spatio-temporal systems are crucial for tasks such as system management, control, and crisis prevention. However, the inherent time variance of many spatio-temporal systems poses challenges to achieving accurate predictions whenever stationarity is not granted. In order to address non-stationarity, we propose a D istribution and R elation A daptive N etwork (DRAN) capable of dynamically adapting to relation and distribution changes over time. While temporal normalization and de-normalization are frequently used techniques to adapt to distribution shifts, this operation is not suitable for the spatio-temporal context as temporal normalization scales the time series of nodes and possibly disrupts the spatial relations among nodes. In order to address this problem, a Spatial Factor Learner (SFL) module is developed that enables the normalization and de-normalization process in spatio-temporal systems. T o adapt to dynamic changes in spatial relationships among sensors, we propose a Dynamic-Static Fusion Learner (DSFL) module that effectively integrates features learned from both dynamic and static relations through an adaptive fusion ratio mechanism. Furthermore, we introduce a Stochastic Learner to capture the noisy components of spatio-temporal representations. Our approach outperforms state-of-the-art methods on weather prediction and traffic flow forecasting tasks. Experimental results show that our SFL efficiently preserves spatial relationships across various temporal normalization operations. Visualizations of the learned dynamic and static relations demonstrate that DSFL can capture both local and distant relationships between nodes. P A TIO-TEMPORAL systems, characterized by intricate spatial interactions among senors (nodes) and rich temporal dynamics, are prevalent in various fields e. g., physics [1], meteorology [2]-[4], power grids [5]-[7] and transportation [8], [9].