Overview
TSRM: A Lightweight Temporal Feature Encoding Architecture for Time Series Forecasting and Imputation
Leppich, Robert, Stenger, Michael, Grillmeyer, Daniel, Borst, Vanessa, Kounev, Samuel
We introduce a temporal feature encoding architecture called Time Series Representation Model (TSRM) for multivariate time series forecasting and imputation. The architecture is structured around CNN-based representation layers, each dedicated to an independent representation learning task and designed to capture diverse temporal patterns, followed by an attention-based feature extraction layer and a merge layer, designed to aggregate extracted features. The architecture is fundamentally based on a configuration that is inspired by a Transformer encoder, with self-attention mechanisms at its core. The TSRM architecture outperforms state-of-the-art approaches on most of the seven established benchmark datasets considered in our empirical evaluation for both forecasting and imputation tasks. At the same time, it significantly reduces complexity in the form of learnable parameters. The source code is available at https://github.com/RobertLeppich/TSRM.
Generative to Agentic AI: Survey, Conceptualization, and Challenges
Agentic Artificial Intelligence (AI) builds upon Generative AI (GenAI). It constitutes the next major step in the evolution of AI with much stronger reasoning and interaction capabilities that enable more autonomous behavior to tackle complex tasks. Since the initial release of ChatGPT (3.5), Generative AI has seen widespread adoption, giving users firsthand experience. However, the distinction between Agentic AI and GenAI remains less well understood. To address this gap, our survey is structured in two parts. In the first part, we compare GenAI and Agentic AI using existing literature, discussing their key characteristics, how Agentic AI remedies limitations of GenAI, and the major steps in GenAI's evolution toward Agentic AI. This section is intended for a broad audience, including academics in both social sciences and engineering, as well as industry professionals. It provides the necessary insights to comprehend novel applications that are possible with Agentic AI but not with GenAI. In the second part, we deep dive into novel aspects of Agentic AI, including recent developments and practical concerns such as defining agents. Finally, we discuss several challenges that could serve as a future research agenda, while cautioning against risks that can emerge when exceeding human intelligence.
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Cao, Yixin, Hong, Shibo, Li, Xinze, Ying, Jiahao, Ma, Yubo, Liang, Haiyuan, Liu, Yantao, Yao, Zijun, Wang, Xiaozhi, Huang, Dan, Zhang, Wenxuan, Huang, Lifu, Chen, Muhao, Hou, Lei, Sun, Qianru, Ma, Xingjun, Wu, Zuxuan, Kan, Min-Yen, Lo, David, Zhang, Qi, Ji, Heng, Jiang, Jing, Li, Juanzi, Sun, Aixin, Huang, Xuanjing, Chua, Tat-Seng, Jiang, Yu-Gang
Large Language Models (LLMs) are advancing at an amazing speed and have become indispensable across academia, industry, and daily applications. To keep pace with the status quo, this survey probes the core challenges that the rise of LLMs poses for evaluation. We identify and analyze two pivotal transitions: (i) from task-specific to capability-based evaluation, which reorganizes benchmarks around core competencies such as knowledge, reasoning, instruction following, multi-modal understanding, and safety; and (ii) from manual to automated evaluation, encompassing dynamic dataset curation and "LLM-as-a-judge" scoring. Yet, even with these transitions, a crucial obstacle persists: the evaluation generalization issue. Bounded test sets cannot scale alongside models whose abilities grow seemingly without limit. We will dissect this issue, along with the core challenges of the above two transitions, from the perspectives of methods, datasets, evaluators, and metrics. Due to the fast evolving of this field, we will maintain a living GitHub repository (links are in each section) to crowd-source updates and corrections, and warmly invite contributors and collaborators.
Severity Classification of Chronic Obstructive Pulmonary Disease in Intensive Care Units: A Semi-Supervised Approach Using MIMIC-III Dataset
Shojaei, Akram, Delrobaei, Mehdi
Chronic obstructive pulmonary disease (COPD) is a major global health concern, with accurate severity assessment crucial for effective management, especially in intensive care units (ICUs). This study presents a novel approach to COPD sever - ity classification using machine learning algorithms applied to the MIMIC - III dataset. Our work presents a new application of the MIMIC - III dataset and con - tributes to the growing field of artificial intelligence in critical care medicine. We developed a model to classify COPD severity based on available ICU parameters, including blood gas measurements and vital signs. Our methodology incorpo - rated semi - supervised learning techniques to leverage unlabeled data, enhancing model robustness. A random forest classifier demonstrated superior performance, achieving 92.51% accuracy and 0.98 ROC AUC distinguishing between mild - to - moderate and severe COPD cases. This approach offers a practical, accurate, and accessible tool for rapid COPD severity assessment in ICU settings, poten - tially improving clinical decision - making and patient outcomes. Future research should focus on external validation and integration into clinical decision support systems to enhance COPD management in the ICUs.
I Can Hear You Coming: RF Sensing for Uncooperative Satellite Evasion
Mehlman, Cameron, Falco, Gregory
--This work presents a novel method for leveraging intercepted Radio Frequency (RF) signals to inform a constrained Reinforcement Learning (RL) policy for robust control of a satellite operating in contested environments. Uncooperative satellite engagements with nation-state actors prompts the need for enhanced maneuverability and agility on-orbit. However, robust, autonomous and rapid adversary avoidance capabilities for the space environment is seldom studied. Further, the capability constrained nature of many space vehicles does not afford robust space situational awareness capabilities that can be used for well informed maneuvering. We present a "Cat & Mouse" system for training optimal adversary avoidance algorithms using RL. We propose the novel approach of utilizing intercepted radio frequency communication and dynamic spacecraft state as multi-modal input that could inform paths for a mouse to outmaneuver the cat satellite. Given the current ubiquitous use of RF communications, our proposed system can be applicable to a diverse array of satellites. In addition to providing a comprehensive framework for training and implementing a constrained RL policy capable of providing control for robust adversary avoidance, we also explore several optimization based methods for adversarial avoidance. These methods were then tested on real-world data obtained from the Space Surveillance Network (SSN) to analyze the benefits and limitations of different avoidance methods. In March of 2025, Chinese satellites exhibited dog-fighting capabilities [1], following years of both Russian [2] and Chinese [3] satellites approaching dangerously close to US satellites in geosynchronous orbit. Such uncooperative activity prompts the need for satellite agility and maneuverability which can be facilitated through edge-based autonomy. To achieve this, appropriate sensing would be required to properly characterize the contested environment. Not all satellites have precise space domain awareness (SDA) sensing suites onboard, despite having powerful buses and flight controllers that can facilitate autonomous operations. We propose leveraging an uncooperative space vehicle's communication systems as a means to evaluate safe flight control policies to carefully navigate contested domains in situations where support from the ground is not feasible.
The use of Multi-domain Electroencephalogram Representations in the building of Models based on Convolutional and Recurrent Neural Networks for Epilepsy Detection
Anghinoni, Luiz Antonio Nicolau, Denardin, Gustavo Weber, Gertrudes, Jadson Castro, Casanova, Dalcimar, Oliva, Jefferson Tales
This important role has led researchers to develop various methods for gathering information about brain activity, resulting in significant advancements in medical signal and image acquisition systems [2]. Among these advancements are functional neuroimaging techniques, such as functional magnetic resonance imaging, magnetoencephalography (MEG), positron emission tomography (PET), and electroencephalography [2]. Among these techniques, electroencephalography stands out due to three key advantages: it is a non-invasive method that allows data generation from any individual, has excellent temporal resolution--effectively capturing events occurring within milliseconds--and is relatively cost-effective compared to other examinations [3]. Electroencephalography monitors the brain's electrical activity through electrodes placed on the scalp, and the resulting data, known as the electroencephalogram (EEG), consists of a time series of electrical potentials that reflect neurological activity [4]. The EEG signal is widely used in the field of neuroscience and has the potential to advance brain-computer interfaces [5], facilitate emotion detection [6], enable classification of sleep stages [7] and help clinicians and researchers in identifying brain diseases, including but not limited to Alzheimer's disease [8], dyslexia [9], schizophrenia [10], Creutzfeldt-Jakob disease [11] and cognitive impairment [12]. Epilepsy, for example, is a neurological disorder characterized by abnormal brain activity that can lead to seizures, unusual behaviors, or even loss of consciousness.
Testing Individual Fairness in Graph Neural Networks
The biases in artificial intelligence (AI) models can lead to automated decision-making processes that discriminate against groups and/or individuals based on sensitive properties such as gender and race. While there are many studies on diagnosing and mitigating biases in various AI models, there is little research on individual fairness in Graph Neural Networks (GNNs). Unlike traditional models, which treat data features independently and overlook their inter-relationships, GNNs are designed to capture graph-based structure where nodes are interconnected. This relational approach enables GNNs to model complex dependencies, but it also means that biases can propagate through these connections, complicating the detection and mitigation of individual fairness violations. This PhD project aims to develop a testing framework to assess and ensure individual fairness in GNNs. It first systematically reviews the literature on individual fairness, categorizing existing approaches to define, measure, test, and mitigate model biases, creating a taxonomy of individual fairness. Next, the project will develop a framework for testing and ensuring fairness in GNNs by adapting and extending current fairness testing and mitigation techniques. The framework will be evaluated through industrial case studies, focusing on graph-based large language models.
Event-Based Eye Tracking. 2025 Event-based Vision Workshop
Chen, Qinyu, Gao, Chang, Liu, Min, Perrone, Daniele, Pei, Yan Ru, Wang, Zuowen, Zou, Zhuo, Tan, Shihang, Han, Tao, Lu, Guorui, Xu, Zhen, Ding, Junyuan, Wang, Ziteng, Wu, Zongwei, Han, Han, Wu, Yuliang, Chen, Jinze, Zhai, Wei, Cao, Yang, Zha, Zheng-jun, Bandara, Nuwan, Kandappu, Thivya, Misra, Archan, Lin, Xiaopeng, Huang, Hongxiang, Ren, Hongwei, Cheng, Bojun, Truong, Hoang M., Ly, Vinh-Thuan, Tran, Huy G., Nguyen, Thuan-Phat, Doan, Tram T.
This survey serves as a review for the 2025 Event-Based Eye Tracking Challenge organized as part of the 2025 CVPR event-based vision workshop. This challenge focuses on the task of predicting the pupil center by processing event camera recorded eye movement. W e review and summarize the innovative methods from teams rank the top in the challenge to advance future event-based eye tracking research. In each method, accuracy, model size, and number of operations are reported. In this survey, we also discuss event-based eye tracking from the perspective of hardware design.
The Rise of Small Language Models in Healthcare: A Comprehensive Survey
Garg, Muskan, Raza, Shaina, Rayana, Shebuti, Liu, Xingyi, Sohn, Sunghwan
Despite substantial progress in healthcare applications driven by large language models (LLMs), growing concerns around data privacy, and limited resources; the small language models (SLMs) offer a scalable and clinically viable solution for efficient performance in resource-constrained environments for next-generation healthcare informatics. Our comprehensive survey presents a taxonomic framework to identify and categorize them for healthcare professionals and informaticians. The timeline of healthcare SLM contributions establishes a foundational framework for analyzing models across three dimensions: NLP tasks, stakeholder roles, and the continuum of care. We present a taxonomic framework to identify the architectural foundations for building models from scratch; adapting SLMs to clinical precision through prompting, instruction fine-tuning, and reasoning; and accessibility and sustainability through compression techniques. Our primary objective is to offer a comprehensive survey for healthcare professionals, introducing recent innovations in model optimization and equipping them with curated resources to support future research and development in the field. Aiming to showcase the groundbreaking advancements in SLMs for healthcare, we present a comprehensive compilation of experimental results across widely studied NLP tasks in healthcare to highlight the transformative potential of SLMs in healthcare. The updated repository is available at Github
Three-Factor Learning in Spiking Neural Networks: An Overview of Methods and Trends from a Machine Learning Perspective
Mazurek, Szymon, Caputa, Jakub, Argasiński, Jan K., Wielgosz, Maciej
Three-factor learning rules in Spiking Neural Networks (SNNs) have emerged as a crucial extension to traditional Hebbian learning and Spike-Timing-Dependent Plasticity (STDP), incorporating neuromodulatory signals to improve adaptation and learning efficiency. These mechanisms enhance biological plausibility and facilitate improved credit assignment in artificial neural systems. This paper takes a view on this topic from a machine learning perspective, providing an overview of recent advances in three-factor learning, discusses theoretical foundations, algorithmic implementations, and their relevance to reinforcement learning and neuromorphic computing. In addition, we explore interdisciplinary approaches, scalability challenges, and potential applications in robotics, cognitive modeling, and AI systems. Finally, we highlight key research gaps and propose future directions for bridging the gap between neuroscience and artificial intelligence.