Overview
Bregman-Hausdorff divergence: strengthening the connections between computational geometry and machine learning
Pham, Tuyen, Kouřimská, Hana Dal Poz, Wagner, Hubert
The purpose of this paper is twofold. On a technical side, we propose an extension of the Hausdorff distance from metric spaces to spaces equipped with asymmetric distance measures. Specifically, we focus on the family of Bregman divergences, which includes the popular Kullback--Leibler divergence (also known as relative entropy). As a proof of concept, we use the resulting Bregman--Hausdorff divergence to compare two collections of probabilistic predictions produced by different machine learning models trained using the relative entropy loss. The algorithms we propose are surprisingly efficient even for large inputs with hundreds of dimensions. In addition to the introduction of this technical concept, we provide a survey. It outlines the basics of Bregman geometry, as well as computational geometry algorithms. We focus on algorithms that are compatible with this geometry and are relevant for machine learning.
Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning
Chaubey, Ashutosh, Guan, Xulang, Soleymani, Mohammad
The human face plays a central role in social communication, necessitating the use of performant computer vision tools for human-centered applications. W e propose Face-LLaVA, a multimodal large language model for face-centered, in-context learning, including facial expression and attribute recognition. Additionally, Face-LLaVA is able to generate natural language descriptions that can be used for reasoning. Leveraging existing visual databases, we first developed FaceInstruct-1M, a face-centered database for instruction tuning MLLMs for face processing. W e then developed a novel face-specific visual encoder powered by Face-Region Guided Cross-Attention that integrates face geometry with local visual features. W e evaluated the proposed method across nine different datasets and five different face processing tasks, including facial expression recognition, action unit detection, facial attribute detection, age estimation and deepfake detection. Face-LLaVA achieves superior results compared to existing open-source MLLMs and competitive performance compared to commercial solutions. Our model output also receives a higher reasoning rating by GPT under a zero-shot setting across all the tasks. Both our dataset and model wil be released at https://face-llava.github.io/ to support future advancements in social AI and foundational vision-language research.
Large Language Model (LLM) for Software Security: Code Analysis, Malware Analysis, Reverse Engineering
Jelodar, Hamed, Bai, Samita, Hamedi, Parisa, Mohammadian, Hesamodin, Razavi-Far, Roozbeh, Ghorbani, Ali
Large Language Models (LLMs) have recently emerged as powerful tools in cybersecurity, offering advanced capabilities in malware detection, generation, and real-time monitoring. Numerous studies have explored their application in cybersecurity, demonstrating their effectiveness in identifying novel malware variants, analyzing malicious code structures, and enhancing automated threat analysis. Several transformer-based architectures and LLM-driven models have been proposed to improve malware analysis, leveraging semantic and structural insights to recognize malicious intent more accurately. This study presents a comprehensive review of LLM-based approaches in malware code analysis, summarizing recent advancements, trends, and methodologies. We examine notable scholarly works to map the research landscape, identify key challenges, and highlight emerging innovations in LLM-driven cybersecurity. Additionally, we emphasize the role of static analysis in malware detection, introduce notable datasets and specialized LLM models, and discuss essential datasets supporting automated malware research. This study serves as a valuable resource for researchers and cybersecurity professionals, offering insights into LLM-powered malware detection and defence strategies while outlining future directions for strengthening cybersecurity resilience.
Personalized Recommendation Models in Federated Settings: A Survey
Zhang, Chunxu, Long, Guodong, Zhang, Zijian, Li, Zhiwei, Zhang, Honglei, Yang, Qiang, Yang, Bo
--Federated recommender systems (FedRecSys) have emerged as a pivotal solution for privacy-aware recommendations, balancing growing demands for data security and personalized experiences. Current research efforts predominantly concentrate on adapting traditional recommendation architectures to federated environments, optimizing communication efficiency, and mitigating security vulnerabilities. However, user personalization modeling, which is essential for capturing heterogeneous preferences in this decentralized and non-IID data setting, remains underexplored. This survey addresses this gap by systematically exploring personalization in FedRecSys, charting its evolution from centralized paradigms to federated-specific innovations. We establish a foundational definition of person-alization in a federated setting, emphasizing personalized models as a critical solution for capturing fine-grained user preferences. The work critically examines the technical hurdles of building personalized FedRecSys and synthesizes promising methodologies to meet these challenges. As the first consolidated study in this domain, this survey serves as both a technical reference and a catalyst for advancing personalized FedRecSys research. A. Motivation Federated recommender systems (FedRecSys) [1]-[6] have burgeoned as a remarkable paradigm to promote privacy-preserving recommendation services. Besides, the distributed optimization pattern enables service providers to effectively harness the vast computational resources of various devices. This balance between performance and privacy protection makes FedRecSys an attractive research avenue with significant potential for edge AI development. Current research in FedRecSys primarily derives from the perspectives of RecSys and FL views. Chunxu Zhang, Zijian Zhang and Bo Y ang are with the College of Computer Science and Technology, Jilin University, Jilin, China (e-mail: zhangchunxu@jlu.edu.cn, Guodong Long and Zhiwei li are with the Australian AI Institute, Faculty of Engineering and IT, University of Technology Sydney, Sydney, Australia (e-mail: guodong.long@uts.edu.au, Honglei Zhang is with the School of Computer Science and Technology, Beijing Jiaotong University, Beijing, China (e-mail: hon-glei.zhang@bjtu.edu.cn). Qiang Y ang is Professor Emeritus at the Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, and the Chief AI Officer of WeBank, Shenzhen, China (e-mail: qyang@cse.ust.hk). Personalization technique comparison in centralized and federated RecSys. The colorful module denotes the user-specific parameters and the gray module represents the user-shared parameters. FL's ability to collaboratively train multiple models across different devices naturally supports the development of personalized models, making it easier to tailor recommendations to individual user needs.
A Multimedia Analytics Model for the Foundation Model Era
Worring, Marcel, Zahálka, Jan, Elzen, Stef van den, Fischer, Maximilian T., Keim, Daniel A.
The rapid advances in Foundation Models and agentic Artificial Intelligence are transforming multimedia analytics by enabling richer, more sophisticated interactions between humans and analytical systems. Existing conceptual models for visual and multimedia analytics, however, do not adequately capture the complexity introduced by these powerful AI paradigms. To bridge this gap, we propose a comprehensive multimedia analytics model specifically designed for the foundation model era. Building upon established frameworks from visual analytics, multimedia analytics, knowledge generation, analytic task definition, mixed-initiative guidance, and human-in-the-loop reinforcement learning, our model emphasizes integrated human-AI teaming based on visual analytics agents from both technical and conceptual perspectives. Central to the model is a seamless, yet explicitly separable, interaction channel between expert users and semi-autonomous analytical processes, ensuring continuous alignment between user intent and AI behavior. The model addresses practical challenges in sensitive domains such as intelligence analysis, investigative journalism, and other fields handling complex, high-stakes data. We illustrate through detailed case studies how our model facilitates deeper understanding and targeted improvement of multimedia analytics solutions. By explicitly capturing how expert users can optimally interact with and guide AI-powered multimedia analytics systems, our conceptual framework sets a clear direction for system design, comparison, and future research.
Aqara has a clever solution for a vexing Matter problem
The new Matter standard is getting better at helping Alexa, Apple HomeKit, Google Home, and Samsung SmartThings play nice with each other, but it often does so at the expense of finer-grained features. Some Matter-enabled smart lights, for example, can be turned on or off via Matter or change their color, but Matter controllers might not be able to access their lighting scenes or advanced animation modes. Likewise, smart home manufacturer Aqara found some of its hardware functionality hamstrung by Matter's limitations, such as the lack of Matter support for facial recognition (which might arrive once Matter finally works with security cameras), or for the fall-detection capabilities of its motion sensors. One option would be to wait for the Matter specification to catch up and add that functionality--which could take a while, given the slow pace of Matter specification updates. Instead, Aqara built its own workaround, which involves taking various Aqara scenes and "signals" and turning them into virtual sensors that Matter understands.
Optimizing Power Grid Topologies with Reinforcement Learning: A Survey of Methods and Challenges
van der Sar, Erica, Zocca, Alessandro, Bhulai, Sandjai
Power grid operation is becoming increasingly complex due to the rising integration of renewable energy sources and the need for more adaptive control strategies. Reinforcement Learning (RL) has emerged as a promising approach to power network control (PNC), offering the potential to enhance decision-making in dynamic and uncertain environments. The Learning To Run a Power Network (L2RPN) competitions have played a key role in accelerating research by providing standardized benchmarks and problem formulations, leading to rapid advancements in RL-based methods. This survey provides a comprehensive and structured overview of RL applications for power grid topology optimization, categorizing existing techniques, highlighting key design choices, and identifying gaps in current research. Additionally, we present a comparative numerical study evaluating the impact of commonly applied RL-based methods, offering insights into their practical effectiveness. By consolidating existing research and outlining open challenges, this survey aims to provide a foundation for future advancements in RL-driven power grid optimization.
A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models
Xie, Zhouhang, Wu, Junda, Shen, Yiran, Xia, Yu, Li, Xintong, Chang, Aaron, Rossi, Ryan, Kumar, Sachin, Majumder, Bodhisattwa Prasad, Shang, Jingbo, Ammanabrolu, Prithviraj, McAuley, Julian
Personalized preference alignment for large language models (LLMs), the process of tailoring LLMs to individual users' preferences, is an emerging research direction spanning the area of NLP and personalization. In this survey, we present an analysis of works on personalized alignment and modeling for LLMs. We introduce a taxonomy of preference alignment techniques, including training time, inference time, and additionally, user-modeling based methods. We provide analysis and discussion on the strengths and limitations of each group of techniques and then cover evaluation, benchmarks, as well as open problems in the field.
Open Problems and a Hypothetical Path Forward in LLM Knowledge Paradigms
Ye, Xiaotian, Zhang, Mengqi, Wu, Shu
Knowledge is fundamental to the overall capabilities of Large Language Models (LLMs). The knowledge paradigm of a model, which dictates how it encodes and utilizes knowledge, significantly affects its performance. Despite the continuous development of LLMs under existing knowledge paradigms, issues within these frameworks continue to constrain model potential. This blog post highlight three critical open problems limiting model capabilities: (1) challenges in knowledge updating for LLMs, (2) the failure of reverse knowledge generalization (the reversal curse), and (3) conflicts in internal knowledge. We review recent progress made in addressing these issues and discuss potential general solutions. Based on observations in these areas, we propose a hypothetical paradigm based on Contextual Knowledge Scaling, and further outline implementation pathways that remain feasible within contemporary techniques. Evidence suggests this approach holds potential to address current shortcomings, serving as our vision for future model paradigms. This blog post aims to provide researchers with a brief overview of progress in LLM knowledge systems, while provide inspiration for the development of next-generation model architectures.
AI, Help Me Think$\unicode{x2014}$but for Myself: Assisting People in Complex Decision-Making by Providing Different Kinds of Cognitive Support
Reicherts, Leon, Zhang, Zelun Tony, von Oswald, Elisabeth, Liu, Yuanting, Rogers, Yvonne, Hassib, Mariam
How can we design AI tools that effectively support human decision-making by complementing and enhancing users' reasoning processes? Common recommendation-centric approaches face challenges such as inappropriate reliance or a lack of integration with users' decision-making processes. Here, we explore an alternative interaction model in which the AI outputs build upon users' own decision-making rationales. We compare this approach, which we call ExtendAI, with a recommendation-based AI. Participants in our mixed-methods user study interacted with both AIs as part of an investment decision-making task. We found that the AIs had different impacts, with ExtendAI integrating better into the decision-making process and people's own thinking and leading to slightly better outcomes. RecommendAI was able to provide more novel insights while requiring less cognitive effort. We discuss the implications of these and other findings along with three tensions of AI-assisted decision-making which our study revealed.