Goto

Collaborating Authors

 larsen


ALIGNS: Unlocking nomological networks in psychological measurement through a large language model

Larsen, Kai R., Yan, Sen, Mueller, Roland M., Sang, Lan, Rönkkö, Mikko, Starzl, Ravi, Edmondson, Donald

arXiv.org Artificial Intelligence

Psychological measurement is critical to many disciplines. Despite advances in measurement, building nomological networks, theoretical maps of how concepts and measures relate to establish validity, remains a challenge 70 years after Cronbach and Meehl proposed them as fundamental to validation. This limitation has practical consequences: clinical trials may fail to detect treatment effects, and public policy may target the wrong outcomes. We introduce Analysis of Latent Indicators to Generate Nomological Structures (ALIGNS), a large language model-based system trained with validated questionnaire measures. ALIGNS provides three comprehensive nomological networks containing over 550,000 indicators across psychology, medicine, social policy, and other fields. This represents the first application of large language models to solve a foundational problem in measurement validation. We report classification accuracy tests used to develop the model, as well as three evaluations. In the first evaluation, the widely used NIH PROMIS anxiety and depression instruments are shown to converge into a single dimension of emotional distress. The second evaluation examines child temperament measures and identifies four potential dimensions not captured by current frameworks, and questions one existing dimension. The third evaluation, an applicability check, engages expert psychometricians who assess the system's importance, accessibility, and suitability. ALIGNS is freely available at nomologicalnetwork.org, complementing traditional validation methods with large-scale nomological analysis.


Uppaal Coshy: Automatic Synthesis of Compact Shields for Hybrid Systems

Brorholt, Asger Horn, Høeg-Petersen, Andreas Holck, Jensen, Peter Gjøl, Larsen, Kim Guldstrand, Mikučionis, Marius, Schilling, Christian, Wąsowski, Andrzej

arXiv.org Artificial Intelligence

We present Uppaal Coshy, a tool for automatic synthesis of a safety strategy -- or shield -- for Markov decision processes over continuous state spaces and complex hybrid dynamics. The general methodology is to partition the state space and then solve a two-player safety game, which entails a number of algorithmically hard problems such as reachability for hybrid systems. The general philosophy of Uppaal Coshy is to approximate hard-to-obtain solutions using simulations. Our implementation is fully automatic and supports the expressive formalism of Uppaal models, which encompass stochastic hybrid automata. The precision of our partition-based approach benefits from using finer grids, which however are not efficient to store. We include an algorithm called Caap to efficiently compute a compact representation of a shield in the form of a decision tree, which yields significant reductions.


Bayesian optimization of atomic structures with prior probabilities from universal interatomic potentials

Lyngby, Peder, Larsen, Casper, Jacobsen, Karsten Wedel

arXiv.org Artificial Intelligence

The optimization of atomic structures plays a pivotal role in understanding and designing materials with desired properties. However, conventional methods often struggle with the formidable task of navigating the vast potential energy surface, especially in high-dimensional spaces with numerous local minima. Recent advancements in machine learning-driven surrogate models offer a promising avenue for alleviating this computational burden. In this study, we propose a novel approach that combines the strengths of universal machine learning potentials with a Bayesian approach of the GOFEE/BEACON framework. By leveraging the comprehensive chemical knowledge encoded in pretrained universal machine learning potentials as a prior estimate of energy and forces, we enable the Gaussian process to focus solely on capturing the intricate nuances of the potential energy surface. We demonstrate the efficacy of our approach through comparative analyses across diverse systems, including periodic bulk materials, surface structures, and a cluster.


Efficient Shield Synthesis via State-Space Transformation

Brorholt, Asger Horn, Høeg-Petersen, Andreas Holck, Larsen, Kim Guldstrand, Schilling, Christian

arXiv.org Artificial Intelligence

We consider the problem of synthesizing safety strategies for control systems, also known as shields. Since the state space is infinite, shields are typically computed over a finite-state abstraction, with the most common abstraction being a rectangular grid. However, for many systems, such a grid does not align well with the safety property or the system dynamics. That is why a coarse grid is rarely sufficient, but a fine grid is typically computationally infeasible to obtain. In this paper, we show that appropriate state-space transformations can still allow to use a coarse grid at almost no computational overhead. We demonstrate in three case studies that our transformation-based synthesis outperforms a standard synthesis by several orders of magnitude. In the first two case studies, we use domain knowledge to select a suitable transformation. In the third case study, we instead report on results in engineering a transformation without domain knowledge.


A Digital Twin prototype for traffic sign recognition of a learning-enabled autonomous vehicle

AbdElSalam, Mohamed, Ali, Loai, Bensalem, Saddek, He, Weicheng, Katsaros, Panagiotis, Kekatos, Nikolaos, Peled, Doron, Temperekidis, Anastasios, Wu, Changshun

arXiv.org Artificial Intelligence

In this paper, we present a novel digital twin prototype for a learning-enabled self-driving vehicle. The primary objective of this digital twin is to perform traffic sign recognition and lane keeping. The digital twin architecture relies on co-simulation and uses the Functional Mock-up Interface and SystemC Transaction Level Modeling standards. The digital twin consists of four clients, i) a vehicle model that is designed in Amesim tool, ii) an environment model developed in Prescan, iii) a lane-keeping controller designed in Robot Operating System, and iv) a perception and speed control module developed in the formal modeling language of BIP (Behavior, Interaction, Priority). These clients interface with the digital twin platform, PAVE360-Veloce System Interconnect (PAVE360-VSI). PAVE360-VSI acts as the co-simulation orchestrator and is responsible for synchronization, interconnection, and data exchange through a server. The server establishes connections among the different clients and also ensures adherence to the Ethernet protocol. We conclude with illustrative digital twin simulations and recommendations for future work.


Shielded Reinforcement Learning for Hybrid Systems

Brorholt, Asger Horn, Jensen, Peter Gjøl, Larsen, Kim Guldstrand, Lorber, Florian, Schilling, Christian

arXiv.org Artificial Intelligence

Safe and optimal controller synthesis for switched-controlled hybrid systems, which combine differential equations and discrete changes of the system's state, is known to be intricately hard. Reinforcement learning has been leveraged to construct near-optimal controllers, but their behavior is not guaranteed to be safe, even when it is encouraged by reward engineering. One way of imposing safety to a learned controller is to use a shield, which is correct by design. However, obtaining a shield for non-linear and hybrid environments is itself intractable. In this paper, we propose the construction of a shield using the so-called barbaric method, where an approximate finite representation of an underlying partition-based two-player safety game is extracted via systematically picked samples of the true transition function. While hard safety guarantees are out of reach, we experimentally demonstrate strong statistical safety guarantees with a prototype implementation and Uppaal Stratego. Furthermore, we study the impact of the synthesized shield when applied as either a pre-shield (applied before learning a controller) or a post-shield (only applied after learning a controller). We experimentally demonstrate superiority of the pre-shielding approach. We apply our technique on a range of case studies, including two industrial examples, and further study post-optimization of the post-shielding approach.


MM Algorithms to Estimate Parameters in Continuous-time Markov Chains

Bacci, Giovanni, Ingólfsdóttir, Anna, Larsen, Kim G., Reynouard, Raphaël

arXiv.org Artificial Intelligence

Continuous-time Markov chains (CTMCs) are popular modeling formalism that constitutes the underlying semantics for real-time probabilistic systems such as queuing networks, stochastic process algebras, and calculi for systems biology. Prism and Storm are popular model checking tools that provide a number of powerful analysis techniques for CTMCs. These tools accept models expressed as the parallel composition of a number of modules interacting with each other. The outcome of the analysis is strongly dependent on the parameter values used in the model which govern the timing and probability of events of the resulting CTMC. However, for some applications, parameter values have to be empirically estimated from partially-observable executions. In this work, we address the problem of estimating parameter values of CTMCs expressed as Prism models from a number of partially-observable executions. We introduce the class parametric CTMCs -- CTMCs where transition rates are polynomial functions over a set of parameters -- as an abstraction of CTMCs covering a large class of Prism models. Then, building on a theory of algorithms known by the initials MM, for minorization-maximization, we present iterative maximum likelihood estimation algorithms for parametric CTMCs covering two learning scenarios: when both state-labels and dwell times are observable, or just state-labels are. We conclude by illustrating the use of our technique in a simple but non-trivial case study: the analysis of the spread of COVID-19 in presence of lockdown countermeasures.


Top challenge to internet health is AI power disparity and harm, Mozilla says

#artificialintelligence

The top challenge for the health of the internet is the power disparity between who benefits from AI and who is harmed by AI, Mozilla's new 2022 Internet Health reveals. Once again, this new report puts AI under the spotlight for how companies and governments use the technology. Mozilla's report scrutinized the nature of the AI-driven world citing real examples from different countries. TechRepublic spoke to Solana Larsen, Mozilla's Internet Health report editor, to shed light on the concept of "Responsible AI from the Start," black box AI, the future of regulations and how some AI projects lead by example. Larsen explains that AI systems should be built from the start considering ethics and responsibility, not tacked on at a later date when the harms begin to emerge.


Top challenge to internet health is AI power disparity and harm, Mozilla says

#artificialintelligence

The top challenge for the health of the internet is the power disparity between who benefits from AI and who is harmed by AI, Mozilla's new 2022 Internet Health reveals. Once again, this new report puts AI under the spotlight for how companies and governments use the technology. Mozilla's report scrutinized the nature of the AI-driven world citing real examples from different countries. TechRepublic spoke to Solana Larsen, Mozilla's Internet Health report editor, to shed light on the concept of "Responsible AI from the Start," black box AI, the future of regulations and how some AI projects lead by example. Larsen explains that AI systems should be built from the start considering ethics and responsibility, not tacked on at a later date when the harms begin to emerge.


Overhead-MNIST: Machine Learning Baselines for Image Classification

Larsen, Erik, Noever, David, MacVittie, Korey, Lilly, John

arXiv.org Artificial Intelligence

Twenty-three machine learning algorithms were trained then scored to establish baseline comparison metrics and to select an image classification algorithm worthy of embedding into mission-critical satellite imaging systems. The Overhead-MNIST dataset is a collection of satellite images similar in style to the ubiquitous MNIST hand-written digits found in the machine learning literature. The CatBoost classifier, Light Gradient Boosting Machine, and Extreme Gradient Boosting models produced the highest accuracies, Areas Under the Curve (AUC), and F1 scores in a PyCaret general comparison. Separate evaluations showed that a deep convolutional architecture was the most promising. We present results for the overall best performing algorithm as a baseline for edge deployability and future performance improvement: a convolutional neural network (CNN) scoring 0.965 categorical accuracy on unseen test data.