gothenburg
Cost-Aware Prediction (CAP): An LLM-Enhanced Machine Learning Pipeline and Decision Support System for Heart Failure Mortality Prediction
Yu, Yinan, Dippel, Falk, Lundberg, Christina E., Lindgren, Martin, Rosengren, Annika, Adiels, Martin, Sjöland, Helen
Objective: Machine learning (ML) predictive models are often developed without considering downstream value trade-offs and clinical interpretability. This paper introduces a cost-aware prediction (CAP) framework that combines cost-benefit analysis assisted by large language model (LLM) agents to communicate the trade-offs involved in applying ML predictions. Materials and Methods: We developed an ML model predicting 1-year mortality in patients with heart failure (N = 30,021, 22% mortality) to identify those eligible for home care. We then introduced clinical impact projection (CIP) curves to visualize important cost dimensions - quality of life and healthcare provider expenses, further divided into treatment and error costs, to assess the clinical consequences of predictions. Finally, we used four LLM agents to generate patient-specific descriptions. The system was evaluated by clinicians for its decision support value. Results: The eXtreme gradient boosting (XGB) model achieved the best performance, with an area under the receiver operating characteristic curve (AUROC) of 0.804 (95% confidence interval (CI) 0.792-0.816), area under the precision-recall curve (AUPRC) of 0.529 (95% CI 0.502-0.558) and a Brier score of 0.135 (95% CI 0.130-0.140). Discussion: The CIP cost curves provided a population-level overview of cost composition across decision thresholds, whereas LLM-generated cost-benefit analysis at individual patient-levels. The system was well received according to the evaluation by clinicians. However, feedback emphasizes the need to strengthen the technical accuracy for speculative tasks. Conclusion: CAP utilizes LLM agents to integrate ML classifier outcomes and cost-benefit analysis for more transparent and interpretable decision support.
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.05)
- North America > United States > Florida > Miami-Dade County > Miami (0.04)
- Europe > Sweden > Västerbotten County > Umeå (0.04)
LLM Company Policies and Policy Implications in Software Organizations
Khojah, Ranim, Mohamad, Mazen, Erlenhov, Linda, Neto, Francisco Gomes de Oliveira, Leitner, Philipp
Abstract--The risks associated with adopting large language model (LLM) chatbots in software organizations highlight the need for clear policies. We examine how 11 companies create these policies and the factors that influence them, aiming to help managers safely integrate chatbots into development workflows. In software organizations, the software product is gradually evolving to AI-powered software (AIware) with the use of AI, more specifically, large language models (LLMs) in the development process [2]. LLMs are increasingly seen as valuable tools for improving productivity, which motivated enterprises to adopt them [3]. However, these models have introduced risks and concerns that impact the organization, the software engineers, and the product. Integrating LLMs into software development raises challenges related to the quality and ownership of generated content [4], which complicates accountability and can affect product reliability . In addition, interactions with LLMs (e.g., through external APIs) may expose organizations to liability where developers unintentionally transmit sensitive data, resulting in legal repercussions [5].
- Europe > Austria > Vienna (0.14)
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.06)
- North America > United States (0.04)
- (4 more...)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
Nonlinear filtering based on density approximation and deep BSDE prediction
Bågmark, Kasper, Andersson, Adam, Larsson, Stig
A novel approximate Bayesian filter based on backward stochastic differential equations is introduced. It uses a nonlinear Feynman--Kac representation of the filtering problem and the approximation of an unnormalized filtering density using the well-known deep BSDE method and neural networks. The method is trained offline, which means that it can be applied online with new observations. A mixed a priori-a posteriori error bound is proved under an elliptic condition. The theoretical convergence rate is confirmed in two numerical examples.
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.05)
- Europe > Netherlands > South Holland > Dordrecht (0.04)
Learning Traffic Anomalies from Generative Models on Real-Time Observations
Giasemis, Fotis I., Sopasakis, Alexandros
Accurate detection of traffic anomalies is crucial for effective urban traffic management and congestion mitigation. We use the Spatiotemporal Generative Adversarial Network (STGAN) framework combining Graph Neural Networks and Long Short-Term Memory networks to capture complex spatial and temporal dependencies in traffic data. We apply STGAN to real-time, minute-by-minute observations from 42 traffic cameras across Gothenburg, Sweden, collected over several months in 2020. The images are processed to compute a flow metric representing vehicle density, which serves as input for the model. Training is conducted on data from April to November 2020, and validation is performed on a separate dataset from November 14 to 23, 2020. Our results demonstrate that the model effectively detects traffic anomalies with high precision and low false positive rates. The detected anomalies include camera signal interruptions, visual artifacts, and extreme weather conditions affecting traffic flow.
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.25)
- Europe > France > Île-de-France > Paris > Paris (0.04)
- Transportation (1.00)
- Consumer Products & Services > Travel (0.35)
Using generative AI to support standardization work -- the case of 3GPP
Staron, Miroslaw, Strom, Jonathan, Karlsson, Albin, Meding, Wilhelm
Standardization processes build upon consensus between partners, which depends on their ability to identify points of disagreement and resolving them. Large standardization organizations, like the 3GPP or ISO, rely on leaders of work packages who can correctly, and efficiently, identify disagreements, discuss them and reach a consensus. This task, however, is effort-, labor-intensive and costly. In this paper, we address the problem of identifying similarities, dissimilarities and discussion points using large language models. In a design science research study, we work with one of the organizations which leads several workgroups in the 3GPP standard. Our goal is to understand how well the language models can support the standardization process in becoming more cost-efficient, faster and more reliable. Our results show that generic models for text summarization correlate well with domain expert's and delegate's assessments (Pearson correlation between 0.66 and 0.98), but that there is a need for domain-specific models to provide better discussion materials for the standardization groups.
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.05)
- Asia > China (0.04)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
- Information Technology > Artificial Intelligence > Natural Language > Generation (0.41)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.41)
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.19)
- Europe > Germany > Saarland > Saarbrücken (0.04)
Welcome Your New AI Teammate: On Safety Analysis by Leashing Large Language Models
Nouri, Ali, Cabrero-Daniel, Beatriz, Törner, Fredrik, Sivencrona, Hȧkan, Berger, Christian
DevOps is a necessity in many industries, including the development of Autonomous Vehicles. In those settings, there are iterative activities that reduce the speed of SafetyOps cycles. One of these activities is "Hazard Analysis & Risk Assessment" (HARA), which is an essential step to start the safety requirements specification. As a potential approach to increase the speed of this step in SafetyOps, we have delved into the capabilities of Large Language Models (LLMs). Our objective is to systematically assess their potential for application in the field of safety engineering. To that end, we propose a framework to support a higher degree of automation of HARA with LLMs. Despite our endeavors to automate as much of the process as possible, expert review remains crucial to ensure the validity and correctness of the analysis results, with necessary modifications made accordingly.
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.06)
- Europe > Portugal > Lisbon > Lisbon (0.06)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > Albania > Durrës County > Durrës (0.04)
- Automobiles & Trucks (0.48)
- Information Technology (0.35)
Prompt Smells: An Omen for Undesirable Generative AI Outputs
Ronanki, Krishna, Cabrero-Daniel, Beatriz, Berger, Christian
Recent Generative Artificial Intelligence (GenAI) trends focus on various applications, including creating stories, illustrations, poems, articles, computer code, music compositions, and videos. Extrinsic hallucinations are a critical limitation of such GenAI, which can lead to significant challenges in achieving and maintaining the trustworthiness of GenAI. In this paper, we propose two new concepts that we believe will aid the research community in addressing limitations associated with the application of GenAI models. First, we propose a definition for the "desirability" of GenAI outputs and three factors which are observed to influence it. Second, drawing inspiration from Martin Fowler's code smells, we propose the concept of "prompt smells" and the adverse effects they are observed to have on the desirability of GenAI outputs. We expect our work will contribute to the ongoing conversation about the desirability of GenAI outputs and help advance the field in a meaningful way.
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.06)
- North America > United States > New York > New York County > New York City (0.05)
- Europe > Portugal > Lisbon > Lisbon (0.05)
- (4 more...)
A Shift In Artistic Practices through Artificial Intelligence
Tatar, Kıvanç, Ericson, Petter, Cotton, Kelsey, del Prado, Paola Torres Núñez, Batlle-Roca, Roser, Cabrero-Daniel, Beatriz, Ljungblad, Sara, Diapoulis, Georgios, Hussain, Jabbar
The explosion of content generated by Artificial Intelligence models has initiated a cultural shift in arts, music, and media, where roles are changing, values are shifting, and conventions are challenged. The readily available, vast dataset of the internet has created an environment for AI models to be trained on any content on the web. With AI models shared openly, and used by many, globally, how does this new paradigm shift challenge the status quo in artistic practices? What kind of changes will AI technology bring into music, arts, and new media?
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.07)
- Europe > Italy (0.05)
- Europe > Sweden > Stockholm > Stockholm (0.05)
- (18 more...)
- Information Technology > Security & Privacy (0.94)
- Media > Music (0.94)
Tools for ML Experiment Tracking and Management
We are a group of researchers from Sweden, Netherlands, and Germany and kindly invite you to our survey on "Machine Learning Experiment Management Tools." Such tools support practitioners performing machine learning (ML) or deep learning (DL) experiments, systematically managing all involved artifacts (scripts, datasets, hyperparameters, models, …). As a machine learning practitioner, we kindly invite you to participate. We also invite you to forward this invitation to other colleagues who might be interested in this survey as well. Our survey elicits information on the management tools practitioners adopt, their perceived benefits, challenges, and limitations.
- Europe > Germany (0.31)
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.18)
- Europe > Netherlands > Gelderland > Nijmegen (0.08)