Goto

Collaborating Authors

 Generative AI


Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series

arXiv.org Machine Learning

This paper develops dimension reduction techniques for accelerating diffusion model inference in the context of synthetic data generation. The idea is to integrate compressed sensing into diffusion models: (i) compress the data into a latent space, (ii) train a diffusion model in the latent space, and (iii) apply a compressed sensing algorithm to the samples generated in the latent space, facilitating the efficiency of both model training and inference. Under suitable sparsity assumptions on data, the proposed algorithm is proved to enjoy faster convergence by combining diffusion model inference with sparse recovery. As a byproduct, we obtain an optimal value for the latent space dimension. We also conduct numerical experiments on a range of datasets, including image data (handwritten digits, medical images, and climate data) and financial time series for stress testing. Key words: Complexity, compressed sensing, diffusion models, inference time, signal recovery, sparsity.


Continuous Monitoring of Large-Scale Generative AI via Deterministic Knowledge Graph Structures

arXiv.org Artificial Intelligence

Generative AI (GEN AI) models have revolutionized diverse application domains but present substantial challenges due to reliability concerns, including hallucinations, semantic drift, and inherent biases. These models typically operate as black-boxes, complicating transparent and objective evaluation. Current evaluation methods primarily depend on subjective human assessment, limiting scalability, transparency, and effectiveness. This research proposes a systematic methodology using deterministic and Large Language Model (LLM)-generated Knowledge Graphs (KGs) to continuously monitor and evaluate GEN AI reliability. We construct two parallel KGs: (i) a deterministic KG built using explicit rule-based methods, predefined ontologies, domain-specific dictionaries, and structured entity-relation extraction rules, and (ii) an LLM-generated KG dynamically derived from real-time textual data streams such as live news articles. Utilizing real-time news streams ensures authenticity, mitigates biases from repetitive training, and prevents adaptive LLMs from bypassing predefined benchmarks through feedback memorization. To quantify structural deviations and semantic discrepancies, we employ several established KG metrics, including Instantiated Class Ratio (ICR), Instantiated Property Ratio (IPR), and Class Instantiation (CI). An automated real-time monitoring framework continuously computes deviations between deterministic and LLM-generated KGs. By establishing dynamic anomaly thresholds based on historical structural metric distributions, our method proactively identifies and flags significant deviations, thus promptly detecting semantic anomalies or hallucinations. This structured, metric-driven comparison between deterministic and dynamically generated KGs delivers a robust and scalable evaluation framework.


On Aligning Prediction Models with Clinical Experiential Learning: A Prostate Cancer Case Study

arXiv.org Artificial Intelligence

Over the past decade, the use of machine learning (ML) models in healthcare applications has rapidly increased. Despite high performance, modern ML models do not always capture patterns the end user requires. For example, a model may predict a non-monotonically decreasing relationship between cancer stage and survival, keeping all other features fixed. In this paper, we present a reproducible framework for investigating this misalignment between model behavior and clinical experiential learning, focusing on the effects of underspecification of modern ML pipelines. In a prostate cancer outcome prediction case study, we first identify and address these inconsistencies by incorporating clinical knowledge, collected by a survey, via constraints into the ML model, and subsequently analyze the impact on model performance and behavior across degrees of underspecification. The approach shows that aligning the ML model with clinical experiential learning is possible without compromising performance. Motivated by recent literature in generative AI, we further examine the feasibility of a feedback-driven alignment approach in non-generative AI clinical risk prediction models through a randomized experiment with clinicians. Our findings illustrate that, by eliciting clinicians' model preferences using our proposed methodology, the larger the difference in how the constrained and unconstrained models make predictions for a patient, the more apparent the difference is in clinical interpretation.


Explainable Knowledge Graph Retrieval-Augmented Generation (KG-RAG) with KG-SMILE

arXiv.org Artificial Intelligence

Generative AI, such as Large Language Models (LLMs), has achieved impressive progress but still produces hallucinations and unverifiable claims, limiting reliability in sensitive domains. Retrieval-Augmented Generation (RAG) improves accuracy by grounding outputs in external knowledge, especially in domains like healthcare, where precision is vital. However, RAG remains opaque and essentially a black box, heavily dependent on data quality. We developed a method-agnostic, perturbation-based framework that provides token and component-level interoperability for Graph RAG using SMILE and named it as Knowledge-Graph (KG)-SMILE. By applying controlled perturbations, computing similarities, and training weighted linear surrogates, KG-SMILE identifies the graph entities and relations most influential to generated outputs, thereby making RAG more transparent. We evaluate KG-SMILE using comprehensive attribution metrics, including fidelity, faithfulness, consistency, stability, and accuracy. Our findings show that KG-SMILE produces stable, human-aligned explanations, demonstrating its capacity to balance model effectiveness with interpretability and thereby fostering greater transparency and trust in machine learning technologies.


Mitigating Clinician Information Overload: Generative AI for Integrated EHR and RPM Data Analysis

arXiv.org Artificial Intelligence

Generative Artificial Intelligence (GenAI), particularly Large Language Models (LLMs), offer powerful capabilities for interpreting the complex data landscape in healthcare. In this paper, we present a comprehensive overview of the capabilities, requirements and applications of GenAI for deriving clinical insights and improving clinical efficiency. We first provide some background on the forms and sources of patient data, namely real-time Remote Patient Monitoring (RPM) streams and traditional Electronic Health Records (EHRs). The sheer volume and heterogeneity of this combined data present significant challenges to clinicians and contribute to information overload. In addition, we explore the potential of LLM-powered applications for improving clinical efficiency. These applications can enhance navigation of longitudinal patient data and provide actionable clinical decision support through natural language dialogue. We discuss the opportunities this presents for streamlining clinician workflows and personalizing care, alongside critical challenges such as data integration complexity, ensuring data quality and RPM data reliability, maintaining patient privacy, validating AI outputs for clinical safety, mitigating bias, and ensuring clinical acceptance. We believe this work represents the first summarization of GenAI techniques for managing clinician data overload due to combined RPM / EHR data complexities.


Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling

arXiv.org Artificial Intelligence

High-Performance Computing (HPC) job scheduling involves balancing conflicting objectives such as minimizing makespan, reducing wait times, optimizing resource use, and ensuring fairness. Traditional methods, including heuristic-based, e.g., First-Come-First-Served (FJFS) and Shortest Job First (SJF), or intensive optimization techniques, often lack adaptability to dynamic workloads and, more importantly, cannot simultaneously optimize multiple objectives in HPC systems. To address this, we propose a novel Large Language Model (LLM)-based scheduler using a ReAct-style framework (Reason + Act), enabling iterative, interpretable decision-making. The system incorporates a scratchpad memory to track scheduling history and refine decisions via natural language feedback, while a constraint enforcement module ensures feasibility and safety. We evaluate our approach using OpenAI's O4-Mini and Anthropic's Claude 3.7 across seven real-world HPC workload scenarios, including heterogeneous mixes, bursty patterns, and adversarial cases etc. Comparisons against FCFS, SJF, and Google OR-Tools (on 10 to 100 jobs) reveal that LLM-based scheduling effectively balances multiple objectives while offering transparent reasoning through natural language traces. The method excels in constraint satisfaction and adapts to diverse workloads without domain-specific training. However, a trade-off between reasoning quality and computational overhead challenges real-time deployment. This work presents the first comprehensive study of reasoning-capable LLMs for HPC scheduling, demonstrating their potential to handle multiobjective optimization while highlighting limitations in computational efficiency. The findings provide insights into leveraging advanced language models for complex scheduling problems in dynamic HPC environments.


Evaluating Quality of Gaming Narratives Co-created with AI

arXiv.org Artificial Intelligence

--This paper proposes a structured methodology to evaluate AI-generated game narratives, leveraging the Delphi study structure with a panel of narrative design experts. Our approach synthesizes story quality dimensions from literature and expert insights, mapping them into the Kano model framework to understand their impact on player satisfaction. The results can inform game developers on prioritizing quality aspects when co-creating game narratives with generative AI. While generative AI has surged into public and research consciousness following the release of systems like ChatGPT, video games have a longer tradition of using AI techniques to generate content that would otherwise be authored by human designers. This tradition is well established in the field of Procedural Content Generation, which encompasses a range of methods for algorithmically creating game elements such as levels, characters, quests, and storylines [1].


Synthesia's AI clones are more expressive than ever. Soon they'll be able to talk back.

MIT Technology Review

When Synthesia launched in 2017, its primary purpose was to match AI versions of real human faces--for example, the former footballer David Beckham--with dubbed voices speaking in different languages. A few years later, in 2020, it started giving the companies that signed up for its services the opportunity to make professional-level presentation videos starring either AI versions of staff members or consenting actors. The avatars' body movements could be jerky and unnatural, their accents sometimes slipped, and the emotions indicated by their voices didn't always match their facial expressions. Now Synthesia's avatars have been updated with more natural mannerisms and movements, as well as expressive voices that better preserve the speaker's accent--making them appear more humanlike than ever before. For Synthesia's corporate clients, these avatars will make for slicker presenters of financial results, internal communications, or staff training videos.


Resonac creates 27-member consortium to pursue advanced chip developments

The Japan Times

Resonac, a Japanese chip-materials maker, has announced the creation of Joint 3, which it describes as a consortium of 27 companies working together on semiconductor-related developments. "With next-generation technologies like generative AI and self-driving cars rapidly spreading, the technology required for semiconductors is getting more advanced and complex," Resonac CEO Hidehito Takahashi said Wednesday. Companies from a number of countries will be involved in Joint 3, which is led by Resonac. The list includes St Paul, Minnesota's 3M, Rolla, Missouri's Brewer Science, Sunnyvale, California's Synopsys and Singapore-headquartered, Hong Kong-listed ASMPT.


Charting the Future of Scholarly Knowledge with AI: A Community Perspective

arXiv.org Artificial Intelligence

Scholarly work and communication encompass the entire system in which research and creative works are created, evaluated for quality, disseminated to the academic community and beyond, used, and preserved for future use. It includes formal publications, such as journal articles and books, as well as informal sharing through preprints, conference presentations, data sharing, and broader engagement with scholarly works and research outputs. Scholarly knowledge serves as the primary engine of progress, shaping our world and guiding our collective future. It forms the backbone of technological advancement, public health systems, and sustainable environmental practices. Obtained through rigorous methods of observation, experimentation, and validation, it is a reliable resource that helps societies solve complex problems and improve the quality of life by achieving sustainable development goals (SDGs) [6]. To be truly useful, scholarly knowledge must first be systematically extracted and organized. However, the scholarly community of today faces the problem of an overload of scientific papers in their respective domains. There is an increasing number of papers published every year (currently, 3 million), in addition to more than 200 million papers that have already been published . This gives rise to the research question: "How can we provide a reliable and living scholarly knowledge base that empowers researchers to query, synthesize, and analyze the vast body of scholarly knowledge?"