AITopics | Generative AI

Collaborating Authors

Generative AI

News Overviews Instructional Materials AI-Alerts Classics

Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series

Guo, Zhengyi, Li, Jiatu, Tang, Wenpin, Yao, David D.

arXiv.org Machine LearningSep-5-2025

This paper develops dimension reduction techniques for accelerating diffusion model inference in the context of synthetic data generation. The idea is to integrate compressed sensing into diffusion models: (i) compress the data into a latent space, (ii) train a diffusion model in the latent space, and (iii) apply a compressed sensing algorithm to the samples generated in the latent space, facilitating the efficiency of both model training and inference. Under suitable sparsity assumptions on data, the proposed algorithm is proved to enjoy faster convergence by combining diffusion model inference with sparse recovery. As a byproduct, we obtain an optimal value for the latent space dimension. We also conduct numerical experiments on a range of datasets, including image data (handwritten digits, medical images, and climate data) and financial time series for stress testing. Key words: Complexity, compressed sensing, diffusion models, inference time, signal recovery, sparsity.

artificial intelligence, diffusion model, machine learning, (19 more...)

arXiv.org Machine Learning

2509.03898

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > India > NCT > New Delhi (0.04)

Genre: Research Report (0.64)

Industry:

Energy (0.68)
Banking & Finance > Trading (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.46)

Add feedback

Continuous Monitoring of Large-Scale Generative AI via Deterministic Knowledge Graph Structures

Gupta, Kishor Datta, Haque, Mohd Ariful, Ali, Hasmot, Kamal, Marufa, Alam, Syed Bahauddin, Rahman, Mohammad Ashiqur

arXiv.org Artificial IntelligenceSep-5-2025

Generative AI (GEN AI) models have revolutionized diverse application domains but present substantial challenges due to reliability concerns, including hallucinations, semantic drift, and inherent biases. These models typically operate as black-boxes, complicating transparent and objective evaluation. Current evaluation methods primarily depend on subjective human assessment, limiting scalability, transparency, and effectiveness. This research proposes a systematic methodology using deterministic and Large Language Model (LLM)-generated Knowledge Graphs (KGs) to continuously monitor and evaluate GEN AI reliability. We construct two parallel KGs: (i) a deterministic KG built using explicit rule-based methods, predefined ontologies, domain-specific dictionaries, and structured entity-relation extraction rules, and (ii) an LLM-generated KG dynamically derived from real-time textual data streams such as live news articles. Utilizing real-time news streams ensures authenticity, mitigates biases from repetitive training, and prevents adaptive LLMs from bypassing predefined benchmarks through feedback memorization. To quantify structural deviations and semantic discrepancies, we employ several established KG metrics, including Instantiated Class Ratio (ICR), Instantiated Property Ratio (IPR), and Class Instantiation (CI). An automated real-time monitoring framework continuously computes deviations between deterministic and LLM-generated KGs. By establishing dynamic anomaly thresholds based on historical structural metric distributions, our method proactively identifies and flags significant deviations, thus promptly detecting semantic anomalies or hallucinations. This structured, metric-driven comparison between deterministic and dynamically generated KGs delivers a robust and scalable evaluation framework.

apreprint-september5, large language model, machine learning, (22 more...)

arXiv.org Artificial Intelligence

2509.03857

Country: Asia > Thailand (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.35)

Add feedback

On Aligning Prediction Models with Clinical Experiential Learning: A Prostate Cancer Case Study

Vallon, Jacqueline J., Overman, William, Xu, Wanqiao, Panjwani, Neil, Ling, Xi, Vij, Sushmita, Bagshaw, Hilary P., Leppert, John T., Shah, Sumit, Sonn, Geoffrey, Srinivas, Sandy, Pollom, Erqi, Buyyounouski, Mark K., Bayati, Mohsen

arXiv.org Artificial IntelligenceSep-5-2025

Over the past decade, the use of machine learning (ML) models in healthcare applications has rapidly increased. Despite high performance, modern ML models do not always capture patterns the end user requires. For example, a model may predict a non-monotonically decreasing relationship between cancer stage and survival, keeping all other features fixed. In this paper, we present a reproducible framework for investigating this misalignment between model behavior and clinical experiential learning, focusing on the effects of underspecification of modern ML pipelines. In a prostate cancer outcome prediction case study, we first identify and address these inconsistencies by incorporating clinical knowledge, collected by a survey, via constraints into the ML model, and subsequently analyze the impact on model performance and behavior across degrees of underspecification. The approach shows that aligning the ML model with clinical experiential learning is possible without compromising performance. Motivated by recent literature in generative AI, we further examine the feasibility of a feedback-driven alignment approach in non-generative AI clinical risk prediction models through a randomized experiment with clinicians. Our findings illustrate that, by eliciting clinicians' model preferences using our proposed methodology, the larger the difference in how the constrained and unconstrained models make predictions for a patient, the more apparent the difference is in clinical interpretation.

artificial intelligence, clinician, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2509.04053

Country: North America > United States > California (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Oncology > Prostate Cancer (0.61)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.46)

Add feedback

Explainable Knowledge Graph Retrieval-Augmented Generation (KG-RAG) with KG-SMILE

Moghaddam, Zahra Zehtabi Sabeti, Dehghani, Zeinab, Rani, Maneeha, Aslansefat, Koorosh, Mishra, Bhupesh Kumar, Kureshi, Rameez Raja, Thakker, Dhavalkumar

arXiv.org Artificial IntelligenceSep-5-2025

Generative AI, such as Large Language Models (LLMs), has achieved impressive progress but still produces hallucinations and unverifiable claims, limiting reliability in sensitive domains. Retrieval-Augmented Generation (RAG) improves accuracy by grounding outputs in external knowledge, especially in domains like healthcare, where precision is vital. However, RAG remains opaque and essentially a black box, heavily dependent on data quality. We developed a method-agnostic, perturbation-based framework that provides token and component-level interoperability for Graph RAG using SMILE and named it as Knowledge-Graph (KG)-SMILE. By applying controlled perturbations, computing similarities, and training weighted linear surrogates, KG-SMILE identifies the graph entities and relations most influential to generated outputs, thereby making RAG more transparent. We evaluate KG-SMILE using comprehensive attribution metrics, including fidelity, faithfulness, consistency, stability, and accuracy. Our findings show that KG-SMILE produces stable, human-aligned explanations, demonstrating its capacity to balance model effectiveness with interpretability and thereby fostering greater transparency and trust in machine learning technologies.

arxiv preprint arxiv, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2509.03626

Country: Europe > United Kingdom (0.46)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.68)
Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

Mitigating Clinician Information Overload: Generative AI for Integrated EHR and RPM Data Analysis

Shetgaonkar, Ankit, Pradhan, Dipen, Arora, Lakshit, Girija, Sanjay Surendranath, Kapoor, Shashank, Raj, Aman

arXiv.org Artificial IntelligenceSep-5-2025

Generative Artificial Intelligence (GenAI), particularly Large Language Models (LLMs), offer powerful capabilities for interpreting the complex data landscape in healthcare. In this paper, we present a comprehensive overview of the capabilities, requirements and applications of GenAI for deriving clinical insights and improving clinical efficiency. We first provide some background on the forms and sources of patient data, namely real-time Remote Patient Monitoring (RPM) streams and traditional Electronic Health Records (EHRs). The sheer volume and heterogeneity of this combined data present significant challenges to clinicians and contribute to information overload. In addition, we explore the potential of LLM-powered applications for improving clinical efficiency. These applications can enhance navigation of longitudinal patient data and provide actionable clinical decision support through natural language dialogue. We discuss the opportunities this presents for streamlining clinician workflows and personalizing care, alongside critical challenges such as data integration complexity, ensuring data quality and RPM data reliability, maintaining patient privacy, validating AI outputs for clinical safety, mitigating bias, and ensuring clinical acceptance. We believe this work represents the first summarization of GenAI techniques for managing clinician data overload due to combined RPM / EHR data complexities.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/COMPSAC65507.2025.00284

2509.00073

Country: North America > United States (0.68)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.46)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.70)

Add feedback

Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling

Jadhav, Prachi, Jin, Hongwei, Deelman, Ewa, Balaprakash, Prasanna

arXiv.org Artificial IntelligenceSep-5-2025

High-Performance Computing (HPC) job scheduling involves balancing conflicting objectives such as minimizing makespan, reducing wait times, optimizing resource use, and ensuring fairness. Traditional methods, including heuristic-based, e.g., First-Come-First-Served (FJFS) and Shortest Job First (SJF), or intensive optimization techniques, often lack adaptability to dynamic workloads and, more importantly, cannot simultaneously optimize multiple objectives in HPC systems. To address this, we propose a novel Large Language Model (LLM)-based scheduler using a ReAct-style framework (Reason + Act), enabling iterative, interpretable decision-making. The system incorporates a scratchpad memory to track scheduling history and refine decisions via natural language feedback, while a constraint enforcement module ensures feasibility and safety. We evaluate our approach using OpenAI's O4-Mini and Anthropic's Claude 3.7 across seven real-world HPC workload scenarios, including heterogeneous mixes, bursty patterns, and adversarial cases etc. Comparisons against FCFS, SJF, and Google OR-Tools (on 10 to 100 jobs) reveal that LLM-based scheduling effectively balances multiple objectives while offering transparent reasoning through natural language traces. The method excels in constraint satisfaction and adapts to diverse workloads without domain-specific training. However, a trade-off between reasoning quality and computational overhead challenges real-time deployment. This work presents the first comprehensive study of reasoning-capable LLMs for HPC scheduling, demonstrating their potential to handle multiobjective optimization while highlighting limitations in computational efficiency. The findings provide insights into leveraging advanced language models for complex scheduling problems in dynamic HPC environments.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2506.02025

Country: North America > United States > California > Los Angeles County > Los Angeles (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Energy (0.93)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.35)

Add feedback

Evaluating Quality of Gaming Narratives Co-created with AI

Valdivia, Arturo, Burelli, Paolo

arXiv.org Artificial IntelligenceSep-5-2025

--This paper proposes a structured methodology to evaluate AI-generated game narratives, leveraging the Delphi study structure with a panel of narrative design experts. Our approach synthesizes story quality dimensions from literature and expert insights, mapping them into the Kano model framework to understand their impact on player satisfaction. The results can inform game developers on prioritizing quality aspects when co-creating game narratives with generative AI. While generative AI has surged into public and research consciousness following the release of systems like ChatGPT, video games have a longer tradition of using AI techniques to generate content that would otherwise be authored by human designers. This tradition is well established in the field of Procedural Content Generation, which encompasses a range of methods for algorithmically creating game elements such as levels, characters, quests, and storylines [1].

dimension, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/CoG64752.2025.11114354

2509.04239

Country:

Europe > Denmark (0.15)
North America > United States (0.14)

Genre: Research Report > New Finding (0.47)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.45)

Add feedback

Synthesia's AI clones are more expressive than ever. Soon they'll be able to talk back.

MIT Technology ReviewSep-4-2025, 10:05:33 GMT

When Synthesia launched in 2017, its primary purpose was to match AI versions of real human faces--for example, the former footballer David Beckham--with dubbed voices speaking in different languages. A few years later, in 2020, it started giving the companies that signed up for its services the opportunity to make professional-level presentation videos starring either AI versions of staff members or consenting actors. The avatars' body movements could be jerky and unnatural, their accents sometimes slipped, and the emotions indicated by their voices didn't always match their facial expressions. Now Synthesia's avatars have been updated with more natural mannerisms and movements, as well as expressive voices that better preserve the speaker's accent--making them appear more humanlike than ever before. For Synthesia's corporate clients, these avatars will make for slicker presenters of financial results, internal communications, or staff training videos.

artificial intelligence, machine learning, synthesia, (9 more...)

MIT Technology Review

Industry: Leisure & Entertainment > Sports > Soccer (0.56)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)
Information Technology > Artificial Intelligence > Vision (0.71)

Add feedback

Resonac creates 27-member consortium to pursue advanced chip developments

The Japan TimesSep-4-2025, 08:49:00 GMT

Resonac, a Japanese chip-materials maker, has announced the creation of Joint 3, which it describes as a consortium of 27 companies working together on semiconductor-related developments. "With next-generation technologies like generative AI and self-driving cars rapidly spreading, the technology required for semiconductors is getting more advanced and complex," Resonac CEO Hidehito Takahashi said Wednesday. Companies from a number of countries will be involved in Joint 3, which is led by Resonac. The list includes St Paul, Minnesota's 3M, Rolla, Missouri's Brewer Science, Sunnyvale, California's Synopsys and Singapore-headquartered, Hong Kong-listed ASMPT.

artificial intelligence, deep learning, machine learning, (3 more...)

The Japan Times

Country:

North America > United States > Missouri > Phelps County > Rolla (0.34)
North America > United States > Minnesota (0.34)
North America > United States > California > Santa Clara County > Sunnyvale (0.34)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

Charting the Future of Scholarly Knowledge with AI: A Community Perspective

Jiomekong, Azanzi, McGinty, Hande Küçük, Mills, Keith G., Oelen, Allard, Rajabi, Enayat, McElroy, Harry, Christou, Antrea, Saini, Anmol, Zebaze, Janice Anta, Kim, Hannah, Jacyszyn, Anna M., Auer, Sören

arXiv.org Artificial IntelligenceSep-4-2025

Scholarly work and communication encompass the entire system in which research and creative works are created, evaluated for quality, disseminated to the academic community and beyond, used, and preserved for future use. It includes formal publications, such as journal articles and books, as well as informal sharing through preprints, conference presentations, data sharing, and broader engagement with scholarly works and research outputs. Scholarly knowledge serves as the primary engine of progress, shaping our world and guiding our collective future. It forms the backbone of technological advancement, public health systems, and sustainable environmental practices. Obtained through rigorous methods of observation, experimentation, and validation, it is a reliable resource that helps societies solve complex problems and improve the quality of life by achieving sustainable development goals (SDGs) [6]. To be truly useful, scholarly knowledge must first be systematically extracted and organized. However, the scholarly community of today faces the problem of an overload of scientific papers in their respective domains. There is an increasing number of papers published every year (currently, 3 million), in addition to more than 200 million papers that have already been published . This gives rise to the research question: "How can we provide a reliable and living scholarly knowledge base that empowers researchers to query, synthesize, and analyze the vast body of scholarly knowledge?"

knowledge management, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2509.02581

Country:

Europe (0.93)
North America > United States > Ohio (0.28)

Genre: Research Report > Experimental Study > Negative Result (0.34)

Industry: Health & Medicine > Therapeutic Area (0.93)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(5 more...)

Add feedback