

Pic2Diagnosis: A Method for Diagnosis of Cardiovascular Diseases from the Printed ECG Pictures

Büyüksolak, Oğuzhan, Öksüz, İlkay

arXiv.org Artificial Intelligence

The electrocardiogram (ECG) is a vital tool for diagnosing heart diseases. However, many diagnostic approaches rely on outdated datasets and traditional stepwise algorithms with limited accuracy. This study presents a method for direct cardiovascular disease (CVD) diagnosis from ECG images, eliminating the need for digitization. The proposed approach utilizes a two-step curriculum learning framework, beginning with the pre-training of a classification model on segmentation masks, followed by fine-tuning on grayscale, inverted ECG images. Robustness is further enhanced through an ensemble of three models with averaged outputs, achieving an AUC of 0.9534 and an F1 score of 0.7801 on the BHF ECG Challenge dataset, outperforming individual models. By effectively handling real-world artifacts and simplifying the diagnostic process, this method offers a reliable solution for automated CVD diagnosis, particularly in resource-limited settings where printed or scanned ECG images are commonly used. Such an automated procedure enables rapid and accurate diagnosis, which is critical for timely intervention in CVD cases that often demand urgent care.
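The ensemble step described above (three models with averaged outputs) amounts to soft voting over per-class probabilities. A minimal sketch in plain Python, with hypothetical probability vectors standing in for the three fine-tuned classifiers:

```python
def ensemble_average(model_outputs):
    """Average per-class probabilities from several models (soft voting)."""
    n_models = len(model_outputs)
    n_classes = len(model_outputs[0])
    return [
        sum(m[c] for m in model_outputs) / n_models
        for c in range(n_classes)
    ]

# Hypothetical per-class probabilities from three classifiers
# (the paper's actual models and class set are not reproduced here)
preds = [
    [0.70, 0.30],
    [0.60, 0.40],
    [0.80, 0.20],
]
avg = ensemble_average(preds)
predicted_class = max(range(len(avg)), key=lambda c: avg[c])
```

Averaging before the argmax lets a confident minority model pull the decision, which is why ensembles of this kind tend to be more robust to per-model failure cases than majority voting on hard labels.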


Beyond MedQA: Towards Real-world Clinical Decision Making in the Era of LLMs

Xiao, Yunpeng, Yang, Carl, Mai, Mark, Hu, Xiao, Shu, Kai

arXiv.org Artificial Intelligence

Large language models (LLMs) show promise for clinical use, and they are often evaluated on datasets such as MedQA. However, many such datasets rely on simplified question answering (Q&A) that underrepresents real-world clinical decision-making. Motivated by this, we propose a unifying paradigm that characterizes clinical decision-making tasks along two dimensions: Clinical Backgrounds and Clinical Questions. As the background and questions approach the real clinical environment, the difficulty increases. We summarize the settings of existing datasets and benchmarks along these two dimensions. We then review methods to address clinical decision-making, including training-time and test-time techniques, and summarize when they help. Next, we extend evaluation beyond accuracy to include efficiency and explainability. Finally, we highlight open challenges. Our paradigm clarifies assumptions, standardizes comparisons, and guides the development of clinically meaningful LLMs.


A Deep Learning Framework for Real-Time Image Processing in Medical Diagnostics: Enhancing Accuracy and Speed in Clinical Applications

Filvantorkaman, Melika, Torkaman, Maral Filvan

arXiv.org Artificial Intelligence

Medical imaging plays a vital role in modern diagnostics; however, interpreting high-resolution radiological data remains time-consuming and susceptible to variability among clinicians. Traditional image processing techniques often lack the precision, robustness, and speed required for real-time clinical use. To overcome these limitations, this paper introduces a deep learning framework for real-time medical image analysis designed to enhance diagnostic accuracy and computational efficiency across multiple imaging modalities, including X-ray, CT, and MRI. The proposed system integrates advanced neural network architectures such as U-Net, EfficientNet, and Transformer-based models with real-time optimization strategies including model pruning, quantization, and GPU acceleration. The framework enables flexible deployment on edge devices, local servers, and cloud infrastructures, ensuring seamless interoperability with clinical systems such as PACS and EHR. Experimental evaluations on public benchmark datasets demonstrate state-of-the-art performance, achieving classification accuracies above 92%, segmentation Dice scores exceeding 91%, and inference times below 80 milliseconds. Furthermore, visual explanation tools such as Grad-CAM and segmentation overlays enhance transparency and clinical interpretability. These results indicate that the proposed framework can substantially accelerate diagnostic workflows, reduce clinician workload, and support trustworthy AI integration in time-critical healthcare environments.
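One of the real-time optimization strategies named above, quantization, can be illustrated with a minimal symmetric int8 post-training scheme in plain Python (the framework's actual quantization pipeline is not specified in the abstract; this sketch only shows the basic scale-and-round idea on a hypothetical weight list):

```python
def quantize_int8(weights):
    """Symmetric int8 quantization: map floats to [-128, 127] via a shared scale."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs else 1.0
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from int8 values."""
    return [v * scale for v in q]

w = [0.5, -1.27, 0.02]        # hypothetical layer weights
q, s = quantize_int8(w)        # int8 codes plus one float scale
w_hat = dequantize(q, s)       # approximate reconstruction
```

Storing one byte per weight plus a single scale per tensor is what shrinks models roughly 4x versus float32 and, on hardware with int8 kernels, is a common route to the sub-100 ms inference times the paper reports.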


Adaptive Reasoning and Acting in Medical Language Agents

Dutta, Abhishek, Hsiao, Yen-Che

arXiv.org Artificial Intelligence

This paper presents an innovative large language model (LLM) agent framework for enhancing diagnostic accuracy in simulated clinical environments using the AgentClinic benchmark. The proposed automatic correction enables doctor agents to iteratively refine their reasoning and actions following incorrect diagnoses, fostering improved decision-making over time. Experiments show that adaptive LLM-based doctor agents achieve correct diagnoses through dynamic interactions with simulated patients. The evaluations highlight the capacity of autonomous agents to adapt and improve in complex medical scenarios. Future enhancements will focus on refining the algorithm and expanding its applicability across a wider range of tasks and different large language models.


AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments

Schmidgall, Samuel, Ziaei, Rojin, Harris, Carl, Reis, Eduardo, Jopling, Jeffrey, Moor, Michael

arXiv.org Artificial Intelligence

Diagnosing and managing a patient is a complex, sequential decision-making process that requires physicians to obtain information -- such as which tests to perform -- and to act upon it. Recent advances in artificial intelligence (AI) and large language models (LLMs) promise to profoundly impact clinical care. However, current evaluation schemes over-rely on static medical question-answering benchmarks, falling short on the interactive decision-making required in real-life clinical work. Here, we present AgentClinic: a multimodal benchmark to evaluate LLMs in their ability to operate as agents in simulated clinical environments. In our benchmark, the doctor agent must uncover the patient's diagnosis through dialogue and active data collection. We present two open medical agent benchmarks: a multimodal image and dialogue environment, AgentClinic-NEJM, and a dialogue-only environment, AgentClinic-MedQA. We embed cognitive and implicit biases in both patient and doctor agents to emulate realistic interactions between biased agents. We find that introducing bias leads to large reductions in the diagnostic accuracy of the doctor agents, as well as reduced compliance, confidence, and follow-up consultation willingness in patient agents. Evaluating a suite of state-of-the-art LLMs, we find that several models that excel in benchmarks like MedQA perform poorly in AgentClinic-MedQA. We find that the LLM used in the patient agent is an important factor for performance in the AgentClinic benchmark. We show that both too few and too many interactions reduce diagnostic accuracy in doctor agents. The code and data for this work are publicly available at https://AgentClinic.github.io.


Artificial intelligence projects in healthcare: 10 practical tips for success in a clinical environment

#artificialintelligence

There is much discussion concerning ‘digital transformation’ in healthcare and the potential of artificial intelligence (AI) in healthcare systems. Yet it remains rare to find AI solutions deployed in routine healthcare settings. This is in part due to the numerous challenges inherent in delivering an AI project in a clinical environment. In this article, several UK healthcare professionals and academics reflect on the challenges they have faced in building AI solutions using routinely collected healthcare data. These personal reflections are summarised as 10 practical tips. In our experience, these are essential considerations for an AI healthcare project to succeed. They are organised into four phases: conceptualisation, data management, AI application and clinical deployment. There is a focus on conceptualisation, reflecting our view that initial set-up is vital to success. We hope that our personal experiences will provide useful insights to others looking to improve patient care through optimal data use. No data are available to share.


FedDICE: A ransomware spread detection in a distributed integrated clinical environment using federated learning and SDN based mitigation

Thapa, Chandra, Karmakar, Kallol Krishna, Celdran, Alberto Huertas, Camtepe, Seyit, Varadharajan, Vijay, Nepal, Surya

arXiv.org Artificial Intelligence

An integrated clinical environment (ICE) enables the connection and coordination of the internet of medical things around the care of patients in hospitals. However, ransomware attacks and their spread on hospital infrastructures, including ICE, are rising. Often the adversaries target multiple hospitals with the same ransomware attacks. These attacks are detected by using machine learning algorithms. The challenge is devising anti-ransomware learning mechanisms and services under the following conditions: (1) provide immunity to other hospitals if one of them suffers an attack, (2) hospitals are usually distributed over geographical locations, and (3) direct data sharing is avoided due to privacy concerns. In this regard, this paper presents a federated distributed integrated clinical environment, aka FedDICE. FedDICE integrates federated learning (FL), which is privacy-preserving learning, into an SDN-oriented security architecture to enable collaborative learning, detection, and mitigation of ransomware attacks. We demonstrate the importance of FedDICE in a collaborative environment with up to four hospitals and four popular ransomware families, namely WannaCry, Petya, BadRabbit, and PowerGhost. Our results show that in both IID and non-IID data setups, FedDICE achieves the performance of the centralized baseline, which requires direct data sharing for detection. However, as a trade-off for data privacy, FedDICE incurs overhead in anti-ransomware model training, e.g., 28x for the logistic regression model. Besides, FedDICE utilizes SDN's dynamic network programmability feature to remove infected devices from the ICE.
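The federated learning step underlying a setup like FedDICE is typically federated averaging (FedAvg): each hospital trains locally, then a coordinator combines parameters weighted by local dataset size, so raw data never leaves a site. A minimal sketch in plain Python, with hypothetical logistic-regression coefficients from three hospitals (the paper's actual models and aggregation details are not reproduced here):

```python
def fed_avg(client_weights, client_sizes):
    """Combine client model parameters by a dataset-size-weighted average."""
    total = sum(client_sizes)
    n_params = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(n_params)
    ]

# Hypothetical local coefficients from three hospitals
clients = [[0.2, -0.1], [0.4, 0.0], [0.3, 0.1]]
sizes = [100, 200, 100]  # local training-set sizes
global_w = fed_avg(clients, sizes)
```

Only the parameter vectors and counts cross hospital boundaries, which is what addresses condition (3) above; the training overhead the paper reports comes from iterating this local-train/aggregate cycle over many communication rounds.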


AI outperforms humans in creating cancer treatments, but do doctors trust it?

#artificialintelligence

The impact of deploying Artificial Intelligence (AI) for radiation cancer therapy in a real-world clinical setting has been tested by Princess Margaret researchers in a unique study involving physicians and their patients. A team of researchers directly compared physician evaluations of radiation treatments generated by an AI machine learning (ML) algorithm to conventional radiation treatments generated by humans. They found that in the majority of the 100 patients studied, treatments generated using ML were deemed clinically acceptable by physicians. Overall, 89% of ML-generated treatments were considered clinically acceptable, and 72% were selected over conventional human-generated treatments in head-to-head comparisons. Moreover, the ML radiation treatment process was 60% faster than the conventional human-driven process, reducing the overall time from 118 hours to 47 hours.


Clinical management of sepsis can be improved by artificial intelligence: yes

#artificialintelligence

The management of sepsis is a highly complex, multifaceted challenge that remains the realm of highly skilled and trained human experts. But as medical applications of artificial intelligence continue to pour in, it is becoming obvious that some of these decisions could soon be left to machines that could be dubbed "intelligent", improving clinical practice and patient outcomes [1]. Indeed, most of the tasks involved in the clinical management of sepsis (early recognition, selection of antibiotic therapy, haemodynamic optimisation, etc.) could be individually performed or optimised by dedicated algorithms. Most of what we call "artificial intelligence" is in fact machine learning--a set of computer tools intended to generate new knowledge from data [1]. Machine learning includes three categories of techniques: supervised (which uses labelled data to build a prediction model, for example for prognostication), unsupervised (which discovers patterns in data and generates clusters of subjects that share common characteristics) and reinforcement learning (where a sequential decision process is modelled and optimised). Below, I have selected a few significant applications that I consider the most likely to land in the clinical environment in the near future, either because of their robustness or their potential.