Goto

Collaborating Authors

 pacemaker


Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilities

Zhao, Weixiang, Sui, Xingyu, Guo, Jiahe, Hu, Yulin, Deng, Yang, Zhao, Yanyan, Qin, Bing, Che, Wanxiang, Chua, Tat-Seng, Liu, Ting

arXiv.org Artificial Intelligence

Recent advancements in Large Reasoning Models (LRMs), such as OpenAI's o1/o3 and DeepSeek-R1, have demonstrated remarkable performance in specialized reasoning tasks through human-like deliberative thinking and long chain-of-thought reasoning. However, our systematic evaluation across various model families (DeepSeek, Qwen, and LLaMA) and scales (7B to 671B) reveals that acquiring these deliberative reasoning capabilities significantly reduces the foundational capabilities of LRMs, including notable declines in helpfulness and harmlessness, alongside substantially increased inference costs. Importantly, we demonstrate that adaptive reasoning -- employing modes like Zero-Thinking, Less-Thinking, and Summary-Thinking -- can effectively alleviate these drawbacks. Our empirical insights underline the critical need for developing more versatile LRMs capable of dynamically allocating inference-time compute according to specific task characteristics.


PRISM: High-Resolution & Precise Counterfactual Medical Image Generation using Language-guided Stable Diffusion

Kumar, Amar, Kriz, Anita, Havaei, Mohammad, Arbel, Tal

arXiv.org Artificial Intelligence

Developing reliable and generalizable deep learning systems for medical imaging faces significant obstacles due to spurious correlations, data imbalances, and limited text annotations in datasets. Addressing these challenges requires architectures robust to the unique complexities posed by medical imaging data. The rapid advancements in vision-language foundation models within the natural image domain prompt the question of how they can be adapted for medical imaging tasks. In this work, we present PRISM, a framework that leverages foundation models to generate high-resolution, language-guided medical image counterfactuals using Stable Diffusion. Our approach demonstrates unprecedented precision in selectively modifying spurious correlations (the medical devices) and disease features, enabling the removal and addition of specific attributes while preserving other image characteristics. Through extensive evaluation, we show how PRISM advances counterfactual generation and enables the development of more robust downstream classifiers for clinically deployable solutions. To facilitate broader adoption and research, we make our code publicly available at https://github.com/Amarkr1/PRISM.


AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs

Liu, Xiaogeng, Li, Peiran, Suh, Edward, Vorobeychik, Yevgeniy, Mao, Zhuoqing, Jha, Somesh, McDaniel, Patrick, Sun, Huan, Li, Bo, Xiao, Chaowei

arXiv.org Artificial Intelligence

In this paper, we propose AutoDAN-Turbo, a black-box jailbreak method that can automatically discover as many jailbreak strategies as possible from scratch, without any human intervention or predefined scopes (e.g., specified candidate strategies), and use them for red-teaming. As a result, AutoDAN-Turbo can significantly outperform baseline methods, achieving a 74.3% higher average attack success rate on public benchmarks. Notably, AutoDAN-Turbo achieves an 88.5 attack success rate on GPT-4-1106-turbo. In addition, AutoDAN-Turbo is a unified framework that can incorporate existing human-designed jailbreak strategies in a plug-and-play manner. By integrating human-designed strategies, AutoDAN-Turbo can even achieve a higher attack success rate of 93.4 on GPT-4-1106-turbo.


Show, Don't Tell: Learning Reward Machines from Demonstrations for Reinforcement Learning-Based Cardiac Pacemaker Synthesis

Komp, John, Srinivas, Dananjay, Pacheco, Maria, Trivedi, Ashutosh

arXiv.org Artificial Intelligence

An (artificial cardiac) pacemaker is an implantable electronic device that sends electrical impulses to the heart to regulate the heartbeat. As the number of pacemaker users continues to rise, so does the demand for features with additional sensors, adaptability, and improved battery performance. Reinforcement learning (RL) has recently been proposed as a performant algorithm for creative design space exploration, adaptation, and statistical verification of cardiac pacemakers. The design of correct reward functions, expressed as a reward machine, is a key programming activity in this process. In 2007, Boston Scientific published a detailed description of their pacemaker specifications. This document has since formed the basis for several formal characterizations of pacemaker specifications using real-time automata and logic. However, because these translations are done manually, they are challenging to verify. Moreover, capturing requirements in automata or logic is notoriously difficult. We posit that it is significantly easier for domain experts, such as electrophysiologists, to observe and identify abnormalities in electrocardiograms that correspond to patient-pacemaker interactions. Therefore, we explore the possibility of learning correctness specifications from such labeled demonstrations in the form of a reward machine and training an RL agent to synthesize a cardiac pacemaker based on the resulting reward machine. We leverage advances in machine learning to extract signals from labeled demonstrations as reward machines using recurrent neural networks and transformer architectures. These reward machines are then used to design a simple pacemaker with RL. Finally, we validate the resulting pacemaker using properties extracted from the Boston Scientific document.


A Comprehensive Evaluation of Large Language Models on Benchmark Biomedical Text Processing Tasks

Jahan, Israt, Laskar, Md Tahmid Rahman, Peng, Chun, Huang, Jimmy

arXiv.org Artificial Intelligence

Recently, Large Language Models (LLM) have demonstrated impressive capability to solve a wide range of tasks. However, despite their success across various tasks, no prior work has investigated their capability in the biomedical domain yet. To this end, this paper aims to evaluate the performance of LLMs on benchmark biomedical tasks. For this purpose, we conduct a comprehensive evaluation of 4 popular LLMs in 6 diverse biomedical tasks across 26 datasets. To the best of our knowledge, this is the first work that conducts an extensive evaluation and comparison of various LLMs in the biomedical domain. Interestingly, we find based on our evaluation that in biomedical datasets that have smaller training sets, zero-shot LLMs even outperform the current state-of-the-art fine-tuned biomedical models. This suggests that pretraining on large text corpora makes LLMs quite specialized even in the biomedical domain. We also find that not a single LLM can outperform other LLMs in all tasks, with the performance of different LLMs may vary depending on the task. While their performance is still quite poor in comparison to the biomedical models that were fine-tuned on large training sets, our findings demonstrate that LLMs have the potential to be a valuable tool for various biomedical tasks that lack large annotated data.


Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers

Jahan, Israt, Laskar, Md Tahmid Rahman, Peng, Chun, Huang, Jimmy

arXiv.org Artificial Intelligence

ChatGPT is a large language model developed by OpenAI. Despite its impressive performance across various tasks, no prior work has investigated its capability in the biomedical domain yet. To this end, this paper aims to evaluate the performance of ChatGPT on various benchmark biomedical tasks, such as relation extraction, document classification, question answering, and summarization. To the best of our knowledge, this is the first work that conducts an extensive evaluation of ChatGPT in the biomedical domain. Interestingly, we find based on our evaluation that in biomedical datasets that have smaller training sets, zero-shot ChatGPT even outperforms the state-of-the-art fine-tuned generative transformer models, such as BioGPT and BioBART. This suggests that ChatGPT's pre-training on large text corpora makes it quite specialized even in the biomedical domain. Our findings demonstrate that ChatGPT has the potential to be a valuable tool for various tasks in the biomedical domain that lack large annotated data.


Three paraplegics are able to walk again thanks to a revolutionary spinal cord implant

Daily Mail - Science & tech

Three paraplegics have been able to walk, swim and cycle for the first time in years after being fitted with a revolutionary spinal cord implant. The device, which works by stimulating the region of the spinal cord that activates the trunk and leg muscles, was so effective that the patients were all able to stand and walk just one day after it was activated, researchers said. It is controlled by artificial intelligence (AI) software which reactivates neurons via a pacemaker inserted into a person's abdomen. Italian-born Michel Roccati, 29, had been confined to a wheelchair for over four years after losing the use of his legs in a motorbike crash, and was one of the three men to test the implant. He said: 'The first few steps were incredible -- a dream come true.'


THE FUTURE OF AI IN MEDICAL DEVICE DEVELOPMENT - Dataconomy

#artificialintelligence

The medical device remains a crucial component in improving the quality of life. Key players in the medical technology arena are going on the AI track to invent cutting-edge devices with high precision and automation. Expectations are high as the future of healthcare delivery is poised for steady growth with AI onboard. Picture a smart sensor device that estimates the possibility of a heart attack or an imaging system that uses algorithms to spot a brain tumor – these are real-world evidence of AI medical technologies in action. Application design teams harmonizing AI technologies into medical devices made these realities.


How AI Is Preventing Data Breaches In 3 Major Industries

#artificialintelligence

Artificial intelligence (AI) and machine learning are allowing both businesses and consumers to boost their cybersecurity to unprecedented levels. In a recent post, we examined six ways that AI is leading the way towards rock-solid information security. In case you missed it, read it here. For this article, we'll take a closer look at how AI and machine learning are letting three major industries safeguard their data better. In each of these sectors, websites not only contain a wealth of sensitive information but also have a high volume of visitors every day.


Opinion: When the mind gives out before the machine

#artificialintelligence

Margaret Munro is a Vancouver-based journalist. My father was preparing breakfast when his blood pressure dropped and he blacked out. Keeling over backward, he hit his head so hard it punched a hole in the wall. "Good thing I didn't hit the stud," he said in the emergency room at Nanaimo Regional General Hospital. He was stable, but the wobbly lines running across a monitor wired to his chest showed the critical state of his 92-year-old heart. It had been repaired before, but now doctors offered something more – a pacemaker to keep it beating steadily. Hundreds of thousands of Canadians have the ingenious devices, and many of them, like my father, likely had them implanted without considering all the implications. A cardiologist stayed late, after his scheduled surgeries, to wire Dad's heart with a German-designed Biotronik pacemaker that would restore a healthy heart rhythm. The procedure, done under local anesthetic, took less than 30 minutes.