Generative AI
From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine
Buess, Lukas, Keicher, Matthias, Navab, Nassir, Maier, Andreas, Arasteh, Soroosh Tayebi
Generative artificial intelligence (AI) models, such as diffusion models and OpenAI's ChatGPT, are transforming medicine by enhancing diagnostic accuracy and automating clinical workflows. The field has advanced rapidly, evolving from text-only large language models for tasks such as clinical documentation and decision support to multimodal AI systems capable of integrating diverse data modalities, including imaging, text, and structured data, within a single model. The diverse landscape of these technologies, along with rising interest, highlights the need for a comprehensive review of their applications and potential. This scoping review explores the evolution of multimodal AI, highlighting its methods, applications, datasets, and evaluation in clinical settings. Adhering to PRISMA-ScR guidelines, we systematically queried PubMed, IEEE Xplore, and Web of Science, prioritizing recent studies published up to the end of 2024. After rigorous screening, 144 papers were included, revealing key trends and challenges in this dynamic field. Our findings underscore a shift from unimodal to multimodal approaches, driving innovations in diagnostic support, medical report generation, drug discovery, and conversational AI. However, critical challenges remain, including the integration of heterogeneous data types, improving model interpretability, addressing ethical concerns, and validating AI systems in real-world clinical settings. This review summarizes the current state of the art, identifies critical gaps, and provides insights to guide the development of scalable, trustworthy, and clinically impactful multimodal AI solutions in healthcare.
OpenAI will offer free ChatGPT users unlimited access to GPT-5
OpenAI's upcoming GPT-5 release will integrate its o3 reasoning model and be available to free users, CEO Sam Altman revealed in a roadmap he shared on X. He said the company is also working to simplify how users interact with ChatGPT. "We want AI to'just work' for you; we realize how complicated our model and product offerings have gotten," Altman wrote. "We hate the model picker as much as you do and want to return to magic unified intelligence." In its current iteration, forcing ChatGPT to use a specific model, such as o3-mini, involves either tapping the "Reason" button in the prompt bar or one of the options present in the model picker, which appears after the chatbot answers a question.
The Dirty Truth Behind Musk and Altman's Mud Fight
Sign up for the Slatest to get the most insightful analysis, criticism, and advice out there, delivered to your inbox daily. On Monday, Elon Musk and a group of investors made an unsolicited offer to buy ChatGPT parent company OpenAI for 97.4 billion. This ticked off OpenAI CEO Sam Altman not only because the company isn't for sale, but because the offer is insultingly low. OpenAI is reportedly in talks to raise new money in a funding round led by SoftBank at a 300 billion valuation, which would make it the most valuable privately held company in the world. Altman's return fire was equally petty, with him refusing via tweet before offering to buy Musk's social media company X for a decimal-sliding 9.74 billion.
Adobe Firefly muscles into AI video–here's what it looks like
Adobe said today that it's bringing AI-generated video, aka the Firefly Video Model, to Adobe Premiere Pro plus its Firefly generative art service. Unlike its generative AI image capabilities, however, it won't be free. AI-generated video has been available for months. In December, OpenAI released Sora, its ability to craft AI video clips of several seconds from a text prompt. What Adobe is offering is authenticity.
Watch out, Nvidia. OpenAI's proprietary AI chip is coming along
According to a new report from Reuters (spotted by Thurrott), OpenAI could finalize the design of its first 3nm AI chip in the coming months, with the goal of starting mass production at TSMC in 2026. The chip is being developed by a team of 40 OpenAI employees in collaboration with Broadcom. The project is being led by Richard Ho, OpenAI's new head of hardware, who previously worked on solutions for Google's infrastructure and cloud services. According to Reuters, OpenAI's chip will be able to both train and run AI models, but initially it'll be used mainly for inference (running AI models) and to a limited extent within the company's infrastructure. Demand for Nvidia's AI chips remains extremely high right now, with companies like OpenAI, Microsoft, Meta, and Google investing billions in AI data centers.
Fox News AI Newsletter: VP calls for ideology-free AI
Gladstone A.I. co-founders and CEOs Edouard Harris and Jeremie Harris explain the major role that A.I will play in national security and warfare on'The Will Cain Show.' Vice President JD Vance will attend an AI summit in Paris, France, a French official said anonymously. FREE FROM BIAS: Vice President JD Vance told world leaders in Paris on Tuesday that the United States intends to remain the dominant force in artificial intelligence and warned that the European Union's far tougher regulatory approach to the technology could cripple it. 'TRYING TO SLOW US DOWN': OpenAI CEO Sam Altman said Elon Musk is "probably just trying to slow us down" with his bid to purchase the company, insisting on Tuesday that it is not for sale. 'MASS SURVEILLANCE': OpenAI CEO Sam Altman predicts that artificial general intelligence will lead to lower costs for many goods, but has also warned that AI could be leveraged by authoritarian governments aiming to control people. TRANSLATED TRUTH: Whether you have an iPhone or an Android, these apps have got you covered with features like live speech translation, text input and even AI-powered sign and menu translation.
Adobe's Firefly generative AI video app is now in public beta
Adobe's Firefly Video Model is in public beta as of today, meaning the days of praying you had a chance to test it are over. Previously, it was only available in the Adobe Premiere Pro video editor with Generative Extend, but you can also access a standalone Firefly web app now. For those unaware, Adobe's Firefly Video Model powers the Generate Video feature, which can generate video clips from a text prompt or image. It can also edit images, turn them into videos, create 3D worlds and more. The content is "safe" for commercial use too, since the AI applies watermarks signifying which parts have AI assistance, and that can be checked with the Adobe Content Authenticity web app's Inspect tool.
Diffusion model predicts 3D genomic structures
This image shows the three-dimensional genome structures of several chromosomes reported in a Dip-C study, which were used to train the new ChromoGen model. Every cell in your body contains the same genetic sequence, yet each cell expresses only a subset of those genes. These cell-specific gene expression patterns, which ensure that a brain cell is different from a skin cell, are partly determined by the three-dimensional structure of the genetic material, which controls the accessibility of each gene. MIT chemists have now come up with a new way to determine those 3D genome structures, using generative artificial intelligence. Their technique can predict thousands of structures in just minutes, making it much speedier than existing experimental methods for analyzing the structures.
Want to run AI on your PC? You're gonna need a bigger hard drive
When people talk about the "size" of an AI model, they're referring to the number of "parameters" it contains. A parameter is one variable in the AI model that determines how it generates output, and any given AI model can have billions of these parameters. Also referred to as model weights, these parameters occupy storage space to operate properly -- and when an AI model has billions of parameters, storage requirements can quickly balloon. As you can see, the storage space consumed by an LLM increases with the size of its parameters. The same is true for other types of generative AI models, too.
Elon Musk owning OpenAI would be a terrible idea. That doesn't mean it won't happen Chris Stokel-Walker
The two had a blowout argument over the future direction of OpenAI – the company they came together to found in 2015 – with Altman seemingly content to pursue a for-profit approach and Musk feeling that was forswearing the founding principles of the firm as well as its name. OpenAI couldn't be open, he reckoned, if it was closed off and trying to make money rather than better humanity. So it's no surprise that Musk, who lodged an audacious bid to take over Twitter a little more than two years ago, which ended up with his ownership of the platform now called X, has sought to put a spoiler in two years of near-untrammelled growth for OpenAI. Musk – who is currently overhauling (to his supporters; "tearing down" to his opponents) the US government to be, as he would describe it, leaner and more efficient while also devastating important programmes such as international aid and cutting-edge scientific research – has lodged a near 100bn bid for OpenAI's non-profit arm. "It's time for OpenAI to return to the open-source, safety-focused force for good it once was," Musk said in a statement supplied by the lawyer shepherding his bid.