Goto

Collaborating Authors

 flu


VELA: An LLM-Hybrid-as-a-Judge Approach for Evaluating Long Image Captions

Matsuda, Kazuki, Wada, Yuiga, Hirano, Shinnosuke, Otsuki, Seitaro, Sugiura, Komei

arXiv.org Artificial Intelligence

In this study, we focus on the automatic evaluation of long and detailed image captions generated by multimodal Large Language Models (MLLMs). Most existing automatic evaluation metrics for image captioning are primarily designed for short captions and are not suitable for evaluating long captions. Moreover, recent LLM-as-a-Judge approaches suffer from slow inference due to their reliance on autoregressive inference and early fusion of visual information. To address these limitations, we propose VELA, an automatic evaluation metric for long captions developed within a novel LLM-Hybrid-as-a-Judge framework. Furthermore, we propose LongCap-Arena, a benchmark specifically designed for evaluating metrics for long captions. This benchmark comprises 7,805 images, the corresponding human-provided long reference captions and long candidate captions, and 32,246 human judgments from three distinct perspectives: Descriptiveness, Relevance, and Fluency. We demonstrated that VELA outperformed existing metrics and achieved superhuman performance on LongCap-Arena.


Predicting sub-population specific viral evolution

Shi, Wenxian, Wu, Menghua, Barzilay, Regina

arXiv.org Artificial Intelligence

Forecasting the change in the distribution of viral variants is crucial for therapeutic design and disease surveillance. This task poses significant modeling challenges due to the sharp differences in virus distributions across sub-populations (e.g., countries) and their dynamic interactions. Existing machine learning approaches that model the variant distribution as a whole are incapable of making location-specific predictions and ignore transmissions that shape the viral landscape. In this paper, we propose a sub-population specific protein evolution model, which predicts the time-resolved distributions of viral proteins in different locations. The algorithm explicitly models the transmission rates between sub-populations and learns their interdependence from data. The change in protein distributions across all sub-populations is defined through a linear ordinary differential equation (ODE) parametrized by transmission rates. Solving this ODE yields the likelihood of a given protein occurring in particular sub-populations. Multi-year evaluation on both SARS-CoV-2 and influenza A/H3N2 demonstrates that our model outperforms baselines in accurately predicting distributions of viral proteins across continents and countries. We also find that the transmission rates learned from data are consistent with the transmission pathways discovered by retrospective phylogenetic analysis.


Cold and flu season is coming: Know the warning signs and symptoms now

FOX News

Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. "Game of Thrones" may be over, but winter is still coming. That means the dreaded cold and flu season is right around the corner. "A visit with a clinician has become increasingly common for upper respiratory symptoms since the COVID pandemic," Mark Fendrick, M.D., a general internist at the University of Michigan, who is based in Ann Arbor, Michigan, told Fox News Digital.


The Next Chapter: A Study of Large Language Models in Storytelling

Xie, Zhuohan, Cohn, Trevor, Lau, Jey Han

arXiv.org Artificial Intelligence

To enhance the quality of generated stories, recent story generation models have been investigating the utilization of higher-level attributes like plots or commonsense knowledge. The application of prompt-based learning with large language models (LLMs), exemplified by GPT-3, has exhibited remarkable performance in diverse natural language processing (NLP) tasks. This paper conducts a comprehensive investigation, utilizing both automatic and human evaluation, to compare the story generation capacity of LLMs with recent models across three datasets with variations in style, register, and length of stories. The results demonstrate that LLMs generate stories of significantly higher quality compared to other story generation models. Moreover, they exhibit a level of performance that competes with human authors, albeit with the preliminary observation that they tend to replicate real stories in situations involving world knowledge, resembling a form of plagiarism.


A Human-Centered Interpretability Framework Based on Weight of Evidence

Alvarez-Melis, David, Kaur, Harmanpreet, Daumé, Hal III, Wallach, Hanna, Vaughan, Jennifer Wortman

arXiv.org Artificial Intelligence

In this paper, we take a human-centered approach to interpretable machine learning. First, drawing inspiration from the study of explanation in philosophy, cognitive science, and the social sciences, we propose a list of design principles for machine-generated explanations that are meaningful to humans. Using the concept of weight of evidence from information theory, we develop a method for producing explanations that adhere to these principles. We show that this method can be adapted to handle high-dimensional, multi-class settings, yielding a flexible meta-algorithm for generating explanations. We demonstrate that these explanations can be estimated accurately from finite samples and are robust to small perturbations of the inputs. We also evaluate our method through a qualitative user study with machine learning practitioners, where we observe that the resulting explanations are usable despite some participants struggling with background concepts like prior class probabilities. Finally, we conclude by surfacing design implications for interpretability tools


You -- yes, you -- can help AI predict the spread of coronavirus

#artificialintelligence

Roni Rosenfeld makes predictions for a living. Typically, he uses artificial intelligence to forecast the spread of the seasonal flu. But with the coronavirus outbreak claiming lives all over the world, he's switched to predicting the spread of Covid-19. It was the Centers for Disease Control and Prevention (CDC) that asked Rosenfeld to take on this task. As a professor of computer science at Carnegie Mellon University, he leads the machine learning department and the Delphi research group, which aims "to make epidemiological forecasting as universally accepted and useful as weather forecasting is today."


Redistribution Systems and PRAM

Cohen, Paul, Loboda, Tomasz

arXiv.org Artificial Intelligence

Redistribution systems iteratively redistribute mass between groups under the control of rules. PRAM is a framework for building redistribution systems. We discuss the relationships between redistribution systems, agent-based systems, compartmental models and Bayesian models. PRAM puts agent-based models on a sound probabilistic footing by reformulating them as redistribution systems. This provides a basis for integrating agent-based and probabilistic models. \pram/ extends the themes of probabilistic relational models and lifted inference to incorporate dynamical models and simulation. We illustrate PRAM with an epidemiological example.


On State Variables, Bandit Problems and POMDPs

Powell, Warren B

arXiv.org Artificial Intelligence

State variables are easily the most subtle dimension of sequential decision problems. This is especially true in the context of active learning problems (bandit problems") where decisions affect what we observe and learn. We describe our canonical framework that models {\it any} sequential decision problem, and present our definition of state variables that allows us to claim: Any properly modeled sequential decision problem is Markovian. We then present a novel two-agent perspective of partially observable Markov decision problems (POMDPs) that allows us to then claim: Any model of a real decision problem is (possibly) non-Markovian. We illustrate these perspectives using the context of observing and treating flu in a population, and provide examples of all four classes of policies in this setting. We close with an indication of how to extend this thinking to multiagent problems.


Flu's spread is unpredictable. Can AI yield more reliable forecasts? - STAT

#artificialintelligence

Flu season reliably arrives around this time every year -- but where the virus heads and how it will spread can seem wildly unpredictable. Now, artificial intelligence is playing a bigger role in trying to change that. One startup is using data collected from thermometers to develop algorithms to derive insights about flu activity that it could sell to retailers, consumer-goods makers, and perhaps even health systems. Academic researchers are refining sophisticated AI models, including using machine learning and statistical methods to recognize patterns and map out future trajectories. Unlock this article by subscribing to STAT Plus and enjoy your first 30 days free!


Probabilistic Relational Agent-based Models

Cohen, Paul

arXiv.org Artificial Intelligence

In agent-based models (ABMs, e.g., [4, 3]) agents probabilistically change state. State can be represented as attribute values such as health status, monthly income, age, political orientation, location and so on. A population of agents has a joint state that is typically a joint distribution; for example, a population has a joint distribution over income levels and political beliefs. ABMs are a popular method for exploring the dynamics of joint states, which can be hard to estimate when attribute values depend on each other, and populations are heterogeneous in the sense that not everyone has the same distribution of attribute values, and the principal mechanism for changing attribute values is interactions between agents. For example, suppose all agents have a flu status attribute that changes conditionally - given other attributes such as age, income, and vaccination status - when agents interact. The dynamics of flu - how it moves through heterogeneous populations - can be difficult or impossible to solve, but ABMs can simulate the interactions of agents, and the flu status of these agents can be tracked over time. ABMs are no doubt engines of probabilistic inference, but it is difficult to say anything about the models that underlie the inference. This paper presents pram - Probabilistic Relational Agentbased Models - a new kind of ABM with design influences from compartmental models (e.g., [1]) and probabilistic relational models (PRMs; e.g., [2]).