Goto

Collaborating Authors

 Media



AI Isn't Coming for Hollywood. It Has Already Arrived

WIRED

Lady Gaga probably wasn't thinking that a coup would unfold in her greenhouse. Then again, she was cohosting a party there with Sean Parker, the billionaire founder of Napster and first president of Facebook. It was February 2024, and the singer had invited guests to her 22.5 million oceanside estate in Malibu to mark the launch of a skin-care nonprofit. One of the organization's trustees was her boyfriend, whose day job was running the Parker Foundation. In the candlelit space, beside floor-to-ceiling windows that looked out over the Pacific, Parker's people mingled with Gaga's, nibbling focaccia and branzino alla brace to music from a string quartet (Grammy-winning, of course).


DAN GAINOR: Leftist MSNBC changes its name, but it's still the same embarrassment

FOX News

MSNBC's "Morning Joe" reacted to the networks upcoming name change, "My Source News Opinion World," or MS NOW, on Monday. But don't shed a tear (not that you would, anyway), it's turning into MS NOW. Or, as the New York Times put it, "Goodbye, MSNBC. The far-left network lost its tie to the newsy term "NBC" and looks more like some feminist retread site. Or, as MSNBC President Rebecca Kutler put it, "While our name will be changing, who we are and what we do will not." So, maybe my viewership assessment is correct. Sure, the ship might have made a career of hitting icebergs, but it's got a new name. The fallout from the change was swift. The Times even took a swipe with the follow-up headline: "MSNBC's Rebrand Invites Bemusement and Ridicule." The name switch reflects marketing nonsense as part of the corporate split. It also eliminates the long-standing comparison to MSDNC. The rationalization for the new name is: "My Source for News, Opinion, and the World." CNBC is going to keep its name, according to the Wall Street Journal, but the initials mean something else – "Consumer News and Business Channel," another marketing nuance. The new company will include, "NBCUniversal's cable television networks, including USA Network, CNBC, MSNBC, Oxygen, E!, SYFY and Golf Channel" along with a few other properties, including the formerly useful Rotten Tomatoes movie site. Nobody sane wants MSNBC/MS NOW connected in any way to NBC. It's been a corporate embarrassment for years. They're OK with it looking like the rational folks at CNBC are still connected, but the lunacy of MSNBC gets rebranded. It removes the stain for NBC. The more things change, the more they remain the same. This is the same network where they repeatedly compare President Donald Trump to monsters like Hitler and Stalin. Hosts regularly throw around charges of dictatorship like we are living in 1930s Germany – although somehow they are allowed to say it. Host Tiffany Cross recently claimed the government was grabbing people and "transporting them to concentration camps." And the face of the franchise, MSNBC host Rachel Maddow, told viewers, "We have a consolidating dictatorship in our country." Remember, "Morning Joe" host Joe Scarborough made the most-embarrassing quote of the entire failed Joe Biden presidency: "I've said it for years now, he's cogent.


Adversarial Music: Real world Audio Adversary against Wake-word Detection System

Neural Information Processing Systems

V oice Assistants (V As) such as Amazon Alexa or Google Assistant rely on wake-word detection to respond to people's commands, which could potentially be vulnerable to audio adversarial examples. In this work, we target our attack on the wake-word detection system, jamming the model with some inconspicuous background music to deactivate the V As while our audio adversary is present. We implemented an emulated wake-word detection system of Amazon Alexa based on recent publications.


Predicting the Politics of an Image Using Webly Supervised Data

Neural Information Processing Systems

We collect a dataset of over one million unique images and associated news articles from left-and right-leaning news sources, and develop a method to predict the image's political leaning. This problem is particularly challenging because of the enormous intra-class visual and semantic diversity of our data. We propose a two-stage method to tackle this problem. In the first stage, the model is forced to learn relevant visual concepts that, when joined with document embeddings computed from articles paired with the images, enable the model to predict bias. In the second stage, we remove the requirement of the text domain and train a visual classifier from the features of the former model. We show this two-stage approach facilitates learning and outperforms several strong baselines.


Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

arXiv.org Artificial Intelligence

Recent advances in large language models (LLMs) and multi-agent systems have demonstrated remarkable capabilities in complex problem-solving tasks such as deep research, vibe coding, and mathematical reasoning. However, most existing multi-agent systems are built upon manual prompt/workflow engineering with sophisticated agent frameworks, making them computationally inefficient, less capable, and can not benefit from data-centric learning. In this work, we introduce Chain-of-Agents (CoA), a novel paradigm of LLM reasoning that enables native end-to-end complex problem-solving in the same way as a multi-agent system (i.e., multi-turn problem solving with multiple tools and multiple agents) within one model. In chain-of-agents problem-solving, the model dynamically activates different tool agents and role-playing agents to simulate multi-agent collaboration in an end-to-end fashion. To elicit end-to-end chain-of-agents problem-solving abilities in LLMs, we introduce a multi-agent distillation framework to distill state-of-the-art multi-agent systems into chain-of-agents trajectories for agentic supervised fine-tuning. We then use agentic reinforcement learning on verifiable agentic tasks to further improve the models' capabilities on chain-of-agents problem solving. We call the resulting models Agent Foundation Models (AFMs). Our empirical studies demonstrate that AFM establishes new state-of-the-art performance across diverse benchmarks in both web agent and code agent settings. We make the entire research, including the model weights, code for training and evaluation, and the training data, fully open-sourced, which offers a solid starting point for future research on agent models and agentic RL.


Exploring Content and Social Connections of Fake News with Explainable Text and Graph Learning

arXiv.org Artificial Intelligence

The global spread of misinformation and concerns about content trustworthiness have driven the development of automated fact-checking systems. Since false information often exploits social media dynamics such as "likes" and user networks to amplify its reach, effective solutions must go beyond content analysis to incorporate these factors. Moreover, simply labelling content as false can be ineffective or even reinforce biases such as automation and confirmation bias. This paper proposes an explainable framework that combines content, social media, and graph-based features to enhance fact-checking. It integrates a misinformation classifier with explainability techniques to deliver complete and interpretable insights supporting classification decisions. Experiments demonstrate that multimodal information improves performance over single modalities, with evaluations conducted on datasets in English, Spanish, and Portuguese. Additionally, the framework's explanations were assessed for interpretability, trustworthiness, and robustness with a novel protocol, showing that it effectively generates human-understandable justifications for its predictions. The code and experiments are available at https://github.com/MeLLL-UFF/mu2X/ .


Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds

arXiv.org Artificial Intelligence

Audio DNNs have demonstrated impressive performance on various machine listening tasks; however, most of their representations are computationally costly and uninterpretable, leaving room for optimization. Here, we propose a novel approach centered on spectrotemporal modulation (STM) features, a signal processing method that mimics the neurophysiological representation in the human auditory cortex. The classification performance of our STM-based model, without any pretraining, is comparable to that of pretrained audio DNNs across diverse naturalistic speech, music, and environmental sounds, which are essential categories for both human cognition and machine perception. These results show that STM is an efficient and interpretable feature representation for audio classification, advancing the development of machine listening and unlocking exciting new possibilities for basic understanding of speech and auditory sciences, as well as developing audio BCI and cognitive computing.


The AI Risk Spectrum: From Dangerous Capabilities to Existential Threats

arXiv.org Artificial Intelligence

As AI systems become more capable, integrated, and widespread, understanding the associated risks becomes increasingly important. This paper maps the full spectrum of AI risks, from current harms affecting individual users to existential threats that could endanger humanity's survival. We organize these risks into three main causal categories. Misuse risks, which occur when people deliberately use AI for harmful purposes - creating bioweapons, launching cyberattacks, adversarial AI attacks or deploying lethal autonomous weapons. Misalignment risks happen when AI systems pursue outcomes that conflict with human values, irrespective of developer intentions. This includes risks arising through specification gaming (reward hacking), scheming and power-seeking tendencies in pursuit of long-term strategic goals. Systemic risks, which arise when AI integrates into complex social systems in ways that gradually undermine human agency - concentrating power, accelerating political and economic disempowerment, creating overdependence that leads to human enfeeblement, or irreversibly locking in current values curtailing future moral progress. Beyond these core categories, we identify risk amplifiers - competitive pressures, accidents, corporate indifference, and coordination failures - that make all risks more likely and severe. Throughout, we connect today's existing risks and empirically observable AI behaviors to plausible future outcomes, demonstrating how existing trends could escalate to catastrophic outcomes. Our goal is to help readers understand the complete landscape of AI risks. Good futures are possible, but they don't happen by default. Navigating these challenges will require unprecedented coordination, but an extraordinary future awaits if we do.


Understanding Distribution Structure on Calibrated Recommendation Systems

arXiv.org Artificial Intelligence

--Traditional recommender systems aim to generate a recommendation list comprising the most relevant or similar items to the user's profile. These approaches can create recommendation lists that omit item genres from the less prominent areas of a user's profile, thereby undermining the user's experience. T o solve this problem, the calibrated recommendation system provides a guarantee of including less representative areas in the recommended list. The calibrated context works with three distributions. The first is from the user's profile, the second is from the candidate items, and the last is from the recommendation list. These distributions are G-dimensional, where G is the total number of genres in the system. This high dimensionality requires a different evaluation method, considering that traditional recommenders operate in a one-dimensional data space. In this sense, we implement fifteen models that help to understand how these distributions are structured. We evaluate the users' patterns in three datasets from the movie domain. The results indicate that the models of outlier detection provide a better understanding of the structures. The calibrated system creates recommendation lists that act similarly to traditional recommendation lists, allowing users to change their groups of preferences to the same degree. Commonly, traditional recommender systems generate recommendations with miscalibration [1]. Miscalibration means that the recommendation lists do not follow the user preferences distribution, instead suggesting items from user's dominant area of interest. It creates an overspecialized recommendation list in which the items from the less dominant area are overwhelmed. This effect puts the user in a filter bubble or an echo chamber problem [2]. For instance, when a specific area dominates the recommended list, the user likely has few other options to interact with, aside from items within that dominant area. Then, the subsequent lists are recommended, with the dominant area becoming more overspecialized. In recent years, calibrated recommendation systems have attracted attention [3]-[8] from the recommender system community to overcome this issue. This type of system demonstrates the capacity to improve several objectives, such as diversity [3], control of popularity bias [4], item coverage [5], precision [6], and the reduction of miscalibration [7]. To illustrate how calibrated recommendation works, consider a scenario: if a user's preferences distribution indicates Corresponding author is Diego Corr ˆ ea da Silva.