AITopics

AI-powered influence operations can now be executed end-to-end on commodity hardware. We show that small language models produce coherent, persona-driven political messaging and can be evaluated automatically without human raters. Two behavioural findings emerge. First, persona-over-model: persona design explains behaviour more than model identity. Second, engagement as a stressor: when replies must engage with counter-arguments, ideological adherence strengthens and the prevalence of extreme content increases. We demonstrate that fully automated influence-content production is within reach of both large and small actors. Consequently, defence should shift from restricting model access towards conversation-centric detection and disruption of campaigns and coordination infrastructure. Paradoxically, the very consistency that enables these operations also provides a detection signature.

large language model, machine learning, persona, (19 more...)

2508.20186

Country: Europe (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.69)

Industry:

Media > News (0.46)
Government > Military (0.46)
Government > Regional Government (0.46)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
(2 more...)

ProactiveEval: A Unified Evaluation Framework for Proactive Dialogue Agents

Liu, Tianjian, Wan, Fanqi, Guo, Jiajian, Quan, Xiaojun

Proactive dialogue has emerged as a critical and challenging research problem in advancing large language models (LLMs). Existing works predominantly focus on domain-specific or task-oriented scenarios, which leads to fragmented evaluations and limits the comprehensive exploration of models' proactive conversation abilities. In this work, we propose ProactiveEval, a unified framework designed for evaluating proactive dialogue capabilities of LLMs. This framework decomposes proactive dialogue into target planning and dialogue guidance, establishing evaluation metrics across various domains. Moreover, it also enables the automatic generation of diverse and challenging evaluation data. Based on the proposed framework, we develop 328 evaluation environments spanning 6 distinct domains. Through experiments with 22 different types of LLMs, we show that DeepSeek-R1 and Claude-3.7-Sonnet exhibit exceptional performance on target planning and dialogue guidance tasks, respectively. Finally, we investigate how reasoning capabilities influence proactive behaviors and discuss their implications for future model development.

arxiv preprint arxiv, large language model, machine learning, (19 more...)

2508.20973

Country: Asia > China (0.28)

Genre: Research Report > New Finding (0.68)

Industry:

Media (0.68)
Leisure & Entertainment (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Looking Beyond the Obvious: A Survey on Abstract Concept Recognition for Video Understanding

Mago, Gowreesh, Mettes, Pascal, Rudinac, Stevan

The automatic understanding of video content is advancing rapidly. Empowered by deeper neural networks and large datasets, machines are increasingly capable of understanding what is concretely visible in video frames, whether it be objects, actions, events, or scenes. In comparison, humans retain a unique ability to also look beyond concrete entities and recognize abstract concepts like justice, freedom, and togetherness. Abstract concept recognition forms a crucial open challenge in video understanding, where reasoning on multiple semantic levels based on contextual information is key. In this paper, we argue that the recent advances in foundation models make for an ideal setting to address abstract concept understanding in videos. Automated understanding of high-level abstract concepts is imperative as it enables models to be more aligned with human reasoning and values. In this survey, we study different tasks and datasets used to understand abstract concepts in video content. We observe that, periodically and over a long period, researchers have attempted to solve these tasks, making the best use of the tools available at their disposal. We advocate that drawing on decades of community experience will help us shed light on this important open grand challenge and avoid "re-inventing the wheel" as we start revisiting it in the era of multi-modal foundation models.

computer vision, large language model, machine learning, (20 more...)

2508.20765

Country: North America > United States (0.46)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Media > News (1.00)
Media > Film (1.00)
Leisure & Entertainment (1.00)
Health & Medicine > Therapeutic Area (0.92)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
(6 more...)

A Graph Talks, But Who's Listening? Rethinking Evaluations for Graph-Language Models

Petkar, Soham, K, Hari Aakash, Vempati, Anirudh, Sinha, Akshit, Kumaraguru, Ponnurangam, Agarwal, Chirag

Developments in Graph-Language Models (GLMs) aim to integrate the structural reasoning capabilities of Graph Neural Networks (GNNs) with the semantic understanding of Large Language Models (LLMs). However, we demonstrate that current evaluation benchmarks for GLMs, which are primarily repurposed node-level classification datasets, are insufficient to assess multimodal reasoning. Our analysis reveals that strong performance on these benchmarks is achievable using unimodal information alone, suggesting that they do not necessitate graph-language integration. To address this evaluation gap, we introduce the CLEGR (Compositional Language-Graph Reasoning) benchmark, designed to evaluate multimodal reasoning at various complexity levels. Our benchmark employs a synthetic graph generation pipeline paired with questions that require joint reasoning over structure and textual semantics. We perform a thorough evaluation of representative GLM architectures and find that soft-prompted LLM baselines perform on par with GLMs that incorporate a full GNN backbone. This result calls into question the architectural necessity of incorporating graph structure into LLMs. We further show that GLMs exhibit significant performance degradation in tasks that require structural reasoning. These findings highlight limitations in the graph reasoning capabilities of current GLMs and provide a foundation for advancing the community toward explicit multimodal reasoning involving graph structure and language.

artificial intelligence, large language model, natural language, (17 more...)

2508.20583

Country: Asia (0.46)

Genre: Research Report > New Finding (1.00)

Industry:

Transportation > Ground (0.68)
Media (0.68)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

MM-HSD: Multi-Modal Hate Speech Detection in Videos

Céspedes-Sarrias, Berta, Collado-Capell, Carlos, Rodenas-Ruiz, Pablo, Hrynenko, Olena, Cavallaro, Andrea

While hate speech detection (HSD) has been extensively studied in text, existing multi-modal approaches remain limited, particularly in videos. As modalities are not always individually informative, simple fusion methods fail to fully capture inter-modal dependencies. Moreover, previous work often omits relevant modalities such as on-screen text and audio, which may contain subtle hateful content and thus provide essential cues, both individually and in combination with others. In this paper, we present MM-HSD, a multi-modal model for HSD in videos that integrates video frames, audio, and text derived from speech transcripts and from frames (i.e., on-screen text) together with features extracted by Cross-Modal Attention (CMA). We are the first to use CMA as an early feature extractor for HSD in videos, to systematically compare query/key configurations, and to evaluate the interactions between different modalities in the CMA block. Our approach leads to improved performance when on-screen text is used as a query and the rest of the modalities serve as a key. Experiments on the HateMM dataset show that MM-HSD outperforms state-of-the-art methods on M-F1 score (0.874), using concatenation of transcript, audio, video, on-screen text, and CMA for feature extraction on raw embeddings of the modalities. The code is available at https://github.com/idiap/mm-hsd
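As a rough illustration of the query/key idea in the abstract above, the following is a minimal sketch of scaled dot-product cross-attention in which on-screen text embeddings act as queries and the remaining modalities act as keys. It is not the MM-HSD implementation (see the linked repository for that). The function names, the toy two-dimensional embeddings, the reuse of keys as values, and the absence of learned projection matrices are all assumptions made for brevity.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_modal_attention(queries, keys, values):
    """Scaled dot-product attention: every query attends over all keys.

    Returns (outputs, weights); weights[i][k] is how strongly query i
    attends to key k. No learned projections are applied (assumption).
    """
    dim = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim) for k in keys
        ]
        weights = softmax(scores)
        # Weighted sum of the value vectors, one output dimension at a time.
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights


# Hypothetical toy embeddings, not taken from the paper.
on_screen_text = [[1.0, 0.0], [0.0, 1.0]]                   # queries
other_modalities = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]     # keys, reused as values

attended, weights = cross_modal_attention(
    on_screen_text, other_modalities, other_modalities
)
```

Each row of `weights` sums to one, so every on-screen text token receives a convex combination of the other modalities' embeddings. Swapping which modality supplies `queries` corresponds to comparing query/key configurations.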

artificial intelligence, machine learning, natural language, (19 more...)

doi: 10.1145/3746027.3754558

2508.20546

Country:

Europe (1.00)
North America > United States (0.93)
Asia (0.93)

Genre: Research Report > Promising Solution (0.34)

Industry:

Leisure & Entertainment (1.00)
Media (0.97)
Information Technology (0.93)
Law Enforcement & Public Safety > Terrorism (0.67)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Enhancing Health Fact-Checking with LLM-Generated Synthetic Data

Zhang, Jingze, Qian, Jiahe, Zhou, Yiliang, Peng, Yifan

Fact-checking for health-related content is challenging due to the limited availability of annotated training data. In this study, we propose a synthetic data generation pipeline that leverages large language models (LLMs) to augment training data for health-related fact-checking. In this pipeline, we summarize source documents, decompose the summaries into atomic facts, and use an LLM to construct sentence-fact entailment tables. From the entailment relations in the table, we further generate synthetic text-claim pairs with binary veracity labels. These synthetic data are then combined with the original data to fine-tune a BERT-based fact-checking model. Evaluation on two public datasets, PubHealth and SciFact, shows that our pipeline improved F1 scores by up to 0.019 and 0.049, respectively, compared to models trained only on the original data.
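The last pipeline step described above, turning a sentence-fact entailment table into labelled text-claim pairs, might look roughly like the sketch below. It is a simplified assumption, not the authors' procedure: it emits one pair per table cell, uses a hand-written table in place of LLM output, and every name and example sentence here is hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextClaimPair:
    text: str    # a sentence from the summary
    claim: str   # an atomic fact used as the claim
    label: bool  # True if the text entails the claim


def pairs_from_entailment_table(sentences, facts, table):
    """Emit one labelled pair per cell; table[i][j] is True when
    sentence i entails atomic fact j (assumed table layout)."""
    pairs = []
    for i, sentence in enumerate(sentences):
        for j, fact in enumerate(facts):
            pairs.append(TextClaimPair(sentence, fact, bool(table[i][j])))
    return pairs


# Toy inputs standing in for the summarisation and decomposition steps.
sentences = [
    "Regular handwashing reduces the spread of germs.",
    "The clinic opens at nine on weekdays.",
]
facts = [
    "Handwashing reduces the spread of germs.",
    "The clinic is open on weekdays.",
]
table = [
    [True, False],
    [False, True],
]

synthetic_pairs = pairs_from_entailment_table(sentences, facts, table)
positives = [p for p in synthetic_pairs if p.label]
```

In a fuller version, the resulting pairs would be merged with the original annotated data before fine-tuning the classifier.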

artificial intelligence, large language model, natural language, (19 more...)

2508.20525

Country: North America > United States > Alabama (0.14)

Genre:

Personal > Obituary (0.46)
Research Report > New Finding (0.35)

Industry:

Health & Medicine > Therapeutic Area (0.94)
Media (0.93)
Leisure & Entertainment > Sports (0.69)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Photonic restricted Boltzmann machine for content generation tasks

Luo, Li, Fang, Yisheng, Zhang, Wanyi, Ruan, Zhichao

The restricted Boltzmann machine (RBM) is a neural network based on the Ising model, well known for its ability to learn probability distributions and stochastically generate new content. However, the high computational cost of Gibbs sampling in content generation tasks imposes significant bottlenecks on electronic implementations. Here, we propose a photonic restricted Boltzmann machine (PRBM) that leverages photonic computing to accelerate Gibbs sampling, enabling efficient content generation. By introducing an efficient encoding method, the PRBM eliminates the need for computationally intensive matrix decomposition and reduces the computational complexity of Gibbs sampling from $O(N)$ to $O(1)$. Moreover, its non-von Neumann photonic computing architecture circumvents the memory storage of interaction matrices, providing substantial advantages for large-scale RBMs. We experimentally validate the photonic-accelerated Gibbs sampling by simulating a two-dimensional Ising model, where the observed phase transition temperature closely matches the theoretical predictions. Beyond physics-inspired tasks, the PRBM demonstrates robust capabilities in generating and restoring diverse content, including images and temporal sequences, even in the presence of noise and aberrations. The scalability and reduced training cost of the PRBM framework underscore its potential as a promising pathway for advancing photonic computing in generative artificial intelligence.
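For readers unfamiliar with the sampling step that the abstract above sets out to accelerate, here is a minimal textbook-style sketch of block Gibbs sampling in a binary RBM on ordinary electronic hardware. It does not model the photonic scheme or its encoding method; the layer sizes, random weights, zero biases, and function names are illustrative assumptions.

```python
import math
import random


def sigmoid(x):
    """Logistic function, written to avoid overflow for large |x|."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def sample_layer(inputs, weights, biases, rng):
    """Sample every binary unit of one layer given the other layer.

    weights[i][j] couples input unit i to output unit j.
    """
    sampled = []
    for j, bias in enumerate(biases):
        activation = bias + sum(
            inputs[i] * weights[i][j] for i in range(len(inputs))
        )
        sampled.append(1 if rng.random() < sigmoid(activation) else 0)
    return sampled


def transpose(matrix):
    return [list(column) for column in zip(*matrix)]


def gibbs_chain(v0, W, b_visible, b_hidden, steps, rng):
    """Alternate h ~ p(h | v) and v ~ p(v | h) for a number of steps."""
    W_t = transpose(W)
    v, h = list(v0), []
    for _ in range(steps):
        h = sample_layer(v, W, b_hidden, rng)      # hidden given visible
        v = sample_layer(h, W_t, b_visible, rng)   # visible given hidden
    return v, h


# Toy RBM with 4 visible and 3 hidden units (arbitrary small weights).
rng = random.Random(0)
W = [[rng.gauss(0.0, 0.1) for _ in range(3)] for _ in range(4)]
v, h = gibbs_chain([1, 0, 1, 0], W, [0.0] * 4, [0.0] * 3, steps=10, rng=rng)
```

Each half-step evaluates a weighted sum for every unit in a layer, which is the per-step work that a hardware accelerator would aim to reduce.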

artificial intelligence, gibbs, machine learning, (16 more...)

2508.20472

Country: Asia > China (0.28)

Genre: Research Report (0.64)

Industry:

Leisure & Entertainment (1.00)
Media > Music (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.99)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.92)

ConspirED: A Dataset for Cognitive Traits of Conspiracy Theories and Large Language Model Safety

Bates, Luke, Glockner, Max, Nakov, Preslav, Gurevych, Iryna

Conspiracy theories erode public trust in science and institutions while resisting debunking by evolving and absorbing counter-evidence. As AI-generated misinformation becomes increasingly sophisticated, understanding rhetorical patterns in conspiratorial content is important for developing interventions such as targeted prebunking and assessing AI vulnerabilities. We introduce ConspirED (CONSPIR Evaluation Dataset), which captures the cognitive traits of conspiratorial ideation in multi-sentence excerpts (80-120 words) from online conspiracy articles, annotated using the CONSPIR cognitive framework (Lewandowsky and Cook, 2020). ConspirED is the first dataset of conspiratorial content annotated for general cognitive traits. Using ConspirED, we (i) develop computational models that identify conspiratorial traits and determine dominant traits in text excerpts, and (ii) evaluate large language/reasoning model (LLM/LRM) robustness to conspiratorial inputs. We find that both are misaligned by conspiratorial content, producing output that mirrors input reasoning patterns, even when successfully deflecting comparable fact-checked misinformation.

computational linguistic, large language model, machine learning, (17 more...)

2508.20468

Country:

Europe (1.00)
Asia (1.00)
North America > United States > New Mexico (0.28)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Epidemiology (0.94)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

FOX News Aug-28-2025, 16:08:19 GMT

Jobs that are most at risk from AI, according to Microsoft

Right now, many people are worried that artificial intelligence (AI) is coming for their jobs. If you're one of them, then the recent study by Microsoft will shed some light on how AI's generative capabilities will impact your field of work. In short, some occupations are more susceptible to its influence than others. This study is making waves because, unlike previous studies, it draws insight from real-world data.

artificial intelligence, microsoft, natural language, (15 more...)

FOX News

Country: North America > United States > California > San Francisco County > San Francisco (0.05)

Genre: Research Report (0.36)

Industry:

Banking & Finance > Economy (0.32)
Media > News (0.31)
Information Technology (0.30)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.31)

FOX News Aug-28-2025, 12:36:29 GMT

Will Smith accused of using AI to create fake crowd in concert performance footage

Will Smith is facing accusations of using artificial intelligence to create a crowd in a video shared online. Smith, 56, posted a YouTube clip allegedly featuring scenes from a tour performance, but eagle-eyed fans were quick to point out purported inaccuracies in the video. The "Gettin' Jiggy Wit It" singer appeared to be singing to a packed room while on tour, only for distorted images to materialize in the crowd.

artificial intelligence, smith, social media, (12 more...)

FOX News

Industry:

Media > News (0.44)
Media > Music (0.34)

Technology:

Information Technology > Artificial Intelligence > Applied AI (0.90)
Information Technology > Communications > Social Media (0.64)