Goto

Collaborating Authors

 Media


When Algorithms Meet Artists: Topic Modeling the AI-Art Debate, 2013-2025

arXiv.org Artificial Intelligence

As generative AI continues to reshape artistic production and alternate modes of human expression, artists whose livelihoods are most directly affected have raised urgent concerns about consent, transparency, and the future of creative labor. However, the voices of artists are often marginalized in dominant public and scholarly discourse. This study presents a twelve-year analysis, from 2013 to 2025, of English-language discourse surrounding AI-generated art. It draws from 439 curated 500-word excerpts sampled from opinion articles, news reports, blogs, legal filings, and spoken-word transcripts. Through a reproducible methodology, we identify five stable thematic clusters and uncover a misalignment between artists' perceptions and prevailing media narratives. Our findings highlight how the use of technical jargon can function as a subtle form of gatekeeping, often sidelining the very issues artists deem most urgent. Our work provides a BERTopic-based methodology and a multimodal baseline for future research, alongside a clear call for deeper, transparency-driven engagement with artist perspectives in the evolving AI-creative landscape.


SegQuant: A Semantics-Aware and Generalizable Quantization Framework for Diffusion Models

arXiv.org Artificial Intelligence

Diffusion models have demonstrated exceptional generative capabilities but are computationally intensive, posing significant challenges for deployment in resource-constrained or latency-sensitive environments. Quantization offers an effective means to reduce model size and computational cost, with post-training quantization (PTQ) being particularly appealing due to its compatibility with pre-trained models without requiring retraining or training data. However, existing PTQ methods for diffusion models often rely on architecture-specific heuristics that limit their generalizability and hinder integration with industrial deployment pipelines. To address these limitations, we propose SegQuant, a unified quantization framework that adaptively combines complementary techniques to enhance cross-model versatility. SegQuant consists of a segment-aware, graph-based quantization strategy (SegLinear) that captures structural semantics and spatial heterogeneity, along with a dual-scale quantization scheme (DualScale) that preserves polarity-asymmetric activations, which is crucial for maintaining visual fidelity in generated outputs. SegQuant is broadly applicable beyond Transformer-based diffusion models, achieving strong performance while ensuring seamless compatibility with mainstream deployment tools.


Analyzing Character Representation in Media Content using Multimodal Foundation Model: Effectiveness and Trust

arXiv.org Artificial Intelligence

Recent advances in AI has made automated analysis of complex media content at scale possible while generating actionable insights regarding character representation along such dimensions as gender and age. Past works focused on quantifying representation from audio/video/text using AI models, but without having the audience in the loop. We ask, even if character distribution along demographic dimensions are available, how useful are those to the general public? Do they actually trust the numbers generated by AI models? Our work addresses these open questions by proposing a new AI-based character representation tool and performing a thorough user study. Our tool has two components: (i) An analytics extraction model based on the Contrastive Language Image Pretraining (CLIP) foundation model that analyzes visual screen data to quantify character representation across age and gender; (ii) A visualization component effectively designed for presenting the analytics to lay audience. The user study seeks empirical evidence on the usefulness and trustworthiness of the AI-generated results for carefully chosen movies presented in the form of our visualizations. We found that participants were able to understand the analytics in our visualizations, and deemed the tool `overall useful'. Participants also indicated a need for more detailed visualizations to include more demographic categories and contextual information of the characters. Participants' trust in AI-based gender and age models is seen to be moderate to low, although they were not against the use of AI in this context. Our tool including code, benchmarking, and the user study data can be found at https://github.com/debadyuti0510/Character-Representation-Media.


Socioeconomic Threats of Deepfakes and the Role of Cyber-Wellness Education in Defense

Communications of the ACM

Due to the limits of science and its steep learning curve, we must rely on the expertise of others to develop our knowledge and skills.26 Toward this end, social media platforms have revolutionized how netizens--users who are actively engaged in online communities--gain knowledge and skills by facilitating the exchange of costless information with the public (for example, followers or influencers). Businesses around the world also use these platforms along with tools based on generative artificial intelligence (GenAI) to craft synthetic media, hoping to grow revenue by attracting more customers and improving their online experience.28 Generative AI tools can empower cyber threats and have cyberpsychological effects on netizens, allowing malicious actors to craft deepfakes in the form of disinformation, misinformation, and malinformation. Service providers not only must enhance GenAI tools to reduce hallucinations, but they also have a statutory duty to mitigate data-driven biases.


Light-based AI image generator uses almost no power

New Scientist

An AI image generator that uses light to produce images, rather than conventional computing hardware, could consume hundreds of times less energy. When an artificial intelligence model produces an image from text, it typically uses a process called diffusion. The AI is first shown a large collection of images and shown how to destroy them using statistical noise, then it encodes these patterns in a set of rules. When it is given a new, noisy image, it can use these rules to do the same thing in reverse: over many steps, it works towards a coherent image that matches a given text request. For realistic, high-resolution images, diffusion uses many sequential steps that require a significant level of computing power.


Rumors spread like viruses. The French Revolution proved it.

Popular Science

Breakthroughs, discoveries, and DIY tips sent every weekday. It's hard to contain misinformation once enough people believe it. A conspiracy theory spreads exponentially regardless of its accuracy, making it that much more likely to translate into real violence. According to a study published August 27 in the journal Nature, these situations can (and should) be geographically mapped with the same models that epidemiologists use to track diseases. And as an example, researchers turned to one of history's most famous moments of misinformation. The Great Fear of 1789 was a major chapter in the French Revolution and a defining moment in modern history.


AI drone finds missing hiker's remains in mountains after 10 months

FOX News

Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. A missing hiker's dead body was finally found in July in Italy's rugged Piedmont region after 10 months. The recovery team credited the breakthrough to an AI-powered drone that spotted a critical clue within hours. The same process would have taken weeks or even months if done by the human eye.


3 Things James O'Donnell is into right now

MIT Technology Review

This is a podcast in which two very smart people (who happen to be young and hilarious professors of philosophy) draw unexpected philosophical connections between facets of modern life. Ellie Anderson and David Peรฑa-Guzmรกn have done hour-long episodes on everything from mommy issues to animal justice, with particularly sharp segments on tech-adjacent issues like biohacking and the relationship between AI and art. Whenever I think society is dealing with a brand-new problem, these two unearth someone who was pondering it centuries ago. It's a treat to listen to. Over the summer I was eager to watch Mountainhead, a darkly funny film by Jesse Armstrong, the creator of Succession, that follows four unlikable tech founders as they watch the world collapse under political turmoil and violence caused by AI deepfakes.


ChatGPT has its uses, but I still hate it โ€“ and I'll tell you why Imogen West-Knights

The Guardian

It's one of those topics that comes up over drinks or dinner at the moment: whether or not you think AI is going to steal your job. So far, I've felt relatively confident that while AI could no doubt have a fair crack at writing a newspaper opinion column, there is something I do as part of my work that AI cannot: reporting. Except now, it seems, AI is claiming to be doing that as well. Last week, it was revealed that at least six reputable publications have had to take down published articles because it turned out that they were probably pieces of fiction written by AI and then passed off by somebody as works of journalism under the name of Margaux Blanchard. One of these was a piece for Wired titled They Fell in Love Playing Minecraft.


Debate-to-Detect: Reformulating Misinformation Detection as a Real-World Debate with Large Language Models

arXiv.org Artificial Intelligence

The proliferation of misinformation in digital platforms reveals the limitations of traditional detection methods, which mostly rely on static classification and fail to capture the intricate process of real-world fact-checking. Despite advancements in Large Language Models (LLMs) that enhance automated reasoning, their application to misinformation detection remains hindered by issues of logical inconsistency and superficial verification. In response, we introduce Debate-to-Detect (D2D), a novel Multi-Agent Debate (MAD) framework that reformulates misinformation detection as a structured adversarial debate. Inspired by fact-checking workflows, D2D assigns domain-specific profiles to each agent and orchestrates a five-stage debate process, including Opening Statement, Rebuttal, Free Debate, Closing Statement, and Judgment. To transcend traditional binary classification, D2D introduces a multi-dimensional evaluation mechanism that assesses each claim across five distinct dimensions: Factuality, Source Reliability, Reasoning Quality, Clarity, and Ethics. Experiments with GPT-4o on two datasets demonstrate significant improvements over baseline methods, and the case study highlight D2D's capability to iteratively refine evidence while improving decision transparency, representing a substantial advancement towards interpretable misinformation detection. The code will be released publicly after the official publication.