AITopics

2505.06624

Country: North America (0.67)

Genre: Research Report > New Finding (0.93)

Industry:

Education (0.93)
Health & Medicine (0.93)
Media > News (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Karan, Aditya, Vincent, Nicholas, Karahalios, Karrie, Sundaram, Hari

Algorithmic Collective Action with Two Collectives

arXiv.org Artificial IntelligenceMay-13-2025

Given that data-dependent algorithmic systems have become impactful in more domains of life, the need for individuals to promote their own interests and hold algorithms accountable has grown. To have meaningful influence, individuals must band together to engage in collective action. Groups that engage in such algorithmic collective action are likely to vary in size, membership characteristics, and crucially, objectives. In this work, we introduce a first of a kind framework for studying collective action with two or more collectives that strategically behave to manipulate data-driven systems. With more than one collective acting on a system, unexpected interactions may occur. We use this framework to conduct experiments with language model-based classifiers and recommender systems where two collectives each attempt to achieve their own individual objectives. We examine how differing objectives, strategies, sizes, and homogeneity can impact a collective's efficacy. We find that the unintentional interactions between collectives can be quite significant; a collective acting in isolation may be able to achieve their objective (e.g., improve classification outcomes for themselves or promote a particular item), but when a second collective acts simultaneously, the efficacy of the first group drops by as much as $75\%$. We find that, in the recommender system context, neither fully heterogeneous nor fully homogeneous collectives stand out as most efficacious and that heterogeneity's impact is secondary compared to collective size. Our results signal the need for more transparency in both the underlying algorithmic models and the different behaviors individuals or collectives may take on these systems. This approach also allows collectives to hold algorithmic system developers accountable and provides a framework for people to actively use their own data to promote their own interests.

artificial intelligence, machine learning, natural language, (18 more...)

doi: 10.1145/3715275.3732098

2505.00195

Country: North America > United States > Minnesota (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (0.68)
Media (0.68)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

BBC NewsMay-12-2025, 12:15:41 GMT

Pope calls for journalists to be released from prison

Pope Leo, who was chosen as the new leader of the Catholic Church on Thursday, also highlighted the role journalists can play in bringing attention to injustice and poverty in the world. He urged the media to focus on reporting the truth instead of taking part in partisan divisions, and not to give space to "fanaticism and hatred." Speaking in the Vatican's Paul VI audience hall, he said "the way we communicate is of fundamental importance: we must say'no' to the war of words and images, we must reject the paradigm of war." "We do not need loud, forceful communication," he said, "but rather communication that is capable of listening and of gathering the voices of the weak who have no voice." The new pope also raised concerns about artificial intelligence, telling the assembled media they should use AI with "responsibility and discernment." Reporters should ensure that AI can be used for the "benefit of all of humanity," he said.

artificial intelligence, journalist, pope call, (2 more...)

BBC News

Country: Europe > Holy See (0.30)

Industry: Media > News (0.66)

Technology: Information Technology > Artificial Intelligence (1.00)

FOX NewsMay-12-2025, 11:29:29 GMT

Pope Leo dishes advice to journalists, mentions AI challenge in first news conference

OutKick writer Mary Katharine Ham and Democratic strategist Kevin Walling join'MediaBuzz' to discuss the election of Pope Leo XIV, the first American pope in history, and the U.S. trade deal with the U.K. Pope Leo XIV wrapped up his first meeting with Vatican-accredited journalists Monday morning. More than 1,000 members of the media were assembled to hear his remarks, according to the New York Times. Some of them even took their children. The gathering took place in the Vatican's Paul VI Hall, Vatican Media reported. There, the pontiff "thanked reporters in Italian for their tireless work over these intense few weeks."

artificial intelligence, journalist, pope leo dish advice, (8 more...)

FOX News

Country: Europe > Holy See (0.35)

Industry:

Government (1.00)
Media > News (0.99)

Technology: Information Technology > Artificial Intelligence > Challenges (0.40)

FOX NewsMay-12-2025, 06:00:54 GMT

Silicon, steel and megawatts: Can America create the infrastructure needed to win the AI race?

Fox News anchor Bret Baier has the latest on the Murdoch Children's Research Institute's partnership with the Gladstone Institutes for the'Decoding Broken Hearts' initiative on'Special Report.' This week's Senate hearing on U.S. competitiveness in artificial intelligence made it clear that we are not just in an AI race with China and the rest of the world. We are in a race to build the foundation of the 21st century global economy while strengthening our national security. That foundation is made of silicon, steel and megawatts. America's ability to lead in AI hinges on a simple but urgent question – can we build the computing infrastructure fast enough to unleash AI's full potential and drive a competitive advantage? The emerging AI cloud computing infrastructure is not like the general-purpose cloud that still powers most of the digital world.

artificial intelligence, infrastructure, steel and megawatt, (12 more...)

FOX News

Country:

Asia > China (0.26)
North America > United States > New Jersey (0.06)

Industry:

Government (1.00)
Media > News (0.74)

Technology: Information Technology > Artificial Intelligence (1.00)

Rahimzadeh, Vahid, Hamzehpour, Ali, Shakery, Azadeh, Asadpour, Masoud

From Millions of Tweets to Actionable Insights: Leveraging LLMs for User Profiling

Social media user profiling through content analysis is crucial for tasks like misinformation detection, engagement prediction, hate speech monitoring, and user behavior modeling. However, existing profiling techniques, including tweet summarization, attribute-based profiling, and latent representation learning, face significant limitations: they often lack transferability, produce non-interpretable features, require large labeled datasets, or rely on rigid predefined categories that limit adaptability. We introduce a novel large language model (LLM)-based approach that leverages domain-defining statements, which serve as key characteristics outlining the important pillars of a domain as foundations for profiling. Our two-stage method first employs semi-supervised filtering with a domain-specific knowledge base, then generates both abstractive (synthesized descriptions) and extractive (representative tweet selections) user profiles. By harnessing LLMs' inherent knowledge with minimal human validation, our approach is adaptable across domains while reducing the need for large labeled datasets. Our method generates interpretable natural language user profiles, condensing extensive user data into a scale that unlocks LLMs' reasoning and knowledge capabilities for downstream social network tasks. We contribute a Persian political Twitter (X) dataset and an LLM-based evaluation framework with human validation. Experimental results show our method significantly outperforms state-of-the-art LLM-based and traditional methods by 9.8%, demonstrating its effectiveness in creating flexible, adaptable, and interpretable user profiles.

large language model, machine learning, natural language, (19 more...)

2505.06184

Country:

Asia > Middle East > Iran (0.29)
Asia > Middle East > UAE (0.28)

Genre:

Research Report > Experimental Study (0.94)
Research Report > New Finding (0.88)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Government (1.00)
Information Technology > Services (0.67)
Media > News (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Dutt, Niladri Shekhar, Ceylan, Duygu, Mitra, Niloy J.

MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills

Retouching is an essential task in post-manipulation of raw photographs. Generative editing, guided by text or strokes, provides a new tool accessible to users but can easily change the identity of the original objects in unacceptable and unpredictable ways. In contrast, although traditional procedural edits, as commonly supported by photoediting tools (e.g., Gimp, Lightroom), are conservative, they are still preferred by professionals. Unfortunately, professional quality retouching involves many individual procedural editing operations that is challenging to plan for most novices. In this paper, we ask if a multimodal large language model (MLLM) can be taught to critique raw photographs, suggest suitable remedies, and finally realize them with a given set of pre-authored procedural image operations. We demonstrate that MLLMs can be first made aware of the underlying image processing operations, by training them to solve specially designed visual puzzles. Subsequently, such an operation-aware MLLM can both plan and propose edit sequences. To facilitate training, given a set of expert-edited photos, we synthesize a reasoning dataset by procedurally manipulating the expert edits and then grounding a pretrained LLM on the visual adjustments, to synthesize reasoning for finetuning. The proposed retouching operations are, by construction, understandable by the users, preserve object details and resolution, and can be optionally overridden. We evaluate our setup on a variety of test examples and show advantages, in terms of explainability and identity preservation, over existing generative and other procedural alternatives. Code, data, models, and supplementary results can be found via our project website at https://monetgpt.github.io.

large language model, machine learning, natural language, (20 more...)

doi: 10.1145/3730926

2505.06176

Country: Europe > United Kingdom (0.28)

Genre: Research Report (0.83)

Industry: Media (0.48)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Plachouras, Christos, Benetos, Emmanouil, Pauwels, Johan

Learning Music Audio Representations With Limited Data

--Large deep-learning models for music, including those focused on learning general-purpose music audio representations, are often assumed to require substantial training data to achieve high performance. If true, this would pose challenges in scenarios where audio data or annotations are scarce, such as for underrepresented music traditions, non-popular genres, and personalized music creation and listening. Understanding how these models behave in limited-data scenarios could be crucial for developing techniques to tackle them. In this work, we investigate the behavior of several music audio representation models under limited-data learning regimes. We consider music models with various architectures, training paradigms, and input durations, and train them on data collections ranging from 5 to 8,000 minutes long. We evaluate the learned representations on various music information retrieval tasks and analyze their robustness to noise. We show that, under certain conditions, representations from limited-data and even random models perform comparably to ones from large-dataset models, though handcrafted features outperform all learned representations in some tasks. Large models tackling audio-based Music Information Retrieval (MIR) tasks are commonly thought to require substantial amounts of training data [1], [2].

artificial intelligence, machine learning, natural language, (14 more...)

doi: 10.1109/ICASSP49660.2025.10887766

2505.06042

Country:

Asia (0.93)
North America > United States (0.47)
Europe > Netherlands (0.28)
(3 more...)

Genre: Research Report (0.64)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Eshuijs, Leon, Wang, Shihan, Fokkens, Antske

Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification

Reliance on spurious correlations (shortcuts) has been shown to underlie many of the successes of language models. Previous work focused on identifying the input elements that impact prediction. We investigate how shortcuts are actually processed within the model's decision-making mechanism. We use actor names in movie reviews as controllable shortcuts with known impact on the outcome. We use mechanistic interpretability methods and identify specific attention heads that focus on shortcuts. These heads gear the model towards a label before processing the complete input, effectively making premature decisions that bypass contextual analysis. Based on these findings, we introduce Head-based Token Attribution (HTA), which traces intermediate decisions back to input tokens. We show that HTA is effective in detecting shortcuts in LLMs and enables targeted mitigation by selectively deactivating shortcut-related attention heads.

large language model, machine learning, natural language, (20 more...)

2505.06032

Country:

Europe (0.93)
North America > United States (0.93)

Genre: Research Report > New Finding (1.00)

Industry:

Media > Film (0.49)
Leisure & Entertainment (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Bartl, Stephan, Innerebner, Kevin, Lex, Elisabeth

Differentiable Fuzzy Neural Networks for Recommender Systems

As recommender systems become increasingly complex, transparency is essential to increase user trust, accountability, and regulatory compliance. Neuro-symbolic approaches that integrate symbolic reasoning with sub-symbolic learning offer a promising approach toward transparent and user-centric systems. In this work-in-progress, we investigate using fuzzy neural networks (FNNs) as a neuro-symbolic approach for recommendations that learn logic-based rules over predefined, human-readable atoms. Each rule corresponds to a fuzzy logic expression, making the recommender's decision process inherently transparent. In contrast to black-box machine learning methods, our approach reveals the reasoning behind a recommendation while maintaining competitive performance. We evaluate our method on a synthetic and MovieLens 1M datasets and compare it to state-of-the-art recommendation algorithms. Our results demonstrate that our approach accurately captures user behavior while providing a transparent decision-making process. Finally, the differentiable nature of this approach facilitates an integration with other neural models, enabling the development of hybrid, transparent recommender systems.

artificial intelligence, fuzzy logic, machine learning, (15 more...)

doi: 10.1145/3708319.3734174

2505.06

Country:

Europe (0.94)
North America > United States > New York > New York County > New York City (0.15)

Genre: Research Report > New Finding (0.86)

Industry:

Media > Film (0.47)
Leisure & Entertainment (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.90)