AITopics

Social simulation is transforming traditional social science research by modeling human behavior through interactions between virtual individuals and their environments. With recent advances in large language models (LLMs), this approach has shown growing potential in capturing individual differences and predicting group behaviors. However, existing methods face alignment challenges related to the environment, target users, interaction mechanisms, and behavioral patterns. To this end, we introduce SocioVerse, an LLM-agent-driven world model for social simulation. Our framework features four powerful alignment components and a user pool of 10 million real individuals. To validate its effectiveness, we conducted large-scale simulation experiments across three distinct domains: politics, news, and economics. Results demonstrate that SocioVerse can reflect large-scale population dynamics while ensuring diversity, credibility, and representativeness through standardized procedures and minimal manual adjustments.

large language model, machine learning, simulation, (16 more...)

2504.10157

Country:

North America > United States (1.00)
Asia (1.00)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report > New Finding (0.66)

Industry:

Media > News (1.00)
Law > Statutes (1.00)
Health & Medicine (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.91)

COLIBRI Fuzzy Model: Color Linguistic-Based Representation and Interpretation

Shamoi, Pakizar, Toganas, Nuray, Muratbekova, Muragul, Kadyrgali, Elnara, Yerkin, Adilet, Igali, Ayan, Ziyada, Malika, Adilova, Ayana, Karatayev, Aron, Torekhan, Yerdauit

Colors are omnipresent in today's world and play a vital role in how humans perceive and interact with their surroundings. However, it is challenging for computers to imitate human color perception. This paper introduces the Human Perception-Based Fuzzy Color Model, COLIBRI (Color Linguistic-Based Representation and Interpretation), designed to bridge the gap between computational color representations and human visual perception. The proposed model uses fuzzy sets and logic to create a framework for color categorization. Using a three-phase experimental approach, the study first identifies distinguishable color stimuli for hue, saturation, and intensity through preliminary experiments, followed by a large-scale human categorization survey involving more than 1000 human subjects. The resulting data are used to extract fuzzy partitions and generate membership functions that reflect real-world perceptual uncertainty. The model incorporates a mechanism for adaptation that allows refinement based on feedback and contextual changes. Comparative evaluations demonstrate the model's alignment with human perception compared to traditional color models, such as RGB, HSV, and LAB. To the best of our knowledge, no previous research has documented the construction of a model for color attribute specification based on a sample of this size or a comparable sample of the human population (n = 2496). Our findings are significant for fields such as design, artificial intelligence, marketing, and human-computer interaction, where perceptually relevant color representation is critical.

artificial intelligence, data mining, machine learning, (21 more...)

2507.11488

Country: North America > United States (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Media (1.00)
Health & Medicine > Therapeutic Area (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Human Computer Interaction (1.00)
Information Technology > Data Science > Data Mining (1.00)
(3 more...)

Sioros, Vassilis, Potamianos, Alexandros, Paraskevopoulos, Giorgos

EditGen: Harnessing Cross-Attention Control for Instruction-Based Auto-Regressive Audio Editing

In this study, we investigate leveraging cross-attention control for efficient audio editing within auto-regressive models. Inspired by image editing methodologies, we develop a Prompt-to-Prompt-like approach that guides edits through cross and self-attention mechanisms. Integrating a diffusion-based strategy, influenced by Auffusion, we extend the model's functionality to support refinement edits, establishing a baseline for prompt-guided audio editing. Additionally, we introduce an alternative approach by incorporating MUSICGEN, a pre-trained frozen auto-regressive model, and propose three editing mechanisms, based on Replacement, Reweighting, and Refinement of the attention scores. We employ commonly-used music-specific evaluation metrics and a human study, to gauge time-varying controllability, adherence to global text cues, and overall audio realism. The automatic and human evaluations indicate that the proposed combination of prompt-to-prompt guidance with autoregressive generation models significantly outperforms the diffusion-based baseline in terms of melody, dynamics, and tempo of the generated audio. Our code is available at https://github.com/billsioros/EditGen

diffusion model, large language model, machine learning, (18 more...)

2507.11096

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.48)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Tian, Lin, Trippas, Johanne R., Rizoiu, Marian-Andrei

Mario at EXIST 2025: A Simple Gateway to Effective Multilingual Sexism Detection

This paper presents our approach to EXIST 2025 Task 1, addressing text-based sexism detection in English and Spanish tweets through hierarchical Low-Rank Adaptation (LoRA) of Llama 3.1 8B. Our method introduces conditional adapter routing that explicitly models label dependencies across three hierarchically structured subtasks: binary sexism identification, source intention detection, and multilabel sexism categorization. Unlike conventional LoRA applications that target only attention layers, we apply adaptation to all linear transformations, enhancing the model's capacity to capture task-specific patterns. In contrast to complex data processing and ensemble approaches, we show that straightforward parameter-efficient fine-tuning achieves strong performance. We train separate LoRA adapters (rank=16, QLoRA 4-bit) for each subtask using unified multilingual training that leverages Llama 3.1's native bilingual capabilities. The method requires minimal preprocessing and uses standard supervised learning. Our multilingual training strategy eliminates the need for separate language-specific models, achieving 1.7-2.4\% F1 improvements through cross-lingual transfer. With only 1.67\% trainable parameters compared to full fine-tuning, our approach reduces training time by 75\% and model storage by 98\%, while achieving competitive performance across all subtasks (ICM-Hard: 0.6774 for binary classification, 0.4991 for intention detection, 0.6519 for multilabel categorization).

large language model, machine learning, natural language, (21 more...)

2507.10996

Country:

Oceania > Australia (0.28)
Europe > Spain (0.28)

Genre: Research Report (0.82)

Industry:

Information Technology (0.67)
Health & Medicine (0.47)
Law (0.46)
Media > News (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Kanani, Maziar, Leary, Sean O, McDermott, James

Parsing Musical Structure to Enable Meaningful Variations

This paper presents a novel rule-based approach for generating music by varying existing tunes. We parse each tune to find the Pathway Assembly (PA) [ 1], that is a structure representing all repetitions in the tune. The Sequitur algorithm [2 ] is used for this. The result is a grammar. We then carry out mutation on the grammar, rather than on a tune directly. There are potentially 19 types of mutations such as adding, removing, swapping or reversing parts of the grammar that can be applied to the grammars. The system employs one of the mutations randomly in this step to automatically manipulate the grammar. Following the mutation, we need to expand the grammar which returns a new tune. The output after 1 or more mutations will be a new tune related to the original tune. Our study examines how tunes change gradually over the course of multiple mutations. Edit distances, structural complexity and length of the tunes are used to show how a tune is changed after multiple mutations. In addition, the size of effect of each mutation type is analyzed. As a final point, we review the musical aspect of the output tunes. It should be noted that the study only focused on generating new pitch sequences. The study is based on an Irish traditional tune dataset and a list of integers has been used to represent each tune's pitch values.

artificial intelligence, machine learning, natural language, (18 more...)

2507.1074

Country: Europe > Ireland (0.28)

Genre: Research Report (1.00)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.87)

Al JazeeraJul-15-2025, 23:12:56 GMT

AI and disinformation fuel political rivalries in the Philippines

Manila, Philippines – When former Philippines President Rodrigo Duterte was arrested by the International Criminal Court (ICC) in March, Sheerah Escuerdo spoke to a local television station, welcoming the politician's detention on charges of murder linked to his war on drugs. Escuerdo, who lost her 18-year-old brother, Ephraim, to Duterte's war, clutched a portrait of her sibling during the interview with News 5 Everywhere as she demanded justice for his killing. Days later, she was shocked to find an AI-generated video of her slain brother circulating on Facebook, in which he said he was alive and accused his sister of lying. Are they paying you to do this?" the computer-generated image of Ephraim said. The video, posted online by a pro-Duterte influencer with 11,000 followers, immediately drew thousands of views on Facebook. One of the comments read, "Fake drug war victims". It was Escudero and her brother's image from her News 5 Everywhere interview that the influencer had used to ...

artificial intelligence, disinformation, social media, (13 more...)

Al Jazeera

Country: Asia > Philippines > Luzon > National Capital Region > City of Manila (0.56)

Industry:

Government > Regional Government > Asia Government > Philippines Government (0.70)
Media > News (0.62)
Law > Criminal Law (0.55)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.55)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.30)

SlateJul-15-2025, 15:00:00 GMT

What It's Like to Be a Student Who Hates ChatGPT

Sign up for the Slatest to get the most insightful analysis, criticism, and advice out there, delivered to your inbox daily. As a classically trained singer preparing for a professional career, Erin Perry can see quite clearly how artificial intelligence is upending her field--all the way down to the classroom. Perry just completed her first year as a graduate student in voice performance at the Peabody Institute, the prestigious music conservatory run by Johns Hopkins University. It's been rewarding so far: She's been learning how to navigate the modern classical music sector and confronting the relevant impacts of generative A.I., having taken on a project to study the major record labels' lawsuit against the Amazon-backed A.I. startup Anthropic, which trained its models on songwriters' lyrics sans permission or compensation. Understandably, Perry's rather skeptical of A.I.'s artistic applications, and fearful of the sweeping effects it could have on her chosen field, especially as generative-music startups like Suno and Udio are programmed to replicate specific artists and musical styles.

large language model, machine learning, natural language, (20 more...)

Slate

Country: North America > United States > California > Los Angeles County > Los Angeles (0.14)

Industry:

Media > Music (1.00)
Leisure & Entertainment (0.89)
Education > Educational Setting > Higher Education (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.86)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.37)

Chandra, Rohitash, Choi, Jiyong

Abusive text transformation using LLMs

arXiv.org Artificial IntelligenceJul-15-2025

Although Large Language Models (LLMs) have demonstrated significant advancements in natural language processing tasks, their effectiveness in the classification and transformation of abusive text into non-abusive versions remains an area for exploration. In this study, we aim to use LLMs to transform abusive text (tweets and reviews) featuring hate speech and swear words into non-abusive text, while retaining the intent of the text. We evaluate the performance of two state-of-the-art LLMs, such as Gemini, GPT-4o, DeekSeek and Groq, on their ability to identify abusive text. We them to transform and obtain a text that is clean from abusive and inappropriate content but maintains a similar level of sentiment and semantics, i.e. the transformed text needs to maintain its message. Afterwards, we evaluate the raw and transformed datasets with sentiment analysis and semantic analysis. Our results show Groq provides vastly different results when compared with other LLMs. We have identified similarities between GPT-4o and DeepSeek-V3.

large language model, machine learning, natural language, (20 more...)

2507.10177

Country: North America > United States (1.00)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (1.00)
Media (0.93)
Law > Civil Rights & Constitutional Law (0.68)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceJul-15-2025

ODIA: Oriented Distillation for Inline Acceleration of LLM-based Function Calling

Zhang, Hanlong, Yang, Jingsheng, Li, Hao, He, Yuhao, Gong, Franck

Function Calling is a crucial technique that enables Large Language Models (LLMs) to interact with external systems through APIs. However, the high latency associated with LLM-based Function Calling significantly impacts user experience. This paper presents a novel approach called Oriented Distillation for Inline Acceleration (ODIA) that leverages online user interaction data to accelerate Function Calling. By automatically identifying "simple queries" from production traffic and distilling knowledge from larger models to smaller ones, our method reduces response latency by 45% (expected) and 78% (median) while maintaining accuracy. We demonstrate the effectiveness of our approach through real-world deployment in a music application, where the smaller model successfully handles 60% of traffic with negligible accuracy loss. Our method requires minimal human intervention and continuously improves through automated data collection and model updating, making it a practical solution for production environments.

artificial intelligence, large language model, natural language, (17 more...)

2507.08877

Genre: Research Report (0.70)

Industry:

Media (0.47)
Leisure & Entertainment (0.47)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

arXiv.org Artificial IntelligenceJul-15-2025

GR-LLMs: Recent Advances in Generative Recommendation Based on Large Language Models

Yang, Zhen, Lin, Haitao, xue, Jiawei, Zhang, Ziji

In the past year, Generative Recommendations (GRs) have undergone substantial advancements, especially in leveraging the powerful sequence modeling and reasoning capabilities of Large Language Models (LLMs) to enhance overall recommendation performance. LLM-based GRs are forming a new paradigm that is distinctly different from discriminative recommendations, showing strong potential to replace traditional recommendation systems heavily dependent on complex hand-crafted features. In this paper, we provide a comprehensive survey aimed at facilitating further research of LLM-based GRs. Initially, we outline the general preliminaries and application cases of LLM-based GRs. Subsequently, we introduce the main considerations when LLM-based GRs are applied in real industrial scenarios. Finally, we explore promising directions for LLM-based GRs. We hope that this survey contributes to the ongoing advancement of the GR domain.

large language model, machine learning, natural language, (18 more...)

2507.06507

Country:

North America > Mexico (0.28)
Asia (0.28)

Genre: Overview (1.00)

Industry:

Information Technology (0.46)
Media > Music (0.46)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)