AITopics

2505.15299

Country:

Europe (0.93)
North America > United States > California (0.28)

Genre: Research Report > New Finding (1.00)

Industry: Media (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

The Guardian, Jun-3-2025, 14:00:38 GMT

AI, bot farms and innocent indie victims: how music streaming became a hotbed of fraud and fakery

There is a battle gripping the music business today around the manipulation of streaming services – and innocent indie artists are the collateral damage. Fraudsters are flooding Spotify, Apple Music and the rest with AI-generated tracks, to try and hoover up the royalties generated by people listening to them. These tracks are cheap, quick and easy to make, with Deezer estimating in April that over 20,000 fully AI-created tracks – that's 18% of new tracks – were being ingested into its platform daily, almost double the number in January. The fraudsters often then use bots, AI or humans to endlessly listen to these fake songs and generate revenue, while others are exploiting upload services to get fake songs put on real artists' pages and siphon off royalties that way. Spotify fines the worst offenders and says it puts "significant engineering resources and research into detecting, mitigating, and removing artificial streaming activity", while Apple Music claims "less than 1% of all streams are manipulated" on its service.

artificial intelligence, artist, music, (14 more...)

The Guardian

Country:

Europe (0.50)
Asia (0.30)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence (0.70)

The Guardian, Jun-3-2025, 13:21:47 GMT

Will AI wipe out the first rung of the career ladder?

This week, I'm wondering what my first jobs in journalism would have been like had generative AI been around. In other news: Elon Musk leaves a trail of chaos, and influencers are selling the text they fed to AI to make art. Generative artificial intelligence may eliminate the job you got with your diploma still in hand, say executives who offered grim assessments of the entry-level job market last week in multiple forums. Dario Amodei, CEO of Anthropic, which makes the multifunctional AI model Claude, told Axios last week that he believes that AI could cut half of all entry-level white-collar jobs and send overall unemployment rocketing to 20% within the next five years. One explanation why an AI company CEO might make such a dire prediction is to hype the capabilities of his product.

amodei, machine learning, natural language, (18 more...)

The Guardian

Country: North America > United States (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance > Economy (1.00)
Media > News (0.90)
Information Technology > Services (0.70)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.69)

TIME - Tech, Jun-3-2025, 11:00:00 GMT

Google's New AI Tool Generates Convincing Deepfakes of Riots, Conflict, and Election Fraud

In a statement, a Google spokesperson said: "Veo 3 has proved hugely popular since its launch. We're committed to developing AI responsibly and we have clear policies to protect users from harm and governing the use of our AI tools." Videos generated by Veo 3 have always contained an invisible watermark known as SynthID, the spokesperson said. Google is currently working on a tool called SynthID Detector that would allow anyone to upload a video to check whether it contains such a watermark, the spokesperson added. However, this tool is not yet publicly available.

ai tool generate convincing deepfake, artificial intelligence, machine learning, (8 more...)

TIME - Tech

Country: North America > United States (0.42)

Industry:

Media > News (0.40)
Information Technology > Security & Privacy (0.40)
Government > Voting & Elections (0.40)

Technology:

Information Technology > Artificial Intelligence > Vision (0.59)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.59)

The Guardian, Jun-3-2025, 04:00:21 GMT

'Nobody wants a robot to read them a story!' The creatives and academics rejecting AI – at work and at home

The novelist Ewan Morrison was alarmed, though amused, to discover he had written a book called Nine Inches Pleases a Lady. Intrigued by the limits of generative artificial intelligence (AI), he had asked ChatGPT to give him the names of the 12 novels he had written. "I've only written nine," he says. "Always eager to please, it decided to invent three." The "nine inches" from the fake title it hallucinated was stolen from a filthy Robert Burns poem.

large language model, machine learning, natural language, (19 more...)

The Guardian

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.51)

CiteEval: Principle-Driven Citation Evaluation for Source Attribution

Xu, Yumo, Qi, Peng, Chen, Jifan, Liu, Kunlun, Han, Rujun, Liu, Lan, Min, Bonan, Castelli, Vittorio, Gupta, Arshit, Wang, Zhiguo

Citation quality is crucial in information-seeking systems, directly influencing trust and the effectiveness of information access. Current evaluation frameworks, both human and automatic, mainly rely on Natural Language Inference (NLI) to assess binary or ternary supportiveness from cited sources, which we argue is a suboptimal proxy for citation evaluation. In this work we introduce CiteEval, a citation evaluation framework driven by principles focusing on fine-grained citation assessment within a broad context, encompassing not only the cited sources but the full retrieval context, user query, and generated text. Guided by the proposed framework, we construct CiteBench, a multi-domain benchmark with high-quality human annotations on citation quality. To enable efficient evaluation, we further develop CiteEval-Auto, a suite of model-based metrics that exhibit strong correlation with human judgments. Experiments across diverse systems demonstrate CiteEval-Auto's superior ability to capture the multifaceted nature of citations compared to existing metrics, offering a principled and scalable approach to evaluate and improve model-generated citations.

large language model, machine learning, natural language, (21 more...)

2506.01829

Country:

North America > United States (1.00)
Asia (1.00)

Genre: Research Report (0.40)

Industry: Media (0.56)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.98)
Information Technology > Information Management (0.86)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Reconsidering LLM Uncertainty Estimation Methods in the Wild

Bakman, Yavuz, Yaldiz, Duygu Nur, Kang, Sungmin, Zhang, Tuo, Buyukates, Baturalp, Avestimehr, Salman, Karimireddy, Sai Praneeth

Large Language Model (LLM) Uncertainty Estimation (UE) methods have become a crucial tool for detecting hallucinations in recent years. While numerous UE methods have been proposed, most existing studies evaluate them in isolated short-form QA settings using threshold-independent metrics such as AUROC or PRR. However, real-world deployment of UE methods introduces several challenges. In this work, we systematically examine four key aspects of deploying UE methods in practical settings. Specifically, we assess (1) the sensitivity of UE methods to decision threshold selection, (2) their robustness to query transformations such as typos, adversarial prompts, and prior chat history, (3) their applicability to long-form generation, and (4) strategies for handling multiple UE scores for a single query. Our evaluations on 19 UE methods reveal that most of them are highly sensitive to threshold selection when there is a distribution shift in the calibration dataset. While these methods generally exhibit robustness against previous chat history and typos, they are significantly vulnerable to adversarial prompts. Additionally, while existing UE methods can be adapted for long-form generation through various strategies, there remains considerable room for improvement. Lastly, ensembling multiple UE scores at test time provides a notable performance boost, which highlights its potential as a practical improvement strategy. Code is available at: https://github.com/duygunuryldz/uncertainty_in_the_wild.
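To make the threshold-independent evaluation and the test-time ensembling mentioned in the abstract concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not say how scores are combined, so the averaging of min-max-normalised scores is an assumption, as are the function names `auroc`, `min_max`, and `ensemble` and the toy data. Labels use 1 for a hallucinated answer and 0 for a correct one, so a useful uncertainty score should be higher for label 1.

```python
from typing import List, Sequence


def auroc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Threshold-independent AUROC computed by pairwise comparison.

    Returns the fraction of (hallucinated, correct) pairs in which the
    hallucinated answer received the higher uncertainty score; ties
    count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one example of each class")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def min_max(scores: Sequence[float]) -> List[float]:
    """Rescale one method's scores to [0, 1] so methods are comparable."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def ensemble(score_lists: Sequence[Sequence[float]]) -> List[float]:
    """Assumed ensembling rule: average the normalised scores per query."""
    normed = [min_max(s) for s in score_lists]
    return [sum(col) / len(col) for col in zip(*normed)]


if __name__ == "__main__":
    # Toy data (hypothetical): six queries scored by two UE methods.
    labels = [1, 0, 1, 0, 0, 1]
    method_a = [0.9, 0.2, 0.4, 0.5, 0.1, 0.8]
    method_b = [3.0, 1.0, 4.0, 2.5, 3.5, 2.0]
    for name, s in [("A", method_a), ("B", method_b),
                    ("ensemble", ensemble([method_a, method_b]))]:
        print(name, round(auroc(s, labels), 3))
```

Normalising before averaging keeps a method with a large numeric range from dominating the combined score; whether an ensemble actually helps depends on the methods and data, which is what the paper evaluates.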

large language model, machine learning, natural language, (17 more...)

2506.01114

Country:

Asia (1.00)
North America > United States > New Jersey (0.28)

Genre: Research Report > New Finding (0.68)

Industry:

Media (1.00)
Leisure & Entertainment > Games > Computer Games (1.00)
Education (0.92)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)

An evaluation of LLMs for generating movie reviews: GPT-4o, Gemini-2.0 and DeepSeek-V3

Sands, Brendan, Wang, Yining, Xu, Chenhao, Zhou, Yuxuan, Wei, Lai, Chandra, Rohitash

Large language models (LLMs) have been prominent in various tasks, including text generation and summarisation. The applicability of LLMs to the generation of product reviews is gaining momentum, paving the way for the generation of movie reviews. In this study, we propose a framework that generates movie reviews using three LLMs (GPT-4o, DeepSeek-V3, and Gemini-2.0), and evaluate their performance by comparing the generated outputs with IMDb user reviews. We use movie subtitles and screenplays as input to the LLMs and investigate how they affect the quality of reviews generated. We assess the LLM-generated movie reviews in terms of vocabulary, sentiment polarity, similarity, and thematic consistency in comparison to IMDb user reviews. The results demonstrate that LLMs are capable of generating syntactically fluent and structurally complete movie reviews. Nevertheless, there is still a noticeable gap in emotional richness and stylistic coherence between LLM-generated and IMDb reviews, suggesting that further refinement is needed to improve the overall quality of movie review generation. We also conducted a survey-based analysis in which participants were asked to distinguish between LLM-generated and IMDb user reviews. The results show that LLM-generated reviews are difficult to distinguish from IMDb user reviews. We found that DeepSeek-V3 produced the most balanced reviews, closely matching IMDb reviews. GPT-4o overemphasised positive emotions, while Gemini-2.0 captured negative emotions better but showed excessive emotional intensity.

large language model, machine learning, natural language, (16 more...)

2506.00312

Country:

Europe (0.68)
North America > United States (0.46)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

SwitchLingua: The First Large-Scale Multilingual and Multi-Ethnic Code-Switching Dataset

Xie, Peng, Liu, Xingyuan, Chan, Tsz Wai, Bie, Yequan, Song, Yangqiu, Wang, Yang, Chen, Hao, Chen, Kani

Code-switching (CS) is the alternating use of two or more languages within a conversation or utterance, often influenced by social context and speaker identity. This linguistic phenomenon poses challenges for Automatic Speech Recognition (ASR) systems, which are typically designed for a single language and struggle to handle multilingual inputs. The growing global demand for multilingual applications, including Code-Switching ASR (CSASR), Text-to-Speech (CSTTS), and Cross-Lingual Information Retrieval (CLIR), highlights the inadequacy of existing monolingual datasets. Although some code-switching datasets exist, most are limited to bilingual mixing within homogeneous ethnic groups, leaving a critical need for a large-scale, diverse benchmark akin to ImageNet in computer vision. To bridge this gap, we introduce LinguaMaster, a multi-agent collaboration framework specifically designed for efficient and scalable multilingual data synthesis. Leveraging this framework, we curate SwitchLingua, the first large-scale multilingual and multi-ethnic code-switching dataset, including: (1) 420K CS textual samples across 12 languages, and (2) over 80 hours of audio recordings from 174 speakers representing 18 countries/regions and 63 racial/ethnic backgrounds, based on the textual data. This dataset captures rich linguistic and cultural diversity, offering a foundational resource for advancing multilingual and multicultural research. Furthermore, to address the issue that existing ASR evaluation metrics lack sensitivity to code-switching scenarios, we propose the Semantic-Aware Error Rate (SAER), a novel evaluation metric that incorporates semantic information, providing a more accurate and context-aware assessment of system performance.

artificial intelligence, machine learning, natural language, (18 more...)

2506.00087

Country:

Asia (1.00)
Europe (0.93)

Genre: Research Report (0.50)

Industry:

Media (0.34)
Leisure & Entertainment (0.34)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.89)

Optimizing Storytelling, Improving Audience Retention, and Reducing Waste in the Entertainment Industry

Cornfeld, Andrew, Miller, Ashley, Mora-Figueroa, Mercedes, Samuels, Kurt, Palomba, Anthony

Television networks face high financial risk when making programming decisions, often relying on limited historical data to forecast episodic viewership. This study introduces a machine learning framework that integrates natural language processing (NLP) features from over 25,000 television episodes with traditional viewership data to enhance predictive accuracy. By extracting emotional tone, cognitive complexity, and narrative structure from episode dialogue, we evaluate forecasting performance using SARIMAX, rolling XGBoost, and feature selection models. While prior viewership remains a strong baseline predictor, NLP features contribute meaningful improvements for some series. We also introduce a similarity scoring method based on Euclidean distance between aggregate dialogue vectors to compare shows by content. Tested across diverse genres, including Better Call Saul and Abbott Elementary, our framework reveals genre-specific performance and offers interpretable metrics for writers, executives, and marketers seeking data-driven insight into audience behavior.
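The similarity scoring idea in the abstract (Euclidean distance between aggregate dialogue vectors) might look roughly like the following Python sketch. The abstract does not specify how episode vectors are aggregated or how distance is turned into a score, so the per-show mean, the `1 / (1 + d)` mapping, the function names, and the three-feature toy vectors are all assumptions made for illustration.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def aggregate(episodes: Sequence[Vector]) -> List[float]:
    """Collapse a show's per-episode feature vectors into one mean vector.

    Averaging is an assumed aggregation rule, not taken from the paper.
    """
    if not episodes:
        raise ValueError("a show needs at least one episode vector")
    n = len(episodes)
    return [sum(col) / n for col in zip(*episodes)]


def distance(show_a: Sequence[Vector], show_b: Sequence[Vector]) -> float:
    """Euclidean distance between the two shows' aggregate vectors."""
    return math.dist(aggregate(show_a), aggregate(show_b))


def similarity(show_a: Sequence[Vector], show_b: Sequence[Vector]) -> float:
    """Map distance to (0, 1]; identical aggregates give 1.0 (assumed mapping)."""
    return 1.0 / (1.0 + distance(show_a, show_b))


if __name__ == "__main__":
    # Hypothetical features per episode: (emotional tone, complexity, pacing).
    drama = [(0.2, 0.8, 0.4), (0.3, 0.7, 0.5)]
    sitcom = [(0.8, 0.3, 0.9), (0.7, 0.4, 0.8)]
    print(round(distance(drama, sitcom), 3))
    print(round(similarity(drama, sitcom), 3))
```

Because Euclidean distance is sensitive to feature scale, features measured in different units would normally be standardised before aggregation; that step is omitted here for brevity.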

machine learning, natural language, viewership, (16 more...)

2506.00076

Country: North America > United States > Virginia > Albemarle County > Charlottesville (0.15)

Genre:

Research Report > New Finding (0.47)
Research Report > Promising Solution (0.46)

Industry:

Media > Television (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)