AITopics

doi: 10.1109/ACCESS.2025.3561857

2505.09091

Country:

North America > United States (1.00)
Europe (1.00)
North America > Canada (0.93)
Asia > China > Zhejiang Province (0.29)

Genre: Research Report > New Finding (0.66)

Industry: Media > Music (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Daily Mail - Science & techMay-14-2025, 18:00:46 GMT

Scientists confirm woke change made to Barbie over the course of 35 years - so did you notice it?

Barbie is one of the most successful children's toys in history, spawning a multimedia franchise that includes merchandise, video games and a live-action film. Since US toy giant Mattel launched the original Barbie in 1959, more than 1 billion of the dolls have been sold worldwide. Certainly, Barbie's looks have been tweaked over the years to reflect changing beauty ideals and societal shifts. But according to a new study, one subtle change to Barbie has gone largely unnoticed – until now. Scientists in Australia have found that Barbies today have flatter feet than they did in past decades.

artificial intelligence, barbie, flat feet, (15 more...)

Daily Mail - Science & tech

Country:

Oceania > Australia (0.25)
North America > United States > California > Alameda County > San Leandro (0.05)

Genre: Research Report > New Finding (0.69)

Industry:

Leisure & Entertainment (1.00)
Media > Film (0.52)
Government > Regional Government (0.49)
Health & Medicine > Therapeutic Area (0.32)

Technology: Information Technology > Artificial Intelligence (0.51)

New ScientistMay-14-2025, 18:00:00 GMT

Who needs Eurovision when we have the Dance Your PhD contest?

Feedback is New Scientist's popular sideways look at the latest science and technology news. You can submit items you believe may amuse readers to Feedback by emailing feedback@newscientist.com Saturday 17 May will see the final of this year's Eurovision Song Contest, which will be the most over-the-top evening of television since, well, the previous Eurovision. Feedback is deeply relieved that Feedback Jr appears not to be interested this year, so we might escape having to sit up and watch the entire thing. While we are deeply supportive of the contest's kind and welcoming vibe, most of the songs make our ears bleed.

artificial intelligence, eurovision, phd contest, (8 more...)

New Scientist

Country:

Oceania > New Zealand (0.06)
North America > United States > New York (0.05)
Europe > Finland > Uusimaa > Helsinki (0.05)
Asia (0.05)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence (0.50)

PCWorldMay-14-2025, 17:00:00 GMT

New Winxvideo AI – One-stop Video/Image Enhancer & Toolkit

We seem to have more video footage and still images than ever before, thanks to smartphones, GoPro cameras and the backlog of older ones collected across a lifetime. Managing all these formats, as well as making sure they look their best, can be a frightening proposition. Thankfully, Winxvideo AI is a powerful all-in-one solution that not only uses advanced Artificial Intelligence software to upgrade the quality of your content but can rescue old photos and footage too. The newly updated version 4.0 also brings huge improvements to speed, plus a special price offer, so you can save both time and money while you upgrade your photo and video library. Winxvideo AI comes with an impressive array of features that can turn tired, old, blurry videos into something far more professional.

artificial intelligence, video image enhancer & toolkit, winxvideo ai, (11 more...)

PCWorld

Industry: Media > Photography (0.72)

Technology: Information Technology > Artificial Intelligence (1.00)

FOX NewsMay-14-2025, 10:00:03 GMT

Robot with animated face is here to make customer service better

Mirokaï is designed to be helpful, engaging and enchanting. Have you ever wished robots could do more than just follow instructions? Born from the creative minds at Paris-based startup Enchanted Tools, Mirokaï isn't just another humanoid robot. It's designed to be helpful, engaging and, honestly, a bit enchanting. With its blend of advanced artificial intelligence, storytelling and a dash of charm, Mirokaï turns ordinary moments into something a little more memorable.

artificial intelligence, humanoid robot, robot, (10 more...)

FOX News

Industry:

Health & Medicine > Health Care Providers & Services (0.32)
Media > News (0.31)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Peng, Letian, Shang, Jingbo

Codifying Character Logic in Role-Playing

This paper introduces Codified Profiles for role-playing, a novel approach that represents character logic as structured, executable functions for behavioral decision-making. Each profile defines a set of functions parse_by_scene(scene) that outputs a list of logic-grounded assertions triggered_statements, using both explicit control structures (e.g., if-then-else) and condition checks like check_condition(scene, question), where each question is a semantically meaningful prompt about the scene (e.g., "Is the character in danger?") discriminated by the role-playing LLM as true, false, or unknown. This explicit representation offers three key advantages over traditional prompt-based profiles, which append character descriptions directly into text prompts: (1) Persistence, by enforcing complete and consistent execution of character logic, rather than relying on the model's implicit reasoning; (2) Updatability, through systematic inspection and revision of behavioral logic, which is difficult to track or debug in prompt-only approaches; (3) Controllable Randomness, by supporting stochastic behavior directly within the logic, enabling fine-grained variability that prompting alone struggles to achieve. To validate these advantages, we introduce a new benchmark constructed from 83 characters and 5,141 scenes curated from Fandom, using NLI-based scoring to compare character responses against ground-truth actions. Our experiments demonstrate the significant benefits of codified profiles in improving persistence, updatability, and behavioral diversity. Notably, by offloading a significant portion of reasoning to preprocessing, codified profiles enable even 1B-parameter models to perform high-quality role-playing, providing a scalable and efficient foundation for local deployment of role-play agents.

large language model, machine learning, natural language, (19 more...)

2505.07705

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
North America > United States > Florida > Miami-Dade County > Miami (0.14)
Asia > Singapore (0.04)
(11 more...)

Genre:

Research Report > New Finding (0.46)
Overview > Innovation (0.34)

Industry:

Education (1.00)
Media > Television (0.93)
Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.87)

Miller, Justin K, Tang, Wenjia

Evaluating LLM Metrics Through Real-World Capabilities

As generative AI becomes increasingly embedded in everyday workflows, it is important to evaluate its performance in ways that reflect real-world usage rather than abstract notions of intelligence. Unlike many existing benchmarks that assess general intelligence, our approach focuses on real-world utility, evaluating how well models support users in everyday tasks. While current benchmarks emphasize code generation or factual recall, users rely on AI for a much broader range of activities-from writing assistance and summarization to citation formatting and stylistic feedback. In this paper, we analyze large-scale survey data and usage logs to identify six core capabilities that represent how people commonly use Large Language Models (LLMs): Summarization, Technical Assistance, Reviewing Work, Data Structuring, Generation, and Information Retrieval. We then assess the extent to which existing benchmarks cover these capabilities, revealing significant gaps in coverage, efficiency measurement, and interpretability. Drawing on this analysis, we use human-centered criteria to identify gaps in how well current benchmarks reflect common usage that is grounded in five practical criteria: coherence, accuracy, clarity, relevance, and efficiency. For four of the six capabilities, we identify the benchmarks that best align with real-world tasks and use them to compare leading models. We find that Google Gemini outperforms other models-including OpenAI's GPT, xAI's Grok, Meta's LLaMA, Anthropic's Claude, DeepSeek, and Qwen from Alibaba-on these utility-focused metrics.

benchmark, large language model, machine learning, (17 more...)

2505.08253

Country:

Asia > China (0.04)
Oceania > Australia (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(5 more...)

Genre:

Overview (1.00)
Instructional Material > Course Syllabus & Notes (1.00)
Research Report > Experimental Study (0.67)

Industry:

Media (1.00)
Law (1.00)
Information Technology (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.69)

A Mamba-based Network for Semi-supervised Singing Melody Extraction Using Confidence Binary Regularization

He, Xiaoliang, Dong, Kangjie, Cao, Jingkai, Yu, Shuai, Li, Wei, Yu, Yi

Singing melody extraction (SME) is a key task in the field of music information retrieval. However, existing methods are facing several limitations: firstly, prior models use transformers to capture the contextual dependencies, which requires quadratic computation resulting in low efficiency in the inference stage. Secondly, prior works typically rely on frequencysupervised methods to estimate the fundamental frequency (f0), which ignores that the musical performance is actually based on notes. Thirdly, transformers typically require large amounts of labeled data to achieve optimal performances, but the SME task lacks of sufficient annotated data. To address these issues, in this paper, we propose a mamba-based network, called SpectMamba, for semi-supervised singing melody extraction using confidence binary regularization. In particular, we begin by introducing vision mamba to achieve computational linear complexity. Then, we propose a novel note-f0 decoder that allows the model to better mimic the musical performance. Further, to alleviate the scarcity of the labeled data, we introduce a confidence binary regularization (CBR) module to leverage the unlabeled data by maximizing the probability of the correct classes. The proposed method is evaluated on several public datasets and the conducted experiments demonstrate the effectiveness of our proposed method.

artificial intelligence, machine learning, melody extraction, (16 more...)

2505.08681

Country:

Asia > China > Shanghai > Shanghai (0.05)
Asia > Japan > Honshū > Chūgoku > Hiroshima Prefecture > Hiroshima (0.04)

Genre: Research Report (0.82)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News

Liu, Yuhan, Liu, Yuxuan, Zhang, Xiaoqing, Chen, Xiuying, Yan, Rui

In today's digital environment, the rapid propagation of fake news via social networks poses significant social challenges. Most existing detection methods either employ traditional classification models, which suffer from low interpretability and limited generalization capabilities, or craft specific prompts for large language models (LLMs) to produce explanations and results directly, failing to leverage LLMs' reasoning abilities fully. Inspired by the saying that "truth becomes clearer through debate," our study introduces a novel multi-agent system with LLMs named TruEDebate (TED) to enhance the interpretability and effectiveness of fake news detection. TED employs a rigorous debate process inspired by formal debate settings. Central to our approach are two innovative components: the DebateFlow Agents and the InsightFlow Agents. The DebateFlow Agents organize agents into two teams, where one supports and the other challenges the truth of the news. These agents engage in opening statements, cross-examination, rebuttal, and closing statements, simulating a rigorous debate process akin to human discourse analysis, allowing for a thorough evaluation of news content. Concurrently, the InsightFlow Agents consist of two specialized sub-agents: the Synthesis Agent and the Analysis Agent. The Synthesis Agent summarizes the debates and provides an overarching viewpoint, ensuring a coherent and comprehensive evaluation. The Analysis Agent, which includes a role-aware encoder and a debate graph, integrates role embeddings and models the interactions between debate roles and arguments using an attention mechanism, providing the final judgment.

large language model, machine learning, natural language, (17 more...)

2505.08532

Country:

North America > United States (0.46)
Asia > China (0.29)
Asia > Middle East > UAE (0.28)

Genre: Research Report > Experimental Study (0.46)

Industry: Media > News (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Not that Groove: Zero-Shot Symbolic Music Editing

Zhang, Li

Most work in AI music generation focused on audio, which has seen limited use in the music production industry due to its rigidity. To maximize flexibility while assuming only textual instructions from producers, we are among the first to tackle symbolic music editing. We circumvent the known challenge of lack of labeled data by proving that LLMs with zero-shot prompting can effectively edit drum grooves. The recipe of success is a creatively designed format that interfaces LLMs and music, while we facilitate evaluation by providing an evaluation dataset with annotated unit tests that highly aligns with musicians' judgment.

16th note, large language model, machine learning, (21 more...)

2505.08203

Genre: Research Report (0.40)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)