AITopics | Media

Disentanglement Beyond Static vs. Dynamic: A Benchmark and Evaluation Framework for Multi-Factor Sequential Representations

Barami, Tal, Berman, Nimrod, Naiman, Ilan, Hason, Amos H., Ezra, Rotem, Azencot, Omri

arXiv.org Artificial Intelligence, Oct-28-2025

Learning disentangled representations in sequential data is a key goal in deep learning, with broad applications in vision, audio, and time series. While real-world data involves multiple interacting semantic factors over time, prior work has mostly focused on simpler two-factor static and dynamic settings, primarily because such settings make data collection easier, thereby overlooking the inherently multi-factor nature of real-world data. We introduce the first standardized benchmark for evaluating multi-factor sequential disentanglement across six diverse datasets spanning video, audio, and time series. Our benchmark includes modular tools for dataset integration, model development, and evaluation metrics tailored to multi-factor analysis. We additionally propose a post-hoc Latent Exploration Stage to automatically align latent dimensions with semantic factors, and introduce a Koopman-inspired model that achieves state-of-the-art results. Moreover, we show that Vision-Language Models can automate dataset annotation and serve as zero-shot disentanglement evaluators, removing the need for manual labels and human intervention. Together, these contributions provide a robust and scalable foundation for advancing multi-factor sequential disentanglement. Our code is available on GitHub, and the datasets and trained models are available on Hugging Face.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2510.17313

Country: North America > United States (0.45)

Genre: Research Report > New Finding (0.46)

Industry:

Media > Music (0.46)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Automatic Music Sample Identification with Multi-Track Contrastive Learning

Riou, Alain, Serrà, Joan, Mitsufuji, Yuki

arXiv.org Artificial Intelligence, Oct-28-2025

Sampling, the technique of reusing pieces of existing audio tracks to create new music content, is a very common practice in modern music production. In this paper, we tackle the challenging task of automatic sample identification, that is, detecting such sampled content and retrieving the material from which it originates. To do so, we adopt a self-supervised learning approach that leverages a multi-track dataset to create positive pairs of artificial mixes, and design a novel contrastive learning objective. We show that this method significantly outperforms previous state-of-the-art baselines, is robust across genres, and scales well as the number of noise songs in the reference database increases. In addition, we extensively analyze the contribution of the different components of our training pipeline and highlight, in particular, the need for high-quality separated stems for this task.

artificial intelligence, dataset, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2510.11507

Genre: Research Report (1.00)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Cultivating Pluralism In Algorithmic Monoculture: The Community Alignment Dataset

Zhang, Lily Hong, Milli, Smitha, Jusko, Karen, Smith, Jonathan, Amos, Brandon, Bouaziz, Wassim, Revel, Manon, Kussman, Jack, Sheynin, Yasha, Titus, Lisa, Radharapu, Bhaktipriya, Yu, Jane, Sarma, Vidya, Rose, Kris, Nickel, Maximilian

arXiv.org Artificial Intelligence, Oct-28-2025

How can large language models (LLMs) serve users with varying preferences that may conflict across cultural, political, or other dimensions? To address this challenge, this paper establishes four key results. First, we demonstrate, through a large-scale multilingual human study with representative samples from five countries (N=15,000), that humans exhibit significantly more variation in preferences than the responses of 21 state-of-the-art LLMs. Second, we show that existing methods for preference dataset collection are insufficient for learning the diversity of human preferences even along two of the most salient dimensions of variability in global values, due to the underlying homogeneity of candidate responses. Third, we argue that this motivates the need for negatively-correlated sampling when generating candidate sets, and we show that simple prompt-based techniques for doing so significantly enhance the performance of alignment methods in learning heterogeneous preferences. Fourth, based on this novel candidate sampling approach, we collect and open-source Community Alignment, the largest and most representative multilingual and multi-turn preference dataset to date, featuring almost 200,000 comparisons from annotators spanning five countries. We hope that the Community Alignment dataset will be a valuable resource for improving the effectiveness of LLMs for a diverse global population.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2507.0965

Country:

Europe (1.00)
Asia (1.00)
South America (0.92)
North America > United States > New York (0.27)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (0.92)
(4 more...)

Industry:

Water & Waste Management > Water Management > Water Supplies & Services (1.00)
Media > Music (1.00)
Materials > Chemicals > Agricultural Chemicals (1.00)
(17 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Tesla revives 'Mad Max' mode in Full Self-Driving

FOX News, Oct-27-2025, 20:30:57 GMT

Tesla brings back Mad Max mode in its Full Self-Driving system update, allowing more aggressive driving amid ongoing regulatory investigations.

full self-driving, mad max mode, tesla, (8 more...)

FOX News

Country:

North America > United States > California (0.05)
North America > United States > Iowa (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)

Industry:

Leisure & Entertainment > Sports (1.00)
Media > News (0.84)
Health & Medicine > Therapeutic Area (0.77)
(2 more...)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Cancer cures could be in reach with cutting-edge medical tech, doctor predicts

FOX News, Oct-27-2025, 19:02:04 GMT

Fox News senior medical analyst Dr. Marc Siegel predicts that artificial intelligence will help cure cancer within five to 10 years through early detection and personalized treatments.

cancer, lifestyle real estate tech science, siegel, (7 more...)

FOX News

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.05)
North America > United States > California (0.05)

Industry:

Media (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.31)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media (0.75)

Your eyes can only handle so much HDTV

More pixels doesn't always mean a better screen. Every year, tech and television companies boast their products' latest and greatest, highest-resolution displays. The 4K display, a screen with a horizontal resolution of approximately 4,000 pixels, first became widely available around 2014. Barely a decade later, you can purchase a TV with double the resolution.

andrew paul, resolution, snellen chart, (16 more...)

Popular Science

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.06)
Asia > South Korea (0.05)

Genre: Research Report > New Finding (0.37)

Industry:

Media (0.50)
Health & Medicine > Therapeutic Area (0.48)

Technology: Information Technology > Artificial Intelligence (0.71)

Half of all uncontacted Indigenous tribes may disappear by 2036

Survival International's new report illustrates the dangers they face, and their resilience. This photo of an Awa Guajá couple was taken only five days before their first contact with outsiders in 1992. Half of the world's remaining uncontacted Indigenous groups may disappear within a decade without concerted conservation efforts. The dire assessment is detailed in a new report published on October 27 by the nonprofit advocacy group Survival International, and is based on years of field research, interviews, and information-gathering expeditions.

laura baisa, tribe, uncontacted indigenous tribe, (14 more...)

Popular Science

Country:

South America > Brazil (0.05)
North America > United States > California (0.05)
North America > United States > Alaska (0.05)
(3 more...)

Genre: Research Report (0.56)

Industry:

Materials > Metals & Mining (0.96)
Health & Medicine (0.75)
Media (0.72)

Technology: Information Technology > Artificial Intelligence (0.50)

Jennifer Lawrence Goes Dark

The New Yorker, Oct-27-2025, 10:00:00 GMT

She has been cast in maternal roles since her teens. Now, playing a mother for the first time since becoming one, she has chosen the part of a woman pushed past the edge of sanity. In "Die My Love," Lawrence, as Grace, vibrates with boredom and fury. The novel "Die, My Love," by the Argentinean writer Ariana Harwicz, is narrated by a wife and new mother who is living in rural France and seems to be losing her mind. Motherhood has inserted an immersion blender into her psyche: lust, repulsion, pleasure, and doom swirl into a single mess. She calls herself a "sodomising rodent" with "bullet-wounds for eyes," and thinks, "When I masturbate I desecrate crypts, and when I rock my baby I say amen, and when I smile I unplug an iron lung." One night, standing in the cold, staring at her family through a sliding door, she thinks, "I'll stop trying to draw blood from a stone. I'll contain my madness, I'll use the bathroom. I'll put my baby to sleep, jerk off my man and postpone my rebellion in favor of a better life." Martin Scorsese saw a brief review of the novel in the some years ago and decided to pick up a copy. He found it to be a "powerful mosaic of the mind," he told me recently. Scorsese is a member of a book club of sorts, with a few other filmmakers, who read with an eye toward adaptation. For "Die, My Love," he imagined casting Jennifer Lawrence in the lead. He'd been amazed by her performance in Darren Aronofsky's bewildering 2017 fantasia, "Mother!" In that surreal film--it's like an allegory set inside an oil painting--Lawrence plays a woman living with her poet husband in an old farmhouse, which is gradually, then apocalyptically, invaded by strangers. "She really is feeling everything that's happening, in what appears to be a dream of some kind," Scorsese said. He and Lawrence had discussed adaptations before. 
They considered "The Awakening," Kate Chopin's 1899 novel of female liberation, which ends with the protagonist, Edna Pontellier, walking into the sea. "Die, My Love" was like "The Awakening" if it began with Edna already underwater.

ciarrocchi, lawrence, movie, (16 more...)

The New Yorker

Country:

North America > United States > Indiana > Marion County > Lawrence (0.24)
Europe > France (0.24)
North America > United States > New York (0.05)
(15 more...)

Genre: Personal (1.00)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Government (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence (0.68)
Information Technology > Communications > Social Media (0.46)

Some People Can't See Mental Images. The Consequences Are Profound

The New Yorker, Oct-27-2025, 10:00:00 GMT

Ebeyer published posts about famous people who had realized that they were aphantasic: Glen Keane, one of the leading Disney animators on "The Little Mermaid" and "Beauty and the Beast"; John Green, the author of "The Fault in Our Stars," whose books had sold more than fifty million copies; J. Craig Venter, the biologist who led the first team to sequence the human genome; Blake Ross, who co-created the Mozilla-Firefox web browser when he was nineteen. Ebeyer also wanted the Aphantasia Network to be a place where aphantasics could find recent scientific research. For instance, estimating the strength of a person's imagery had been thoroughly subjective until Joel Pearson, a cognitive neuroscientist at the University of New South Wales, in Australia, devised tests to measure it more precisely. In a paper from 2022, Pearson reported that when people with imagery visualized a bright object their pupils contracted, as though they were seeing a bright object in real life, but the pupils of aphantasics imagining a bright object stayed the same. Another study of his had shown that, although aphantasics had the same fear response (sweating) as typical imagers to a frightening image shown on a screen, when exposed to a frightening story they barely responded at all.

aphantasia, imagery, zeman, (16 more...)

The New Yorker

Country:

Oceania > Australia > New South Wales (0.24)
North America > United States > New York (0.04)
North America > Canada > Ontario > Toronto (0.04)
(15 more...)

Genre:

Research Report (0.93)
Personal (0.67)

Industry:

Media (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
Government (1.00)
Education (0.93)

Technology:

Information Technology > Artificial Intelligence (0.93)
Information Technology > Communications (0.87)

The Argument for Letting AI Burn It All Down

WIRED, Oct-27-2025, 10:00:00 GMT

When the AI bubble bursts, the nerds will do their best work. Suddenly, and not long ago, our dearest tech industry leaders began to suggest caution. Sam Altman said that AI is in a bubble "for sure," albeit one formed around "a kernel of truth." Mark Zuckerberg said an AI bubble "is quite possible," though "if the models keep on growing in capability year over year and demand keeps growing, then maybe there is no collapse, or something." Even Eric Schmidt is saying to calm down about artificial general intelligence and focus on competing with China.

ai burn, argument, openai, (16 more...)

WIRED

Country:

Asia > China (0.24)
North America > United States > California (0.14)
Europe > Slovenia (0.04)
(2 more...)

Industry:

Banking & Finance (0.95)
Transportation > Passenger (0.47)
Transportation > Ground > Road (0.47)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.33)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)
