Goto

Collaborating Authors

 Media


12 award-winning photos of our beautiful world

Popular Science

In Onyx Tempest, I wanted to capture the intensity of a Friesian skidding into a sharp turn. The backlit dust, flying mane, and sudden shift in momentum revealed his power and control. I framed the moment to emphasize energy, contrast, and the precision of the movement. Breakthroughs, discoveries, and DIY tips sent every weekday. The reFocus Awards has announced the stunning winners of the 2025 Photographers of the Year at the World Photo Annual .


Consensus and Subjectivity of Skin Tone Annotation for ML Fairness

Neural Information Processing Systems

Understanding different human attributes and how they affect model behavior may become a standard need for all model creation and usage, from traditional computer vision tasks to the newest multimodal generative AI systems. In computer vision specifically, we have relied on datasets augmented with perceived attribute signals (eg, gender presentation, skin tone, and age) and benchmarks enabled by these datasets. Typically labels for these tasks come from human annotators. However, annotating attribute signals, especially skin tone, is a difficult and subjective task. Perceived skin tone is affected by technical factors, like lighting conditions, and social factors that shape an annotator's lived experience.This paper examines the subjectivity of skin tone annotation through a series of annotation experiments using the Monk Skin Tone (MST) scale~\cite{Monk2022Monk}, a small pool of professional photographers, and a much larger pool of trained crowdsourced annotators. Along with this study we release the Monk Skin Tone Examples (MST-E) dataset, containing 1515 images and 31 videos spread across the full MST scale. MST-E is designed to help train human annotators to annotate MST effectively.Our study shows that annotators can reliably annotate skin tone in a way that aligns with an expert in the MST scale, even under challenging environmental conditions. We also find evidence that annotators from different geographic regions rely on different mental models of MST categories resulting in annotations that systematically vary across regions. Given this, we advise practitioners to use a diverse set of annotators and a higher replication count for each image when annotating skin tone for fairness research.


Amazon adds controversial AI facial recognition to Ring

FOX News

Amazon Ring introduces AI-powered facial recognition to identify friends and delivery drivers, while privacy advocates warn of surveillance risks despite convenience benefits.


Video Timeline Modeling For News Story Understanding

Neural Information Processing Systems

In this paper, we present a novel problem, namely video timeline modeling. Our objective is to create a video-associated timeline from a set of videos related to a specific topic, thereby facilitating the content and structure understanding of the story being told. This problem has significant potential in various real-world applications, for instance, news story summarization.


What Kind of New World Is Being Born?

The New Yorker

What Kind of New World Is Being Born? According to the Gospel of Luke, the Virgin Mary first learns that she'll soon give birth to Christ when she gets an unsolicited visit from an angel. Nice messenger service if you can get it. But before trusty Gabriel can dispense the good news upon which Christmas depends he has to calm the girl down. "Fear not," he says, and, in a way, this sombre reassurance is the Yuletide message in drastic miniature.


Defending Against Neural Fake News

Neural Information Processing Systems

Recent progress in natural language generation has raised dual-use concerns. While applications like summarization and translation are positive, the underlying technology also might enable adversaries to generate neural fake news: targeted propaganda that closely mimics the style of real news. Modern computer security relies on careful threat modeling: identifying potential threats and vulnerabilities from an adversary's point of view, and exploring potential mitigations to these threats. Likewise, developing robust defenses against neural fake news requires us first to carefully investigate and characterize the risks of these models. We thus present a model for controllable text generation called Grover.


Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing

Neural Information Processing Systems

Large-scale text-to-image generative models have been a ground-breaking development in generative AI, with diffusion models showing their astounding ability to synthesize convincing images following an input text prompt. The goal of image editing research is to give users control over the generated images by modifying the text prompt. Current image editing techniques are susceptible to unintended modifications of regions outside the targeted area, such as on the background or on distractor objects which have some semantic or visual relationship with the targeted object. According to our experimental findings, inaccurate cross-attention maps are at the root of this problem. Based on this observation, we propose $\textit{Dynamic Prompt Learning}$ ($DPL$) to force cross-attention maps to focus on correct $\textit{noun}$ words in the text prompt. By updating the dynamic tokens for nouns in the textual input with the proposed leakage repairment losses, we achieve fine-grained image editing over particular objects while preventing undesired changes to other image regions. Our method $DPL$, based on the publicly available $\textit{Stable Diffusion}$, is extensively evaluated on a wide range of images, and consistently obtains superior results both quantitatively (CLIP score, Structure-Dist) and qualitatively (on user-evaluation). We show improved prompt editing results for Word-Swap, Prompt Refinement, and Attention Re-weighting, especially for complex multi-object scenes.


12 books you need to read in 2026

BBC News

Whenever I fantasise about a couple of hours of uninterrupted relaxation during the chilly winter months, my mind immediately conjures up images of curling up on the sofa with a deliciously good book. And when summer eventually comes around, just swap the location to a sun lounger in the back garden (or somewhere more exotic). So with 2026 nearly upon us, join me for an eclectic taste of a few literary delights worth feasting upon over the next 12 months. It's the final instalment of Oseman's hit graphic novel series which has followed the lives of Nick and Charlie, two teenage boys who fall for each other at school. Along with their friends, we've followed all the ups and downs of their relationship as they navigated family drama, homophobia and mental health issues, alongside the joy of first love.


Exploring Low-Dimensional Subspace in Diffusion Models for Controllable Image Editing

Neural Information Processing Systems

Recently, diffusion models have emerged as a powerful class of generative models. Despite their success, there is still limited understanding of their semantic spaces. This makes it challenging to achieve precise and disentangled image generation without additional training, especially in an unsupervised way. In this work, we improve the understanding of their semantic spaces from intriguing observations: among a certain range of noise levels, (1) the learned posterior mean predictor (PMP) in the diffusion model is locally linear, and (2) the singular vectors of its Jacobian lie in low-dimensional semantic subspaces. We provide a solid theoretical basis to justify the linearity and low-rankness in the PMP.


Democrats warn Trump green-lighting Nvidia AI chip sales could boost China's military edge

FOX News

This material may not be published, broadcast, rewritten, or redistributed. Quotes displayed in real-time or delayed by at least 15 minutes. Market data provided by Factset . Powered and implemented by FactSet Digital Solutions . Mutual Fund and ETF data provided by Refinitiv Lipper .