Industry
France to ditch Palantir's AI data tools in favour of domestic provider
The French decision to use its own AI models comes amid growing concern among European governments about US-controlled technology. The French decision to use its own AI models comes amid growing concern among European governments about US-controlled technology. Move to ChapsVision is to avoid'strategic dependencies', says PM amid concern about reliance on US-controlled tools Tue 16 Jun 2026 13.08 EDTLast modified on Tue 16 Jun 2026 15.39 EDT France's domestic intelligence service is to ditch AI data tools from the US tech company Palantir in favour of a domestic provider in an effort to avoid "strategic dependency", the prime minister, Sébastien Lecornu, has said. "We must use our own AI models; we cannot accept new strategic dependencies in the digital sphere," Lecornu posted on social media. "We cannot rely on tools developed by foreign powers. France must have its own tools."
Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
In cooperative Multi-Agent Reinforcement Learning (MARL), it is a common practice to tune hyperparameters in ideal simulated environments to maximize cooperative performance. However, policies tuned for cooperation often fail to maintain robustness and resilience under real-world uncertainties. Building trustworthy MARL systems requires a deep understanding of robustness, which ensures stability under uncertainties, and resilience, the ability to recover from disruptions--a concept extensively studied in control systems but largely overlooked in MARL. In this paper, we present a large-scale empirical study comprising over 82,620 experiments to evaluate cooperation, robustness, and resilience in MARL across 4 real-world environments, 13 uncertainty types, and 15 hyperparameters. Our key findings are: (1) Under mild uncertainty, optimizing cooperation improves robustness and resilience, but this link weakens as perturbations intensify. Robustness and resilience also varies by algorithm and uncertainty type.
Safety Pretraining: Toward the Next Generation of Safe AI
As large language models (LLMs) are increasingly deployed in high-stakes settings, the risk of generating harmful or toxic content remains a central challenge. Post-hoc alignment methods are brittle: once unsafe patterns are learned during pretraining, they are hard to remove. In this work, we present a data-centric pretraining framework that builds safety into the model from the start. Our framework consists of four key steps: (i) Safety Filtering: building a safety classifier to classify webdata into safe and unsafe categories; (ii) Safety Rephrasing: we recontextualize unsafe webdata into safer narratives; (iii) Native Refusal: we synthetically generate pretraining datasets that actively teach models to refuse on unsafe content and the moral reasoning behind it, and (iv) Harmfulness-Tag annotated pretraining: we flag unsafe content during pretraining using a special token, and use it to steer models away from unsafe generations at inference-time. Our safety-pretrained models reduce attack success rates from 38.8% to 8.4% on standard LLM safety benchmarks with no performance degradation on general tasks.
Enhancing Sample Selection Against Label Noise by Cutting Mislabeled Easy Examples
Sample selection is a prevalent approach in learning with noisy labels, aiming to identify confident samples for training. Although existing sample selection methods have achieved decent results by reducing the noise rate of the selected subset, they often overlook that not all mislabeled examples harm the model's performance equally. In this paper, we demonstrate that mislabeled examples correctly predicted by the model early in the training process are particularly harmful to model performance. We refer to these examples as Mislabeled Easy Examples (MEEs). To address this, we propose Early Cutting, which introduces a recalibration step that employs the model's later training state to re-select the confident subset identified early in training, thereby avoiding misleading confidence from early learning and effectively filtering out MEEs. Experiments on the CIFAR, WebVision, and full ImageNet-1k datasets demonstrate that our method effectively improves sample selection and model performance by reducing MEEs.
Qualcomm unveils its Snapdragon Reality Elite chip for next-gen AR headsets
The company also debuted a new platform for brands wanting to build their own AI glasses. High-end augmented reality and mixed reality devices are set to get a boost thanks to Qualcomm's latest XR chip. During a keynote at Augmented World Expo (AWE), the company unveiled its Snapdragon Reality Elite processor, which it says will allow the next generation of AR and mixed reality headsets to be smaller and more efficient. In terms of specs, the Snapdragon Reality Elite can support up to 4.4K resolution in each eye at 90 fps, a modest upgrade from the XR2+ Gen 2, but one that Qualcomm says will enable better image quality and lower latency. It also delivers significant improvements in terms of efficiency, with up to 20 percent boost in battery life while running up to 12 degrees Celsius (about 54 degrees Fahrenheit) cooler, compared with the XR2+ Gen 2. Performance-wise, Reality Elite comes with notable gains over the previous generation as well.
One Climate Change Innovation: Just Look Up
To build one family's dream house on a flood-prone Mississippi bayou, AD100 architect Tom Kundig decided the sky's the limit. Tom Kundig absorbed lessons in resilience before he even knew the word. As a child, he saw many of the industrial and agricultural buildings of the rural Pacific Northwest abandoned but still standing, the harsh winter conditions no match for their steel columns. That background came in handy when he was asked to design a house for a young family on a coastal Mississippi site susceptible to severe flooding. The clients, Joel and Jill Kavanaugh, had fallen in love with a plot bordering the Gulf Islands National Seashore in Ocean Springs, Mississippi.
GRASS: Scalable Data Attribution with Gradient Sparsification and Sparse Projection
Gradient-based data attribution methods, such as influence functions, are critical for understanding the impact of individual training samples without requiring repeated model retraining. However, their scalability is often limited by the high computational and memory costs associated with per-sample gradient computation. In this work, we propose GRASS, a novel gradient compression algorithm and its variants FACTGRASS for linear layers specifically, that explicitly leverage the inherent sparsity of per-sample gradients to achieve sub-linear space and time complexity. Extensive experiments demonstrate the effectiveness of our approach, achieving substantial speedups while preserving data influence fidelity. In particular, FACTGRASS achieves up to 165% faster throughput on billion-scale models compared to the previous state-of-the-art baselines.
3e5b0db387078ac4968fd536d3c3a019-Supplemental-Datasets_and_Benchmarks_Track.pdf
For models trained for multi-image input, text prompt is:850 Which objects are present in both images? You can think of your answer in any way (e.g. For models where we first concatenate the input images, the text prompt is:855 There are two images provided, one on the left and the other on the right.856 Which objects are present in both images? You can think of your answer in any way (e.g. We used the following procedure to guide our creation of images.
What in Common Models Hallucinate When Reasoning Across Scenes
Multimodal language models possess a remarkable ability to handle an openvocabulary worth of objects. Yet the best models still suffer from hallucinations when reasoning about scenes in the real world, revealing a gap between their seemingly strong performance on existing perception benchmarks that are saturating and their reasoning in the real world. To address this gap, we build a novel benchmark of in-the-wild scenes that we call Common-OBench. With more than 10.5k examples using exclusively new images not found in web training data to avoid contamination, Common-OBenchgoes beyond just perception, inspired by cognitive tests for humans, to probe reasoning across scenes by asking "what's in common?". We evaluate leading multimodal language models, including models specifically trained to reason. We find that perceiving objects in single images is easy for most models, yet reasoning across scenes is very challenging even for the best models, including reasoning models. Despite saturating many leaderboards focusing on perception, the best performing model only achieves 35% on Common-OBench--and on Common-OComplex, consisting of more complex scenes, the best model achieves only 1%. Curiously, we find models are more prone to hallucinate when similar objects are present in the scene, suggesting models may be relying on object co-occurrence seen during training. Among the models we evaluated, we found scale can provide modest improvements while models explicitly trained with multi-image inputs show bigger improvements, suggesting scaled multi-image training may offer promise.
You Can Finally Buy Snap's New AR Specs--for 2,150
You Can Finally Buy Snap's New AR Specs--for $2,195 Snap CEO Evan Spiegel lays out the company's vision for its augmented-reality smart glasses, arriving later this year. Snap--maker of the popular social app Snapchat--has a new pair of augmented-reality smart glasses called Specs. Snap CEO Evan Spiegel revealed the new glasses at an event during the Augmented World Expo (AWE) tech conference in Long Beach, California. As Snap frames it, this isn't a prototype or developer device--it's the first actual consumer version of the Specs AR glasses, unlike the previous generation exclusively sold to developers and creators. Snap says it expects the devices to ship this fall in the US, UK, and France.