AITopics | Europe

Collaborating Authors

Europe

05057404e0cab4fe58971dc3a7d6044c-Supplemental-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsJun-14-2026, 10:02:56 GMT

The authors would like to thank Ulrich-Michael, Frances, James, Maryam, and Mandolyn for their help in labeling the dataset. The work at the Université de Montréal was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) (Paull), an NSERCPGS DScholarship (Morin) and an FRQNT Doctoral Scholarship (Morin). Moreover, this research was enabled in part by compute resources provided by Mila (mila.quebec). The work at the University of Freiburg was funded by an academic grant from NVIDIA. The work at the University of Oxford was supported by a Royal Society University Research Fellowship (Fallon, Kassab), a Sellafield Robotics and AICentre of Excellence Grant, and EPSRCC2CGrant EP/Z531212/1 (Mattamala), and the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT)(No.

artificial intelligence, conceptgraph, conceptgraph 0, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.24)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.24)
Europe > Germany > Baden-Württemberg > Freiburg (0.24)

Industry: Information Technology (0.34)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

04185b5ae2d450ef39bd53c0ec4802cb-Paper-Conference.pdf

Neural Information Processing SystemsJun-14-2026, 09:22:31 GMT

The recent rise of Large Reasoning Models (LRMs) has significantly improved multi-step reasoning performance, but often at the cost of generating excessively long reasoning chains. This paper revisits the efficiency of such reasoning processes through an information-theoretic lens, revealing a fundamental trade-off between reasoning length and semantic efficiency. We propose two metrics--InfoBias and InfoGain--to quantify divergence from ideal reasoning paths and stepwise information contribution, respectively. Empirical analyses show that longer reasoning chains tend to exhibit higher information bias and diminishing information gain, especially for incorrect answers. Motivated by these findings, we introduce an entropy-based Adaptive Think strategy that dynamically halts reasoning once confidence is sufficiently high, improving efficiency while maintaining competitive accuracy. Compared to the Vanilla Think approach (default mode), our strategy yields a 1.10% improvement in average accuracy and a 50.80% reduction in token usage on QwQ-32B across six benchmark tasks spanning diverse reasoning types and difficulty levels, demonstrating superior efficiency and reasoning performance. These results underscore the promise of entropy-based methods for enhancing both accuracy and cost-effiiciency in large language model deployment.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Asia (0.67)
North America > United States (0.45)
Europe > Austria (0.27)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area (0.46)
Education > Educational Setting (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Should you store chocolate in the fridge or in the cupboard? Scientist finally settles the debate - so, do you agree with his advice?

Daily Mail - Science & techJun-14-2026, 08:48:14 GMT

Concertgoer, 51, who plunged to his death in front of horrified wife at Madison Square Garden is identified as'much-loved' dad-of-two Jennifer Lopez enjoys concert night with Ben Affleck's child Fin and her own child Oskar CNN star Jake Tapper slammed for choice of guests for his Father's Day TV special: 'What the heck?' Call me cynical, but the real reason Gruesome Twosome Harry and Meghan are returning to the UK is just so obvious... and highly humiliating: MAUREEN CALLAHAN No one can see the real reason Jelly Roll divorced Bunnie XO. Family-man facade of award-winning children's swim coach is shattered by disturbing teen babysitter claims: Read all the vile texts How to boost your testosterone WITHOUT supplements or risky treatments: Jason, 56, doubled his levels with these simple lifestyle tweaks - and doctors say any man can do the same. Here's how to reap the benefits to your body AND sex life My secret sex fantasy is destroying my marriage. I'm repulsed by my husband... but can't bear to admit what I REALLY want: DEAR JANE Trump sparks confusion after sharing Father's Day photo of'mystery' woman while appearing to call her a'great daughter' Karoline Leavitt flaunts her postpartum body seven weeks after giving birth... and shares gushing tribute to husband, 60, for Father's Day I had sex with my brother.

artificial intelligence, chocolate, social media, (15 more...)

Daily Mail - Science & tech

Country:

North America > United States (1.00)
Africa (1.00)
Europe > United Kingdom > England (0.46)

Genre: Research Report > New Finding (0.46)

Industry:

Media > Television (1.00)
Media > Music (1.00)
Media > Film (1.00)
(5 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Communications > Mobile (0.69)
Information Technology > Artificial Intelligence (0.68)

Add feedback

EndoBench: AComprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis

Neural Information Processing SystemsJun-14-2026, 08:46:15 GMT

Endoscopic procedures are essential for diagnosing and treating internal diseases, and multi-modal large language models (MLLMs) are increasingly applied to assist in endoscopy analysis. However, current benchmarks are limited, as they typically cover specific endoscopic scenarios and a small set of clinical tasks, failing to capture the real-world diversity of endoscopic scenarios and the full range of skills needed in clinical workflows. To address these issues, we introduce EndoBench, the first comprehensive benchmark specifically designed to assess MLLMs across the full spectrum of endoscopic practice with multi-dimensional capacities. EndoBench encompasses 4 distinct endoscopic scenarios, 12 specialized clinical tasks with 12 secondary subtasks, and 5 levels of visual prompting granularities, resulting in 6,832 rigorously validated VQA pairs from 21 diverse datasets. Our multi-dimensional evaluation framework mirrors the clinical workflow--spanning anatomical recognition, lesion analysis, spatial localization, and surgical operations--to holistically gauge the perceptual and diagnostic abilities of MLLMs in realistic scenarios. We benchmark 23 state-of-the-art models, including generalpurpose, medical-specialized, and proprietary MLLMs, and establish human clinician performance as a reference standard. Our extensive experiments reveal: (1) proprietary MLLMs outperform open-source and medical-specialized models overall, but still trail human experts; (2) medical-domain supervised fine-tuning substantially boosts task-specific accuracy; and (3) model performance remains sensitive to prompt format and clinical task complexity. EndoBench establishes a new standard for evaluating and advancing MLLMs in endoscopy, highlighting both progress and persistent gaps between current models and expert clinical reasoning. We publicly release our benchmark and code.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: Europe > Austria (0.27)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry:

Health & Medicine > Therapeutic Area > Gastroenterology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Let LRMs Break Free from Overthinking via Self-Braking Tuning

Neural Information Processing SystemsJun-14-2026, 08:27:03 GMT

Large reasoning models (LRMs), such as OpenAI o1 and DeepSeek-R1, have significantly enhanced their reasoning capabilities by generating longer chains of thought, demonstrating outstanding performance across a variety of tasks. However, this performance gain comes at the cost of a substantial increase in redundant reasoning during the generation process, leading to high computational overhead and exacerbating the issue of overthinking. Although numerous existing approaches aim to address the problem of overthinking, they often rely on external interventions. In this paper, we propose a novel framework, Self-Braking Tuning (SBT), which tackles overthinking from the perspective of allowing the model to regulate its own reasoning process, thus eliminating the reliance on external control mechanisms. We construct a set of overthinking identification metrics based on standard answers and design a systematic method to detect redundant reasoning. This method accurately identifies unnecessary steps within the reasoning trajectory and generates training signals for learning self-regulation behaviors. Building on this foundation, we develop a complete strategy for constructing data with adaptive reasoning lengths and introduce an innovative braking prompt mechanism that enables the model to naturally learn when to terminate reasoning at an appropriate point. Experiments across mathematical benchmarks (AIME, AMC, MATH500, GSM8K) demonstrate that our method reduces token consumption by up to 60% while maintaining comparable accuracy to unconstrained models.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

Asia > China (0.28)
Europe > Austria (0.28)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Scaling Image Geo-Localization to Continent Level

Neural Information Processing SystemsJun-14-2026, 03:26:41 GMT

Determining the precise geographic location of an image at a global scale remains an unsolved challenge. Standard image retrieval techniques are inefficient due to the sheer volume of images (>100M) and fail when coverage is insufficient. Scalable solutions, however, involve a trade-off: global classification typically yields coarse results (10+ kilometers), while cross-view retrieval between ground and aerial imagery suffers from a domain gap and has been primarily studied on smaller regions. This paper introduces a hybrid approach that achieves fine-grained geo-localization across a large geographic expanse the size of a continent. We leverage a proxy classification task during training to learn rich feature representations that implicitly encode precise location information. We combine these learned prototypes with embeddings of aerial imagery to increase robustness to the sparsity of ground-level data. This enables direct, fine-grained retrieval over areas spanning multiple countries. Our extensive evaluation demonstrates that our approach can localize within 200m more than 68\% of queries of a dataset covering a large part of Europe. The code is publicly available at scaling-geoloc.github.io .

artificial intelligence, machine learning, proceedings, (3 more...)

Neural Information Processing Systems

Country: Europe (0.27)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

UrbanIng-V2X: A Large-Scale Multi-Vehicle, Multi-Infrastructure Dataset Across Multiple Intersections for Cooperative Perception

Neural Information Processing SystemsJun-13-2026, 22:47:52 GMT

Recent cooperative perception datasets have played a crucial role in advancing smart mobility applications by enabling information exchange between intelligent agents, helping to overcome challenges such as occlusions and improving overall scene understanding. While some existing real-world datasets incorporate both vehicle-to-vehicle and vehicle-to-infrastructure interactions, they are typically limited to a single intersection or a single vehicle. A comprehensive perception dataset featuring multiple connected vehicles and infrastructure sensors across several intersections remains unavailable, limiting the benchmarking of algorithms in diverse traffic environments. Consequently, overfitting can occur, and models may demonstrate misleadingly high performance due to similar intersection layouts and traffic participant behavior. To address this gap, we introduce UrbanIng-V2X, the first large-scale, multi-modal dataset supporting cooperative perception involving vehicles and infrastructure sensors deployed across three urban intersections in Ingolstadt, Germany. UrbanIng-V2X consists of 34 temporally aligned and spatially calibrated sensor sequences, each lasting 20 seconds. All sequences contain recordings from one of three intersections, involving two vehicles and up to three infrastructure-mounted sensor poles operating in coordinated scenarios. In total, UrbanIng-V2X provides data from 12 vehicle-mounted RGB cameras, 2 vehicle LiDARs, 17 infrastructure thermal cameras, and 12 infrastructure LiDARs. All sequences are annotated at a frequency of 10 Hz with 3D bounding boxes spanning 13 object classes, resulting in approximately 712k annotated instances across the dataset.

artificial intelligence, proceedings, urbaning-v2x, (8 more...)

Neural Information Processing Systems

Country: Europe > Germany > Bavaria > Upper Bavaria > Ingolstadt (0.26)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.81)
Information Technology > Artificial Intelligence > Vision (0.58)

Add feedback

Meet the New Dyson Vacuums: V16 Piston Animal, V10 Konical, V8 Cyclone (2026)

WIREDJun-13-2026, 11:33:00 GMT

The rest of Dyson's promised 2026 vacuum lineup is here, from the new Dyson V16 Piston Animal to an updated version of the favored Dyson V8 Cyclone. Dyson's vacuum lineup had a new look planned for this year . Some of the vacuums have already arrived, like the Dyson PencilVac and Dyson Spot+Scrub robot vacuum, but others we've still been waiting to see. That wait is over as of this month, as Dyson has finally dropped the rest of its anticipated models. Dyson now has three new cordless vacuums you can shop, plus one with a Submarine head variant: the Dyson V16 Piston Animal ($980) and Dyson V16 Piston Animal Submarine ($1,100), the Dyson V10 Konical ($500), and the Dyson V8 Cyclone ($400) .

artificial intelligence, dyson, vacuum, (16 more...)

WIRED

Country:

North America > United States (0.15)
Europe (0.15)

Industry: Appliances & Durable Goods (0.62)

Technology: Information Technology > Artificial Intelligence > Robots (0.36)

Add feedback

Officer accused of using AI to 'create evidence'

BBC NewsJun-13-2026, 11:07:08 GMT

Officer accused of using AI to'create evidence' Police have launched a criminal investigation into an officer accused of using artificial intelligence (AI) systems to create evidential material in a number of cases. The Derbyshire Police officer has been removed from frontline duties, pending the outcome of the investigation, said the force. The officer is alleged to have perverted the course of justice, but no arrests have been made, said police. A Crown Prosecution Service spokesperson said they were working with police, adding: We are engaging with defence teams and the courts in appropriate cases. They added: As police inquiries continue, it would not be appropriate to comment further.

artificial intelligence, football 2026, social media, (11 more...)

BBC News

Country:

North America (1.00)
Europe > United Kingdom (1.00)

Industry:

Leisure & Entertainment (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Government > Regional Government > Europe Government > United Kingdom Government (0.53)

Technology:

Information Technology > Communications > Social Media (0.74)
Information Technology > Artificial Intelligence > Applied AI (0.70)

Add feedback

UK sets out AI infrastructure push at London Tech Week – how does it stack up?

The GuardianJun-13-2026, 11:01:12 GMT

The issue of AI sovereignty was in focus at London Tech Week. The issue of AI sovereignty was in focus at London Tech Week. UK sets out AI infrastructure push at London Tech Week - how does it stack up? Ownership of the commanding heights of the AI economy is a political talking point around the world, as countries seek to assert some control of a technology dominated by the US and China. London Tech Week, the showcase event for the UK tech industry, focused heavily on that theme this week.

artificial intelligence, government, social media, (14 more...)

The Guardian

Country: Europe > United Kingdom (1.00)

Industry:

Government (1.00)
Information Technology > Hardware (0.85)
Leisure & Entertainment > Sports (0.70)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback