Entropic Confinement and Mode Connectivity in Overparameterized Neural Networks

Di Carlo, Luca, Goddard, Chase, Schwab, David J.

arXiv.org Machine Learning

Modern neural networks exhibit a striking property: basins of attraction in the loss landscape are often connected by low-loss paths, yet optimization dynamics generally remain confined to a single convex basin (Baity-Jesi et al., 2019; Juneja et al., 2023) and rarely explore intermediate points. We resolve this paradox by identifying entropic barriers arising from the interplay between curvature variations along these paths and noise in optimization dynamics. Empirically, we find that curvature systematically rises away from minima, producing effective forces that bias noisy dynamics back toward the endpoints -- even when the loss remains nearly flat. These barriers persist longer than energetic barriers, shaping the late-time localization of solutions in parameter space. Our results highlight the role of curvature-induced entropic forces in governing both connectivity and confinement in deep learning landscapes.

Deep neural networks trained in the overparametrized regime exhibit a number of surprising and counterintuitive properties. One of the most striking is the observation that distinct solutions, found with standard optimization algorithms, are often connected by low-loss paths in parameter space (Garipov et al., 2018; Draxler et al., 2018; Frankle et al., 2020). Such mode connectivity results imply that the landscape is far less rugged than once assumed: minima that appear isolated are, in fact, linked by paths of low, nearly constant loss. At the same time, however, optimization dynamics display a seemingly contradictory behavior.
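The curvature measurement described above can be approximated with a Hutchinson-style trace estimator evaluated along a path between two minima. The sketch below is a minimal illustration on a toy quadratic-product loss, not the paper's networks or estimator; the loss, parameter vectors, and step sizes are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy loss with two minima at theta_a and theta_b; purely illustrative.
theta_a = np.zeros(10)
theta_b = np.ones(10)

def loss(theta):
    # Product of two quadratic wells: zero at both endpoints, with
    # curvature that varies along the straight path joining them.
    return np.sum((theta - theta_a) ** 2) * np.sum((theta - theta_b) ** 2)

def hessian_trace(theta, eps=1e-4, probes=64):
    """Hutchinson estimate of tr(H): E_v[v^T H v] with Rademacher probes v,
    where v^T H v is approximated by a central finite difference."""
    total = 0.0
    for _ in range(probes):
        v = rng.choice([-1.0, 1.0], size=theta.shape)
        # v^T H v ~ (L(theta + eps v) - 2 L(theta) + L(theta - eps v)) / eps^2
        total += (loss(theta + eps * v) - 2.0 * loss(theta)
                  + loss(theta - eps * v)) / eps**2
    return total / probes

# Scan loss and curvature along the linear path between the two minima.
for t in np.linspace(0.0, 1.0, 5):
    theta = (1.0 - t) * theta_a + t * theta_b
    print(f"t={t:.2f}  loss={loss(theta):10.4f}  tr(H)~{hessian_trace(theta):10.2f}")
```

On real networks one would replace the finite differences with Hessian-vector products from automatic differentiation; the toy only demonstrates the measurement, not the phenomenon.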




Ensembling Graph Predictions for AMR Parsing

Lam, Hoang Thanh

Neural Information Processing Systems

AMR parsing is an important problem in natural language processing (NLP) research, and it has broad applications in downstream tasks such as question answering [Kapanipathi et al., 2020] and common sense reasoning [Lim et al., 2020].


Solver-Free Decision-Focused Learning for Linear Optimization Problems

Berden, Senne, Mahmutoğulları, Ali İrfan, Tsouros, Dimos, Guns, Tias

arXiv.org Artificial Intelligence

Mathematical optimization is a fundamental tool for decision-making in a wide range of applications. However, in many real-world scenarios, the parameters of the optimization problem are not known a priori and must be predicted from contextual features. This gives rise to predict-then-optimize problems, where a machine learning model predicts problem parameters that are then used to make decisions via optimization. A growing body of work on decision-focused learning (DFL) addresses this setting by training models specifically to produce predictions that maximize downstream decision quality, rather than accuracy. While effective, DFL is computationally expensive, because it requires solving the optimization problem with the predicted parameters at each loss evaluation. In this work, we address this computational bottleneck for linear optimization problems, a common class of problems in both the DFL literature and real-world applications. We propose a solver-free training method that exploits the geometric structure of linear optimization to enable efficient training with minimal degradation in solution quality. Our method is based on the insight that a solution is optimal if and only if it achieves an objective value that is at least as good as that of its adjacent vertices on the feasible polytope. Building on this, our method compares the estimated quality of the ground-truth optimal solution with that of its precomputed adjacent vertices, and uses this comparison as the loss function. Experiments demonstrate that our method significantly reduces computational cost while maintaining high decision quality.
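The vertex-adjacency insight above can be sketched as a hinge-style surrogate loss. This is a minimal illustration assuming a minimization problem min c^T v over a polytope's vertices; the function name and the exact loss form are assumptions, not the authors' implementation.

```python
import numpy as np

def solver_free_loss(c_hat, v_star, adjacent_vertices):
    """Adjacency-based surrogate loss (sketch). For min c^T v, the vertex
    v_star is optimal iff c^T v_star <= c^T v_adj for every adjacent vertex.
    The loss penalizes predicted costs c_hat under which some precomputed
    neighbor of v_star looks strictly better than v_star -- no solver call."""
    return sum(max(0.0, float(c_hat @ v_star - c_hat @ v_adj))
               for v_adj in adjacent_vertices)

# Tiny example: unit-square polytope, ground-truth optimum (0, 0) for c = (1, 1).
v_star = np.array([0.0, 0.0])
neighbors = [np.array([1.0, 0.0]), np.array([0.0, 1.0])]

good = solver_free_loss(np.array([1.0, 1.0]), v_star, neighbors)
bad = solver_free_loss(np.array([-1.0, 0.5]), v_star, neighbors)
print(good, bad)  # good predictions incur zero loss; bad ones a positive penalty
```

Because each loss evaluation only needs dot products against precomputed neighbors, no linear program is solved during training, which is the source of the claimed speedup.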


RL makes MLLMs see better than SFT

Song, Junha, Yun, Sangdoo, Han, Dongyoon, Choo, Jaegul, Heo, Byeongho

arXiv.org Artificial Intelligence

A dominant assumption in Multimodal Language Model (MLLM) research is that its performance is largely inherited from the LLM backbone, given its immense parameter scale and remarkable capabilities. This has created a void in the understanding of the vision encoder, which determines how MLLMs perceive images. The recent shift in MLLM training paradigms, from Supervised Finetuning (SFT) to Reinforcement Learning (RL), magnifies this oversight: namely, the significant lack of analysis on how such training reshapes the vision encoder as well as the MLLM. To address this, we first investigate the impact of training strategies on MLLMs, where RL shows a clear advantage over SFT in strongly vision-related VQA benchmarks. Motivated by this, we conduct a critical yet under-explored analysis of the vision encoder of MLLMs through diverse and in-depth experiments, ranging from ImageNet classification and segmentation to gradient visualization. Our results demonstrate that the MLLM's post-training strategy (i.e., SFT or RL) not only leads to distinct outcomes on MLLM downstream tasks, but also fundamentally reshapes the MLLM's underlying visual representations. Specifically, the key finding of our study is that RL produces stronger and more precisely localized visual representations than SFT, boosting the ability of the vision encoder for the MLLM. We then reframe our findings into a simple recipe for building strong vision encoders for MLLMs, Preference-Instructed Vision OpTimization (PIVOT). When integrated into MLLMs, a PIVOT-trained vision encoder outperforms even larger and more heavily-trained counterparts, despite requiring less than 1% of the computational cost of standard vision pretraining. This result opens an effective and efficient path for advancing the vision backbones of MLLMs. Project page available at https://june-page.github.io/pivot/


See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation

Hu, Chih Yao, Lin, Yang-Sen, Lee, Yuna, Su, Chih-Hai, Lee, Jie-Ying, Tsai, Shr-Ruei, Lin, Chin-Yang, Chen, Kuan-Wen, Ke, Tsung-Wei, Liu, Yu-Lun

arXiv.org Artificial Intelligence

We present See, Point, Fly (SPF), a training-free aerial vision-and-language navigation (AVLN) framework built atop vision-language models (VLMs). SPF is capable of navigating to any goal based on any type of free-form instruction in any kind of environment. In contrast to existing VLM-based approaches that treat action prediction as a text generation task, our key insight is to consider action prediction for AVLN as a 2D spatial grounding task. SPF harnesses VLMs to decompose vague language instructions into iterative annotation of 2D waypoints on the input image. Along with the predicted traveling distance, SPF transforms predicted 2D waypoints into 3D displacement vectors as action commands for UAVs. Moreover, SPF adaptively adjusts the traveling distance to facilitate more efficient navigation. Notably, SPF performs navigation in a closed-loop control manner, enabling UAVs to follow dynamic targets in dynamic environments. SPF sets a new state of the art on the DRL simulation benchmark, outperforming the previous best method by an absolute margin of 63%. In extensive real-world evaluations, SPF outperforms strong baselines by a large margin. We also conduct comprehensive ablation studies to highlight the effectiveness of our design choices. Lastly, SPF shows remarkable generalization to different VLMs. Project page: https://spf-web.pages.dev
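The 2D-waypoint-to-3D-displacement step can be sketched with standard pinhole back-projection: invert the camera intrinsics to get a ray through the waypoint pixel, then scale by the predicted travel distance. This is a generic sketch under assumed intrinsics; the paper's exact transform and conventions may differ.

```python
import numpy as np

def waypoint_to_displacement(u, v, distance, K):
    """Back-project a 2D waypoint (u, v) in pixels into a unit-norm ray in
    the camera frame via K^-1 @ [u, v, 1], then scale by the predicted
    travel distance to get a 3D displacement command (sketch only)."""
    ray = np.linalg.inv(K) @ np.array([u, v, 1.0])
    ray /= np.linalg.norm(ray)
    return distance * ray

# Assumed pinhole intrinsics for a 640x480 image (fx = fy = 500).
K = np.array([[500.0,   0.0, 320.0],
              [  0.0, 500.0, 240.0],
              [  0.0,   0.0,   1.0]])

# A waypoint at the image center maps to a displacement straight ahead (+z).
d = waypoint_to_displacement(320.0, 240.0, 2.0, K)
print(d)  # -> [0. 0. 2.]
```

In a closed-loop setting this displacement would be re-expressed in the UAV's body or world frame via the current camera pose before being issued as an action command.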



SoftBank's Vision Fund mulls 20% job cuts after Son's pivot to AI

The Japan Times

SoftBank Group's Vision Fund is considering cutting as much as 20% of its staff, a person familiar with the matter said, underscoring a shift in CEO Masayoshi Son's focus to ambitious bets on artificial intelligence. The unit, which employed about 282 people as of the end of March, may shed more than 50 roles, the person said, asking not to be identified discussing private deliberations. The reduction extends years of cutbacks as the Vision Fund unit shrank in importance next to Son's growing appetite for big AI bets. Those include a plan to invest about $30 billion in OpenAI and a $6.5 billion deal to acquire chip designer Ampere Computing, which faces regulatory scrutiny.

