LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

Neural Information Processing Systems

Building agents with tree-search planning capabilities on top of learned models has achieved remarkable success in classic decision-making problems such as Go and Atari. However, it has been deemed challenging or even infeasible to extend Monte Carlo Tree Search (MCTS) based algorithms to diverse real-world applications, especially when these environments involve complex action spaces, significant simulation costs, or inherent stochasticity. In this work, we introduce LightZero, the first unified benchmark for deploying MCTS/MuZero in general sequential decision scenarios. Specifically, we summarize the most critical challenges in designing a general MCTS-style decision-making solver, then decompose the tightly coupled algorithm and system design of tree-search RL methods into distinct sub-modules. By incorporating more appropriate exploration and optimization strategies, we can significantly enhance these sub-modules and construct powerful LightZero agents to tackle tasks across a wide range of domains, such as board games, Atari, MuJoCo, MiniGrid and GoBigger. Detailed benchmark results reveal the significant potential of such methods in building scalable and efficient decision intelligence.
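To make the tree-search loop concrete, here is a minimal, self-contained sketch of the UCT selection and backpropagation sub-routines that MCTS-style solvers build on; the `Node` structure and the exploration constant `c` are illustrative choices, not LightZero's actual sub-module interfaces.

```python
# Minimal sketch of the core MCTS sub-routines (illustrative, not LightZero code).
import math

class Node:
    def __init__(self, parent=None):
        self.parent = parent
        self.children = {}      # action -> Node
        self.visit_count = 0
        self.value_sum = 0.0

    def value(self):
        # Mean value of all simulations passing through this node.
        return self.value_sum / self.visit_count if self.visit_count else 0.0

def uct_score(parent, child, c=1.4):
    # UCB1 applied to trees: exploitation term plus exploration bonus.
    if child.visit_count == 0:
        return float("inf")
    return child.value() + c * math.sqrt(math.log(parent.visit_count) / child.visit_count)

def select_child(node):
    # Descend to the child maximizing the UCT score.
    return max(node.children.items(), key=lambda kv: uct_score(node, kv[1]))

def backpropagate(node, value):
    # Propagate a simulation/evaluation result back up to the root.
    while node is not None:
        node.visit_count += 1
        node.value_sum += value
        node = node.parent

if __name__ == "__main__":
    root = Node()
    root.visit_count = 1
    root.children = {0: Node(root), 1: Node(root)}
    action, child = select_child(root)   # unvisited children score infinity
    backpropagate(child, value=1.0)
```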


Supplementary Material: Continuous-Time Functional Diffusion Processes

Neural Information Processing Systems

A Reverse Functional Diffusion Processes. In this section, we review the mathematical details needed to obtain the backward process; we then move to a different approach in Appendix A.2. The work in Föllmer (1986) is based on a finite entropy condition, which we report here as Condition 1. Notice that if Assumption 1 is true, then Condition 1 holds (Föllmer (1986), Thm.). The proof can be obtained by adapting the result of Lemma 3.6 of Föllmer & Wakolbinger (1986). Theorem 4: Let Q be a finite entropy measure. For the proof, we refer to Theorem 3.14 of Föllmer & Wakolbinger (1986). This assumption is simply the translation of H1 from Millet et al. (1989) to our notation.
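For context, the finite-dimensional time-reversal result that the functional setting generalizes can be sketched as follows; the notation (drift f, diffusion coefficient g) is our own illustrative choice, not the paper's infinite-dimensional statement.

```latex
% Classical time reversal of a diffusion (Föllmer 1986; Millet et al. 1989),
% sketched in finite dimensions with illustrative notation.
% Forward process on [0, T]:
%   dX_t = f(X_t, t)\,dt + g(t)\,dW_t .
% Writing p_t for the marginal density of X_t, the reversed process
% \bar{X}_t := X_{T-t} solves, under a finite entropy condition,
\[
  d\bar{X}_t
    = \Bigl[\, -f(\bar{X}_t,\, T-t)
      + g(T-t)^2\, \nabla_x \log p_{T-t}(\bar{X}_t) \Bigr]\, dt
      + g(T-t)\, d\bar{W}_t ,
\]
% where \bar{W}_t is a Brownian motion with respect to the backward filtration.
```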


Snap ML: A Hierarchical Framework for Machine Learning

Neural Information Processing Systems

We describe a new software framework for fast training of generalized linear models. The framework, named Snap Machine Learning (Snap ML), combines recent advances in machine learning systems and algorithms in a nested manner to reflect the hierarchical architecture of modern computing systems. We prove theoretically that such a hierarchical system can accelerate training in distributed environments where intra-node communication is cheaper than inter-node communication. Additionally, we provide a review of the implementation of Snap ML in terms of GPU acceleration, pipelining, communication patterns and software architecture, highlighting aspects that were critical for achieving high performance. We evaluate the performance of Snap ML in both single-node and multi-node environments, quantifying the benefit of the hierarchical scheme and the data streaming functionality, and comparing with other widely-used machine learning software frameworks. Finally, we present a logistic regression benchmark on the Criteo Terabyte Click Logs dataset and show that Snap ML achieves the same test loss an order of magnitude faster than any of the previously reported results, including those obtained using TensorFlow and scikit-learn.
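The theoretical claim above can be illustrated with a back-of-the-envelope cost model (an illustrative sketch, not Snap ML code; all timing constants are made up): aggregating updates inside each node before communicating across nodes trades expensive inter-node messages for cheap intra-node ones.

```python
# Toy cost model for flat vs. hierarchical communication (illustrative only).

def flat_round_cost(workers, t_inter):
    # Every worker communicates over the (slow) inter-node network each round.
    return workers * t_inter

def hierarchical_round_cost(nodes, gpus_per_node, t_intra, t_inter):
    # Workers reduce inside their node first; then one message per node
    # crosses the inter-node network.
    return gpus_per_node * t_intra + nodes * t_inter

if __name__ == "__main__":
    K, G = 4, 8                    # hypothetical: 4 nodes, 8 GPUs each
    t_intra, t_inter = 1.0, 20.0   # intra-node links assumed 20x cheaper
    flat = flat_round_cost(K * G, t_inter)
    hier = hierarchical_round_cost(K, G, t_intra, t_inter)
    print(f"flat: {flat:.0f}  hierarchical: {hier:.0f}  speedup: {flat / hier:.1f}x")
```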



VLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval

Neural Information Processing Systems

The key novelty of VLC is to add a synthetic hard negative image generated from the synthetic text, resulting in two image-to-text retrieval examples (one for each image) and, more importantly, two text-to-image retrieval examples (one for each text).
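Scoring such an instance reduces to a 2x2 similarity matrix between the two images and the two texts; the sketch below (hypothetical `sim` scores, not the paper's evaluation code) shows how the two image-to-text and two text-to-image decisions are read off it.

```python
# Hedged sketch of scoring one VLC-style instance: index 0 = original
# image/text, index 1 = synthetic hard negatives.
import numpy as np

def evaluate_instance(sim):
    # sim[i, j] = score of image i against text j; matching pairs sit on
    # the diagonal.
    i2t = [sim[i, i] > sim[i, 1 - i] for i in range(2)]  # image-to-text
    t2i = [sim[j, j] > sim[1 - j, j] for j in range(2)]  # text-to-image
    return i2t, t2i

if __name__ == "__main__":
    sim = np.array([[0.9, 0.4],   # made-up similarity scores
                    [0.3, 0.8]])
    print(evaluate_instance(sim))  # ([True, True], [True, True])
```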



An Information Theoretic Perspective on Conformal Prediction

Neural Information Processing Systems

Conformal Prediction (CP) is a distribution-free uncertainty estimation framework that constructs prediction sets guaranteed to contain the true answer with a user-specified probability. Intuitively, the size of the prediction set encodes a general notion of uncertainty, with larger sets associated with higher degrees of uncertainty. In this work, we leverage information theory to connect conformal prediction to other notions of uncertainty. More precisely, we prove three different ways to upper bound the intrinsic uncertainty, as described by the conditional entropy of the target variable given the inputs, by combining CP with information-theoretic inequalities. Moreover, we demonstrate two direct and useful applications of this connection between conformal prediction and information theory: (i) more principled and effective conformal training objectives that generalize previous approaches and enable end-to-end training of machine learning models from scratch, and (ii) a natural mechanism to incorporate side information into conformal prediction. We empirically validate both applications in centralized and federated learning settings, showing that our theoretical results translate to lower inefficiency (average prediction set size) for popular CP methods.
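As background, the standard split conformal construction that underlies these coverage guarantees can be sketched in a few lines (synthetic nonconformity scores; this is the generic recipe, not the paper's training objectives or bounds):

```python
# Minimal split conformal prediction sketch with synthetic scores.
import numpy as np

def conformal_sets(cal_scores, test_scores, alpha=0.1):
    # cal_scores: nonconformity of the true class on a calibration set, (n,).
    # test_scores: nonconformity of every class per test point, (m, classes).
    n = len(cal_scores)
    # Finite-sample-corrected quantile yields P(y in C(x)) >= 1 - alpha.
    q = np.quantile(cal_scores, np.ceil((n + 1) * (1 - alpha)) / n,
                    method="higher")
    return [np.where(s <= q)[0] for s in test_scores]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    cal = rng.uniform(size=1000)        # toy calibration scores
    test = rng.uniform(size=(5, 10))    # 5 test points, 10 classes
    sets = conformal_sets(cal, test)
    print([len(s) for s in sets])       # set sizes, i.e. "inefficiency"
```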


Efficient Deep Approximation of GMMs

Neural Information Processing Systems

The universal approximation theorem states that any continuous function on a compact domain can be approximated arbitrarily well by a neural network with a single hidden layer. Some recent work has shown that, for certain special functions, the number of nodes needed in such an approximation can be exponentially reduced by using multi-layer neural networks.
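For intuition, the target a network must represent for a GMM is a log-sum-exp of quadratics (the log-density); the sketch below (synthetic parameters, our own notation, not the paper's construction) computes exactly that target.

```python
# GMM log-density as the approximation target (illustrative sketch).
import numpy as np

def gmm_log_density(x, weights, means, cov_invs, log_norms):
    # log p(x) = logsumexp_k [ log w_k + c_k - 0.5 (x - mu_k)^T S_k (x - mu_k) ],
    # with S_k the inverse covariance and c_k the Gaussian normalizer.
    quads = np.array([-0.5 * (x - m) @ S @ (x - m)
                      for m, S in zip(means, cov_invs)])
    logits = np.log(weights) + log_norms + quads
    m = logits.max()
    return m + np.log(np.exp(logits - m).sum())  # stable log-sum-exp

if __name__ == "__main__":
    d, K = 2, 3
    rng = np.random.default_rng(0)
    weights = np.full(K, 1.0 / K)
    means = rng.normal(size=(K, d))
    cov_invs = np.stack([np.eye(d)] * K)              # unit covariances
    log_norms = np.full(K, -0.5 * d * np.log(2 * np.pi))
    print(gmm_log_density(np.zeros(d), weights, means, cov_invs, log_norms))
```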


Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Neural Information Processing Systems

We aim to evaluate Large Language Models (LLMs) for embodied decision making. While a significant body of work has leveraged LLMs for decision making in embodied environments, we still lack a systematic understanding of their performance, because they are usually applied in different domains, for different purposes, and built with different inputs and outputs. Furthermore, existing evaluations tend to rely solely on a final success rate, making it difficult to pinpoint which abilities LLMs are missing and where the problem lies, which in turn blocks embodied agents from leveraging LLMs effectively and selectively.