AITopics

Plotting

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Neuro-Symbolic Data Generation for Math Reasoning Zenan Li1 Zhi Zhou 1 Yuan Yao

Neural Information Processing SystemsMay-28-2025, 21:26:51 GMT

A critical question about Large Language Models (LLMs) is whether their apparent deficiency in mathematical reasoning is inherent, or merely a result of insufficient exposure to high-quality mathematical data. To explore this, we developed an automated method for generating high-quality, supervised mathematical datasets. The method carefully mutates existing math problems, ensuring both diversity and validity of the newly generated problems. This is achieved by a neuro-symbolic data generation framework combining the intuitive informalization strengths of LLMs, and the precise symbolic reasoning of math solvers along with projected Markov chain Monte Carlo sampling in the highly-irregular symbolic space. Empirical experiments demonstrate the high quality of data generated by the proposed method, and that the LLMs, specifically LLaMA-2 and Mistral, when realigned with the generated data, surpass their state-of-the-art counterparts.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Scalarization for Multi-Task and Multi-Domain Learning at Scale

Neural Information Processing SystemsMay-28-2025, 21:24:50 GMT

Training a single model on multiple input domains and/or output tasks allows for compressing information from multiple sources into a unified backbone hence improves model efficiency. It also enables potential positive knowledge transfer across tasks/domains, leading to improved accuracy and data-efficient training. However, optimizing such networks is a challenge, in particular due to discrepancies between the different tasks or domains: Despite several hypotheses and solutions proposed over the years, recent work has shown that uniform scalarization training, i.e., simply minimizing the average of the task losses, yields on-par performance with more costly SotA optimization methods. This raises the issue of how well we understand the training dynamics of multi-task and multi-domain networks. In this work, we first devise a large-scale unified analysis of multi-domain and multi-task learning to better understand the dynamics of scalarization across varied task/domain combinations and model sizes. Following these insights, we then propose to leverage population-based training to efficiently search for the optimal scalarization weights when dealing with a large number of tasks or domains.

artificial intelligence, machine learning, scalarization, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
North America > Canada > Ontario > Toronto (0.14)
Europe > Netherlands (0.14)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)
(2 more...)

Add feedback

5011bf6d8a37692913fce3a15a51f070-Supplemental.pdf

Neural Information Processing SystemsMay-28-2025, 21:24:42 GMT

artificial intelligence, gradient, machine learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.71)

Add feedback

5011bf6d8a37692913fce3a15a51f070-Paper.pdf

Neural Information Processing SystemsMay-28-2025, 21:24:39 GMT

arxiv preprint arxiv, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.14)

Genre: Research Report > New Finding (0.68)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

PAC-Bayes Learning Bounds for Sample-Dependent Priors

Neural Information Processing SystemsMay-28-2025, 21:24:23 GMT

We present a series of new PAC-Bayes learning guarantees for randomized algorithms with sample-dependent priors.

artificial intelligence, generalization, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > Canada (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

500ee9106e0e4d8f769fadfdf9f2837e-Paper.pdf

Neural Information Processing SystemsMay-28-2025, 21:24:04 GMT

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Europe (0.68)
North America > United States > New York (0.14)
North America > United States > California (0.14)
Asia > Middle East > Israel (0.14)

Genre: Research Report (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Data Science > Data Mining > Big Data (0.47)

Add feedback

into a robustified action set

Neural Information Processing SystemsMay-28-2025, 21:23:56 GMT

We illustrate the online optimization process of RCL in Figure 1. Finally, we discuss the performance of RCL. We consider the management of N batteries. This problem falls into SOCO based on the reduction framework described in [49]. We consider each problem instance as one day (T = 24 hours, plus an initial action).

artificial intelligence, competitive ratio, machine learning, (18 more...)

Neural Information Processing Systems

Country: Europe (0.14)

Industry:

Energy (0.68)
Transportation > Ground > Road (0.47)
Transportation > Electric Vehicle (0.47)
Automobiles & Trucks (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Robust Learning for Smoothed Online Convex Optimization with Feedback Delay Jianyi Yang University of California Riverside University of California Riverside Riverside, CA, USA

Neural Information Processing SystemsMay-28-2025, 21:23:52 GMT

We study a challenging form of Smoothed Online Convex Optimization, a.k.a.

artificial intelligence, machine learning, ml prediction, (17 more...)

Neural Information Processing Systems

Country: North America > United States > California > Riverside County > Riverside (1.00)

Genre: Research Report > New Finding (0.46)

Industry:

Transportation > Ground > Road (0.94)
Transportation > Electric Vehicle (0.94)
Automobiles & Trucks (0.94)
Energy (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs

Neural Information Processing SystemsMay-28-2025, 21:23:41 GMT

Most large multimodal models (LMMs) are implemented by feeding visual tokens as a sequence into the first layer of a large language model (LLM). The resulting architecture is simple but significantly increases computation and memory costs, as it has to handle a large number of additional tokens in its input layer. This paper presents a new architecture DeepStack for LMMs. Considering N layers in the language and vision transformer of LMMs, we stack the visual tokens into N groups and feed each group to its aligned transformer layer from bottom to top, as illustrated in Figure 1. Surprisingly, this simple method greatly enhances the power of LMMs to model interactions among visual tokens across layers but with minimal additional cost. We apply DeepStack to both language and vision transformer in LMMs, and validate the effectiveness of DeepStack LMMs with extensive empirical results. Using the same context length, our DeepStack 7B and 13B parameters surpass their counterparts by 2.7 and 2.9 on average across 9 benchmarks, respectively.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (0.93)

Technology: