AITopics | Neural Information Processing Systems

Collaborating Authors

Neural Information Processing Systems

Understanding Transformers via N-gram Statistics

Neural Information Processing SystemsJun-1-2025, 02:49:29 GMT

Transformer based large-language models (LLMs) display extreme proficiency with language yet a precise understanding of how they work remains elusive. One way of demystifying transformer predictions would be to describe how they depend on their context in terms of simple template functions. This paper takes a first step in this direction by considering families of functions (i.e.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Ohio (0.14)
North America > Mexico > Mexico City (0.14)

Genre: Research Report > Experimental Study (0.93)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

PerspectiveNet: A Scene-consistent Image Generator for New View Synthesis in Real Indoor Environments

David Novotny, Ben Graham, Jeremy Reizenstein

Neural Information Processing SystemsJun-1-2025, 02:48:16 GMT

Given a set of a reference RGBD views of an indoor environment, and a new viewpoint, our goal is to predict the view from that location.

artificial intelligence, machine learning, reference view, (15 more...)

Neural Information Processing Systems

Country: North America (0.14)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Add feedback

Supplementary Material: Einsum Benchmark

Neural Information Processing SystemsJun-1-2025, 02:47:48 GMT

For what purpose was the dataset created? The dataset was created with two primary purposes. First, it serves as a benchmark for einsum libraries, enabling the assessment of both the efficiency in determining contraction paths and the performance in executing einsum expressions. Second, it provides developers with a diverse set of einsum problem instances, thereby facilitating the development of more efficient, general-purpose einsum libraries. The dataset instances were created by the authors.

artificial intelligence, dataset, expression, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.98)

Add feedback

Einsum Benchmark: Enabling the Development of Next-Generation Tensor Execution Engines

Neural Information Processing SystemsJun-1-2025, 02:47:45 GMT

Modern artificial intelligence and machine learning workflows rely on efficient tensor libraries. However, tuning tensor libraries without considering the actual problems they are meant to execute can lead to a mismatch between expected performance and the actual performance. Einsum libraries are tuned to efficiently execute tensor expressions with only a few, relatively large, dense, floating-point tensors. But, practical applications of einsum cover a much broader range of tensor expressions than those that can currently be executed efficiently. For this reason, we have created a benchmark dataset that encompasses this broad range of tensor expressions, allowing future implementations of einsum to build upon and be evaluated against. In addition, we also provide generators for einsum expressions and converters to einsum expressions in our repository, so that additional data can be generated as needed. The benchmark dataset, the generators and converters are released openly and are publicly available at https://benchmark.einsum.org.

artificial intelligence, expression, machine learning, (15 more...)

Neural Information Processing Systems

Country: Asia (0.14)

Industry: Information Technology (0.46)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Add feedback

Hierarchical Optimal Transport for Document Representation

Mikhail Yurochkin, Sebastian Claici, Edward Chien, Farzaneh Mirzazadeh, Justin M. Solomon

Neural Information Processing SystemsJun-1-2025, 02:46:58 GMT

The ability to measure similarity between documents enables intelligent summarization and analysis of large corpora. Past distances between documents suffer from either an inability to incorporate semantic similarities between words or from scalability issues. As an alternative, we introduce hierarchical optimal transport as a meta-distance between documents, where documents are modeled as distributions over topics, which themselves are modeled as distributions over words. We then solve an optimal transport problem on the smaller topic space to compute a similarity score. We give conditions on the topics under which this construction defines a distance, and we relate it to the word mover's distance. We evaluate our technique for k-NN classification and show better interpretability and scalability with comparable performance to current methods at a fraction of the cost.

machine learning, natural language, wmd, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
North America > Canada (0.14)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Discriminator optimal transport

Akinori Tanaka

Neural Information Processing SystemsJun-1-2025, 02:44:26 GMT

We show that it improves inception score and FID calculated by unconditional GAN trained by CIFAR-10, STL-10 and a public pre-trained model of conditional GAN trained by ImageNet.

artificial intelligence, machine learning, optimal transport, (15 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō (0.28)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.15)
North America > United States > California > Los Angeles County > Long Beach (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Add feedback

Reviewer # 1 > It is better to add a section: comparison with related works, to highlight the main contributions

Neural Information Processing SystemsJun-1-2025, 02:44:11 GMT

We wish to express our appreciation to the reviewers for their insightful comments on our paper. All responses are reflected in our camera-ready version. Thank you for the proposal. We are sorry for that our writing makes itself hard to follow. Thank you for the important comment.

artificial intelligence, assumption, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.30)

Add feedback

Learning-Augmented Algorithms with Explicit Predictors

Neural Information Processing SystemsJun-1-2025, 02:44:00 GMT

Recent advances in algorithmic design show how to utilize predictions obtained by machine learning models from past and present data. These approaches have demonstrated an enhancement in performance when the predictions are accurate, while also ensuring robustness by providing worst-case guarantees when predictions fail. In this paper we focus on online problems; prior research in this context was focused on a paradigm where the algorithms are oblivious of the predictors' design, treating them as a black box. In contrast, in this work, we unpack the predictor and integrate the learning problem it gives rise for within the algorithmic challenge. In particular we allow the predictor to learn as it receives larger parts of the input, with the ultimate goal of designing online learning algorithms specifically tailored for the algorithmic task at hand. Adopting this perspective, we focus on a number of fundamental problems, including caching and scheduling, which have been well-studied in the black-box setting. For each of the problems, we introduce new algorithms that take advantage of explicit and carefully designed learning rules.

algorithm, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe (0.28)
Asia > Middle East > Israel (0.14)

Genre: Research Report > Experimental Study (0.92)

Industry:

Transportation > Air (0.54)
Education > Educational Setting > Online (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.34)

Add feedback

Extracting Training Data from Molecular Pre-trained Models

Neural Information Processing SystemsJun-1-2025, 02:43:40 GMT

Graph Neural Networks (GNNs) have significantly advanced the field of drug discovery, enhancing the speed and efficiency of molecular identification. However, training these GNNs demands vast amounts of molecular data, which has spurred the emergence of collaborative model-sharing initiatives. These initiatives facilitate the sharing of molecular pre-trained models among organizations without exposing proprietary training data. Despite the benefits, these molecular pre-trained models may still pose privacy risks. For example, malicious adversaries could perform data extraction attack to recover private training data, thereby threatening commercial secrets and collaborative trust.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (0.93)

Industry: