AITopics

ASPEN: Breaking Operator Barriers for Efficient Parallel Execution of Deep Neural Networks

Neural Information Processing Systems, May-25-2025, 13:28:18 GMT

Modern Deep Neural Network (DNN) frameworks use tensor operators as the main building blocks of DNNs. However, we observe that operator-based construction of DNNs incurs significant drawbacks in parallelism in the form of synchronization barriers. Synchronization barriers of operators confine the scope of parallel computation to each operator and obscure the rich parallel computation opportunities that exist across operators. To this end, we present ASPEN, a novel parallel computation solution for DNNs that allows fine-grained dynamic execution of DNNs, which (1) removes the operator barriers and expresses DNNs in dataflow graphs of fine-grained tiles to expose the parallel computation opportunities across operators, and (2) exploits these opportunities by dynamically locating and scheduling them at runtime. This novel approach of ASPEN enables opportunistic parallelism, a new class of parallelism for DNNs that is unavailable in the existing operator-based approaches. ASPEN also achieves high resource utilization and memory reuse by letting each resource asynchronously traverse depthwise in the DNN graph to its full computing potential. We provide challenges and solutions to our approach and show that our proof-of-concept implementation of ASPEN on CPU shows exceptional performance, outperforming the state-of-the-art inference systems TorchScript and TVM by up to 3.2× and 4.3×, respectively.
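The idea of replacing operator barriers with a dataflow graph of tiles can be illustrated with a small, single-threaded sketch. It is not ASPEN's engine: the tile names, the `deps` dictionary, and the `schedule_tiles` function are hypothetical, and a last-in-first-out ready list stands in for the depthwise traversal the abstract describes. A tile becomes runnable as soon as its own parent tiles finish, regardless of whether the rest of its parents' layer is done.

```python
from collections import deque


def schedule_tiles(deps):
    """Return an execution order in which a tile runs once all of its parents are done.

    deps maps each tile name to the list of tile names it depends on.
    """
    remaining = {tile: len(parents) for tile, parents in deps.items()}
    children = {tile: [] for tile in deps}
    for tile, parents in deps.items():
        for parent in parents:
            children[parent].append(tile)

    # Tiles with no parents are ready immediately.
    ready = deque(tile for tile, count in remaining.items() if count == 0)
    order = []
    while ready:
        tile = ready.pop()  # LIFO: prefer the most recently unlocked (deeper) tile
        order.append(tile)
        for child in children[tile]:
            remaining[child] -= 1
            if remaining[child] == 0:
                ready.append(child)
    return order


# Hypothetical three-layer network split into tiles (names are illustrative only).
deps = {
    "L1.t0": [],
    "L1.t1": [],
    "L2.t0": ["L1.t0"],
    "L2.t1": ["L1.t1"],
    "L3.t0": ["L2.t0", "L2.t1"],
}
order = schedule_tiles(deps)
print(order)
```

In this toy graph a layer-2 tile is executed before layer 1 has finished, which an operator-level barrier would forbid. A real system would run the ready tiles on multiple workers concurrently.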

artificial intelligence, execution, machine learning, (16 more...)

Country: North America > United States > Colorado (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

d891d240b5784656a0356bf4b00f5cdd-Paper-Conference.pdf

Neural Information Processing Systems, May-25-2025, 13:28:05 GMT

artificial intelligence, classifier, machine learning, (14 more...)

Country: North America > United States (0.46)

Genre: Research Report (0.69)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

d862f7f5445255090de13b825b880d59-Paper-Conference.pdf

Neural Information Processing Systems, May-25-2025, 13:27:45 GMT

large language model, machine learning, natural language, (17 more...)

Country:

Europe (0.67)
North America > United States > Louisiana (0.14)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.90)

Faster Discrete Convex Function Minimization with Predictions: The M-Convex Case

Taihei Oki, The University of Tokyo, Tokyo, Japan, oki@mist.i.u-tokyo.ac.jp
Shinsaku Sakaue, The University of Tokyo, Tokyo, Japan, sakaue@mist.i.u-tokyo.ac.jp

Neural Information Processing Systems, May-25-2025, 13:27:31 GMT

algorithm, artificial intelligence, optimization problem, (14 more...)

Country:

Asia > Japan (0.14)
Europe (0.14)
North America > United States > New York (0.14)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

Faster Discrete Convex Function Minimization with Predictions: The M-Convex Case

Taihei Oki, The University of Tokyo, Tokyo, Japan, oki@mist.i.u-tokyo.ac.jp
Shinsaku Sakaue, The University of Tokyo, Tokyo, Japan, sakaue@mist.i.u-tokyo.ac.jp

Neural Information Processing Systems, May-25-2025, 13:27:27 GMT

algorithm, artificial intelligence, optimization problem, (13 more...)

Country:

Asia > Japan (0.14)
Europe (0.14)
North America > United States > New York (0.14)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

The probability flow ODE is provably fast

Sitan Chen, Holden Lee, Yuanzhi Li

Neural Information Processing Systems, May-25-2025, 13:27:09 GMT

We provide the first polynomial-time convergence guarantees for the probability flow ODE implementation (together with a corrector step) of score-based generative modeling with an OU forward process. Our analysis is carried out in the wake of recent results obtaining such guarantees for the SDE-based implementation (i.e., denoising diffusion probabilistic modeling or DDPM), but requires the development of novel techniques for studying deterministic dynamics without contractivity. Through the use of a specially chosen corrector step based on the underdamped Langevin diffusion, we obtain better dimension dependence than prior works on DDPM (O(√d) vs. O(d), assuming smoothness of the data distribution), highlighting potential advantages of the ODE framework.
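As a rough illustration of what a probability flow ODE looks like for an OU forward process, the sketch below integrates the reverse-time ODE dy/dt = y + ∇ log q_{T−t}(y) with plain Euler steps. It makes several assumptions that are not in the abstract: the forward process is taken to be dX = −X dt + √2 dB, the data are one-dimensional Gaussian so that the score is known exactly, and the corrector step that the paper relies on is omitted entirely. The function names are hypothetical.

```python
import math


def ou_variance(t, data_var):
    """Variance of the marginal q_t for data ~ N(0, data_var) under dX = -X dt + sqrt(2) dB."""
    return 1.0 + (data_var - 1.0) * math.exp(-2.0 * t)


def score(x, t, data_var):
    """Exact score of the Gaussian marginal q_t: grad log q_t(x) = -x / var_t."""
    return -x / ou_variance(t, data_var)


def probability_flow_sample(y0, horizon, data_var, n_steps=2000):
    """Euler-integrate dy/dt = y + score(y, T - t) from t = 0 to t = T (no corrector)."""
    h = horizon / n_steps
    y = y0
    for k in range(n_steps):
        t = k * h
        y += h * (y + score(y, horizon - t, data_var))
    return y


DATA_VAR = 4.0   # assumed data distribution N(0, 4), i.e. standard deviation 2
HORIZON = 2.0    # assumed forward-process time T

# Start one standard deviation out under q_T and flow back to time 0.
start = math.sqrt(ou_variance(HORIZON, DATA_VAR))
end = probability_flow_sample(start, HORIZON, DATA_VAR)
print(start, end)
```

Because the dynamics are deterministic, a point one standard deviation out under q_T is carried to approximately one standard deviation out under the data distribution, up to Euler discretization error. With a learned score in place of the exact one, the errors would compound differently, which is where the paper's corrector step comes in.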

artificial intelligence, deep learning, machine learning, (15 more...)

Country:

Europe > Switzerland (0.14)
Europe > France (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Vision (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

d842425e4bf79ba039352da0f658a906-Supplemental-Conference.pdf

Neural Information Processing Systems, May-25-2025, 13:26:55 GMT

artificial intelligence, machine learning, natural language, (21 more...)

Country: North America > United States > Pennsylvania (0.14)

Genre: Personal (0.34)

Industry:

Leisure & Entertainment (0.68)
Health & Medicine (0.68)
Government > Regional Government > North America Government > United States Government (0.47)
Media > Music (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Toolformer: Language Models Can Teach Themselves to Use Tools

Neural Information Processing Systems, May-25-2025, 13:26:51 GMT

Language models (LMs) exhibit remarkable abilities to solve new tasks from just a few examples or textual instructions, especially at scale. They also, paradoxically, struggle with basic functionality, such as arithmetic or factual lookup, where much simpler and smaller specialized models excel. In this paper, we show that LMs can teach themselves to use external tools via simple APIs and achieve the best of both worlds. We introduce Toolformer, a model trained to decide which APIs to call, when to call them, what arguments to pass, and how to best incorporate the results into future token prediction. This is done in a self-supervised way, requiring nothing more than a handful of demonstrations for each API. We incorporate a range of tools, including a calculator, a Q&A system, a search engine, a translation system, and a calendar. Toolformer achieves substantially improved zero-shot performance across a variety of downstream tasks, often competitive with much larger models, without sacrificing its core language modeling abilities.
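To make "incorporate the results into future token prediction" concrete, the sketch below executes inline tool calls embedded in text and splices each result back in place. The bracketed call syntax, the `->` separator, the `TOOLS` registry, and the function names are all assumptions chosen for illustration; the sketch covers only the execution step with a toy calculator and says nothing about how the model learns where to place calls.

```python
import ast
import operator
import re

# Arithmetic operators the toy calculator accepts.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def _eval(node):
    """Evaluate a parsed arithmetic expression without calling eval()."""
    if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
        return node.value
    if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
        return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
    raise ValueError("unsupported expression")


def calculator(expression):
    """Hypothetical calculator tool: returns the result rounded to two decimals."""
    return round(_eval(ast.parse(expression, mode="eval").body), 2)


TOOLS = {"Calculator": calculator}            # hypothetical tool registry
CALL = re.compile(r"\[(\w+)\(([^\]]*)\)\]")   # matches e.g. [Calculator(1+2)]


def execute_calls(text):
    """Replace each known [Tool(args)] marker with [Tool(args) -> result]."""
    def run(match):
        name, arg = match.group(1), match.group(2)
        if name not in TOOLS:
            return match.group(0)  # leave unknown tools untouched
        return f"[{name}({arg}) -> {TOOLS[name](arg)}]"
    return CALL.sub(run, text)


annotated = execute_calls("Out of 1400 participants, 400 [Calculator(400/1400)] passed.")
print(annotated)
```

The annotated string would then serve as context for predicting the tokens that follow, so the tool result is available to the model at the point where it is needed.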

large language model, machine learning, natural language, (20 more...)

Country: North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Industry: Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

Temporal Graph Neural Tangent Kernel with Graphon-Guaranteed

Katherine Tieu

Neural Information Processing Systems, May-25-2025, 13:26:45 GMT

Graph Neural Tangent Kernel (GNTK) fuses graph neural networks and graph kernels, simplifies the process of graph representation learning, interprets the training dynamics of graph neural networks, and serves various applications like protein identification, image segmentation, and social network analysis. In practice, graph data carries complex information among entities that inevitably evolves over time, and previous static graph neural tangent kernel methods may be stuck in the sub-optimal solution in terms of both effectiveness and efficiency. As a result, extending the advantage of GNTK to temporal graphs becomes a critical problem. To this end, we propose the temporal graph neural tangent kernel, which not only extends the simplicity and interpretation ability of GNTK to the temporal setting but also leads to rigorous temporal graph classification error bounds.

artificial intelligence, machine learning, temp-g 3, (19 more...)

Country:

North America > United States > Texas (0.28)
North America > United States > California > Los Angeles County > Long Beach (0.14)

Genre: Research Report > Experimental Study (1.00)

Industry:

Information Technology (1.00)
Government > Regional Government > North America Government > United States Government (0.67)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)

rPPG-Toolbox: Deep Remote PPG Toolbox

Neural Information Processing Systems, May-25-2025, 13:24:29 GMT

Camera-based physiological measurement is a fast-growing field of computer vision. Remote photoplethysmography (rPPG) uses imaging devices (e.g., cameras) to measure the peripheral blood volume pulse (BVP), enabling cardiac measurement via webcams and smartphones. However, the task is non-trivial: important pre-processing, modeling, and post-processing steps are required to obtain state-of-the-art results. Replication of results and benchmarking of new models are critical for scientific progress; however, as with many other applications of deep learning, reliable codebases are not easy to find or use.

artificial intelligence, deep learning, machine learning, (17 more...)

Country: North America > United States (0.28)

Industry: