AITopics

cc3f5463bc4d26bc38eadc8bcffbc654-Paper.pdf

Neural Information Processing SystemsMar-21-2025, 21:32:42 GMT

artificial intelligence, convergence, machine learning, (15 more...)

Neural Information Processing Systems

Genre: Research Report (0.93)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

DeltaDEQ: Exploiting Heterogeneous Convergence for Accelerating Deep Equilibrium Iterations

Neural Information Processing SystemsMar-21-2025, 21:32:35 GMT

Implicit neural networks including deep equilibrium models have achieved superior task performance with better parameter efficiency in various applications. However, it is often at the expense of higher computation costs during inference. In this work, we identify a phenomenon named heterogeneous convergence that exists in deep equilibrium models and other iterative methods. We observe much faster convergence of state activations in certain dimensions therefore indicating the dimensionality of the underlying dynamics of the forward pass is much lower than the defined dimension of the states. We thereby propose to exploit heterogeneous convergence by storing past linear operation results (e.g., fully connected and convolutional layers) and only propagating the state activation when its change exceeds a threshold. Thus, for the already converged dimensions, the computations can be skipped. We verified our findings and reached 84% FLOPs reduction on the implicit neural representation task, 73% on the Sintel and 76% on the KITTI datasets for the optical flow estimation task while keeping comparable task accuracy with the models that perform the full update.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Europe (0.46)
North America > United States (0.28)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.87)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
(2 more...)

Add feedback

A Tasks description and assumptions used for the different method of reward shaping

Neural Information Processing SystemsMar-21-2025, 21:32:27 GMT

This supplementary material provides additional results and discussion, as well as implementation details. Section A summarises the different tasks and the assumption used in RIDE, EAGER, ELLA. Section B gives more details about training of the QA module and the agent. It also includes explanations of how we built the training data set for the QA module. Section C gathers several results on EAGER: comparison with behavioural cloning, generalisation capacity of QA, robustness results of EAGER... Section D contains a commented version of the EAGER algorithm. Table 1 describes the tasks used in the experiments with an example and if it has been used to train the QA module or the agent.

machine learning, reinforcement learning, trajectory, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

50eb39ab717507cccbe2b8590de32030-Paper-Conference.pdf

Neural Information Processing SystemsMar-21-2025, 21:32:24 GMT

machine learning, reinforcement learning, trajectory, (15 more...)

Neural Information Processing Systems

Genre:

Workflow (0.93)
Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

MGF: Mixed Gaussian Flow for Diverse Trajectory Prediction

Neural Information Processing SystemsMar-21-2025, 21:32:12 GMT

To predict future trajectories, the normalizing flow with a standard Gaussian prior suffers from weak diversity. The ineffectiveness comes from the conflict between the fact of asymmetric and multi-modal distribution of likely outcomes and symmetric and single-modal original distribution and supervision losses. Instead, we propose constructing a mixed Gaussian prior for a normalizing flow model for trajectory prediction. The prior is constructed by analyzing the trajectory patterns in the training samples without requiring extra annotations while showing better expressiveness and being multi-modal and asymmetric. Besides diversity, it also provides better controllability for probabilistic trajectory generation. We name our method Mixed Gaussian Flow (MGF). It achieves state-of-the-art performance in the evaluation of both trajectory alignment and diversity on the popular UCY/ETH and SDD datasets. Code is available at https://github.com/mulplue/MGF.

artificial intelligence, machine learning, prediction, (19 more...)

Neural Information Processing Systems

Country:

Europe (0.46)
Asia > China (0.28)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.67)
(2 more...)

Add feedback

William T. Stephenson

Neural Information Processing SystemsMar-21-2025, 21:32:05 GMT

Models like LASSO and ridge regression are extensively used in practice due to their interpretability, ease of use, and strong theoretical guarantees. Crossvalidation (CV) is widely used for hyperparameter tuning in these models, but do practical optimization methods minimize the true out-of-sample loss? A recent line of research promises to show that the optimum of the CV loss matches the optimum of the out-of-sample loss (possibly after simple corrections). It remains to show how tractable it is to minimize the CV loss. In the present paper, we show that, in the case of ridge regression, the CV loss may fail to be quasiconvex and thus may have multiple local optima. We can guarantee that the CV loss is quasiconvex in at least one case: when the spectrum of the covariate matrix is nearly flat and the noise in the observed responses is not too high. More generally, we show that quasiconvexity status is independent of many properties of the observed data (response norm, covariate-matrix right singular vectors, and singular-value scaling) and has a complex dependence on the few that remain. We empirically confirm our theory using simulated experiments.

artificial intelligence, machine learning, optimization problem, (17 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

cc298d5bc587e1b650f80e10449ee9d5-Paper.pdf

Neural Information Processing SystemsMar-21-2025, 21:32:02 GMT

artificial intelligence, machine learning, optimization problem, (19 more...)

Neural Information Processing Systems

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)

Add feedback

f56de5ef149cf0aedcc8f4797031e229-Supplemental.pdf

Neural Information Processing SystemsMar-21-2025, 21:31:55 GMT

artificial intelligence, point process, sequence, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

User-Dependent Neural Sequence Models for Continuous-Time Event Data Alex Boyd Robert Bamler 2 Stephan Mandt 1,2

Neural Information Processing SystemsMar-21-2025, 21:31:48 GMT

Continuous-time event data are common in applications such as individual behavior data, financial transactions, and medical health records. Modeling such data can be very challenging, in particular for applications with many different types of events, since it requires a model to predict the event types as well as the time of occurrence. Recurrent neural networks that parameterize time-varying intensity functions are the current state-of-the-art for predictive modeling with such data. These models typically assume that all event sequences come from the same data distribution. However, in many applications event sequences are generated by different sources, or users, and their characteristics can be very different. In this paper, we extend the broad class of neural marked point process models to mixtures of latent embeddings, where each mixture component models the characteristic traits of a given user. Our approach relies on augmenting these models with a latent variable that encodes user characteristics, represented by a mixture model over user behavior that is trained via amortized variational inference. We evaluate our methods on four large real-world datasets and demonstrate systematic improvements from our approach over existing work for a variety of predictive metrics such as log-likelihood, next event ranking, and source-of-sequence identification.

artificial intelligence, machine learning, sequence, (16 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.28)

Industry:

Media (0.94)
Information Technology > Security & Privacy (0.93)
Health & Medicine (0.66)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Online Inventory Problems Beyond the i . Setting with Online Convex Optimization

Neural Information Processing SystemsMar-21-2025, 21:31:34 GMT

We study multi-product inventory control problems where a manager makes sequential replenishment decisions based on partial historical information in order to minimize its cumulative losses. Our motivation is to consider general demands, losses and dynamics to go beyond standard models which usually rely on newsvendor-type losses, fixed dynamics, and unrealistic i.i.d.

artificial intelligence, inventory problem, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > France (0.14)
Asia > China (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Add feedback

Filters

Collaborating Authors

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

cc3f5463bc4d26bc38eadc8bcffbc654-Paper.pdf

DeltaDEQ: Exploiting Heterogeneous Convergence for Accelerating Deep Equilibrium Iterations

A Tasks description and assumptions used for the different method of reward shaping

50eb39ab717507cccbe2b8590de32030-Paper-Conference.pdf

MGF: Mixed Gaussian Flow for Diverse Trajectory Prediction

William T. Stephenson

cc298d5bc587e1b650f80e10449ee9d5-Paper.pdf

f56de5ef149cf0aedcc8f4797031e229-Supplemental.pdf

User-Dependent Neural Sequence Models for Continuous-Time Event Data Alex Boyd Robert Bamler 2 Stephan Mandt 1,2

Online Inventory Problems Beyond the i . Setting with Online Convex Optimization