AITopics | Optimization

However, AdaMM is not suited for solving black-box optimization problems, where explicit gradient forms are difficult or infeasible to obtain. In this paper, we propose a zeroth-order AdaMM (ZO-AdaMM) algorithm, that generalizes AdaMM to the gradient-free regime.

artificial intelligence, machine learning, optimization, (15 more...)

Neural Information Processing Systems

Country:

North America > United States (0.28)
Europe (0.28)

Industry: Transportation > Air (0.62)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing SystemsOct-2-2025, 18:38:12 GMT

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. Summary: The paper describes an efficient optimization approach to find structured low-rank matrices. The structure is encoded by a linear map and enforcing low rank is achieved by adding to the cost function the nuclear norm of the structured matrix. The cost function is optimized with a generalized conditional gradient algorithm. By using a factorization of the large structured matrix the optimization is accelerated further.

algorithm, formulation, matrix, (10 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.05)

Genre: Research Report (0.72)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.50)

Add feedback

Gradient Surgery for Multi-Task Learning Tianhe Y u

Neural Information Processing SystemsOct-2-2025, 18:17:06 GMT

We propose a form of gradient surgery that projects a task's gradient onto the normal plane of the gradient of

artificial intelligence, inductive learning, machine learning, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games

Kaiqing Zhang, Zhuoran Yang, Tamer Basar

Neural Information Processing SystemsOct-2-2025, 18:12:06 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
North America > Canada (0.04)

Industry: Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.94)
Information Technology > Game Theory (0.94)
(2 more...)

Add feedback

Learning Reward Machines for Partially Observable Reinforcement Learning

Neural Information Processing SystemsOct-2-2025, 17:53:20 GMT

The use of neural networks for function approximation has led to many recent advances in Reinforcement Learning (RL) .

agent, cookie, reward machine, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.15)
South America > Chile (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.69)

Add feedback

Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing SystemsOct-2-2025, 17:52:57 GMT

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. The paper presents a new technique for solving MDPs. The new technique, presented as an alternative to approximate policy/value iteration, consists in directly minimizing the Optimal Bellman Residual (OBR). The authors first motivate their method by showing that the loss bound of OBR is often tighter than the loss bound of policy/value iteration, which is a known result [9,15]. The authors then show that an empirical estimate of OBR is consistent in the Vapnick sense, i.e. minimizing the empirical OBR is equivalent to minimizing an upper bound on the true OBR, which is unknown when the MDP model is unknown. Finally, the authors show that OBR can be decomposed into a difference of two convex functions, and a standard Difference of Convex Functions (DC) optimization method can be used for finding a local optimum.

contribution, convex function, decomposition, (12 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.05)

Genre: Research Report > New Finding (0.69)

Technology: