AITopics | observable state variable

Nonparametric Density Estimation for Stochastic Optimization with an Observable State Variable

Neural Information Processing SystemsFeb-16-2024, 10:37:37 GMT

We study convex stochastic optimization problems where a noisy objective function value is observed after a decision is made. There are many stochastic optimization problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimation for the joint distribution of state-outcome pairs to create weights for previous observations. Those similar to the current state are used to create a convex, deterministic approximation of the objective function.

nonparametric density estimation, observable state variable, stochastic optimization, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.90)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.65)

Add feedback

Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation

Yang, Wenyan, Angleraud, Alexandre, Pieters, Roel S., Pajarinen, Joni, Kämäräinen, Joni-Kristian

arXiv.org Artificial IntelligenceMar-5-2023

Robot control for tactile feedback-based manipulation can be difficult due to the modeling of physical contacts, partial observability of the environment, and noise in perception and control. This work focuses on solving partial observability of contact-rich manipulation tasks as a Sequence-to-Sequence (Seq2Seq)} Imitation Learning (IL) problem. The proposed Seq2Seq model produces a robot-environment interaction sequence to estimate the partially observable environment state variables. Then, the observed interaction sequence is transformed to a control sequence for the task itself. The proposed Seq2Seq IL for tactile feedback-based manipulation is experimentally validated on a door-open task in a simulated environment and a snap-on insertion task with a real robot. The model is able to learn both tasks from only 50 expert demonstrations, while state-of-the-art reinforcement learning and imitation learning methods fail.

artificial intelligence, demonstration, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2303.02646

Country:

North America > United States > Montana (0.04)
Europe > Finland > Pirkanmaa > Tampere (0.04)

Genre:

Research Report (0.50)
Instructional Material (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Nonparametric Density Estimation for Stochastic Optimization with an Observable State Variable

Hannah, Lauren, Powell, Warren, Blei, David M.

Neural Information Processing SystemsFeb-15-2020, 01:12:36 GMT

We study convex stochastic optimization problems where a noisy objective function value is observed after a decision is made. There are many stochastic optimization problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimation for the joint distribution of state-outcome pairs to create weights for previous observations. Those similar to the current state are used to create a convex, deterministic approximation of the objective function.

nonparametric density estimation, observable state variable, stochastic optimization, (4 more...)

Neural Information Processing Systems

Genre: Research Report (0.42)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.90)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.65)

Add feedback

Nonparametric Density Estimation for Stochastic Optimization with an Observable State Variable

Hannah, Lauren, Powell, Warren, Blei, David M.

Neural Information Processing SystemsDec-31-2010

We study convex stochastic optimization problems where a noisy objective function value is observed after a decision is made. There are many stochastic optimization problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimation for the joint distribution of state-outcome pairs to create weights for previous observations. The weights effectively group similar states. Those similar to the current state are used to create a convex, deterministic approximation of the objective function. We propose two solution methods that depend on the problem characteristics: function-based and gradient-based optimization. We offer two weighting schemes, kernel based weights and Dirichlet process based weights, for use with the solution methods. The weights and solution methods are tested on a synthetic multi-product newsvendor problem and the hour ahead wind commitment problem. Our results show Dirichlet process weights can offer substantial benefits over kernel based weights and, more generally, that nonparametric estimation methods provide good solutions to otherwise intractable problems.

artificial intelligence, machine learning, optimization problem, (14 more...)

Neural Information Processing Systems

Country: North America > United States (0.94)

Genre: Research Report > New Finding (0.86)

Industry: Banking & Finance > Trading (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Filters

Collaborating Authors

observable state variable

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Nonparametric Density Estimation for Stochastic Optimization with an Observable State Variable

Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation

Nonparametric Density Estimation for Stochastic Optimization with an Observable State Variable

Nonparametric Density Estimation for Stochastic Optimization with an Observable State Variable