AITopics | vapor

6a6e010edde1b8f2812f558b67a1974e-Paper-Conference.pdf

Neural Information Processing SystemsFeb-13-2026, 10:35:12 GMT

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Workflow (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.93)
(2 more...)

Add feedback

Probabilistic Inference in Reinforcement Learning Done Right Jean T arbouriech Google DeepMind

Neural Information Processing SystemsOct-8-2025, 20:37:00 GMT

A popular perspective in Reinforcement learning (RL) casts the problem as probabilistic inference on a graphical model of the Markov decision process (MDP). The core object of study is the probability of each state-action pair being visited under the optimal policy.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Workflow (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.93)
(2 more...)

Add feedback

Probabilistic Inference in Reinforcement Learning Done Right

Tarbouriech, Jean, Lattimore, Tor, O'Donoghue, Brendan

arXiv.org Artificial IntelligenceNov-22-2023

A popular perspective in Reinforcement learning (RL) casts the problem as probabilistic inference on a graphical model of the Markov decision process (MDP). The core object of study is the probability of each state-action pair being visited under the optimal policy. Previous approaches to approximate this quantity can be arbitrarily poor, leading to algorithms that do not implement genuine statistical inference and consequently do not perform well in challenging problems. In this work, we undertake a rigorous Bayesian treatment of the posterior probability of state-action optimality and clarify how it flows through the MDP. We first reveal that this quantity can indeed be used to generate a policy that explores efficiently, as measured by regret. Unfortunately, computing it is intractable, so we derive a new variational Bayesian approximation yielding a tractable convex optimization problem and establish that the resulting policy also explores efficiently. We call our approach VAPOR and show that it has strong connections to Thompson sampling, K-learning, and maximum entropy exploration. We conclude with some experiments demonstrating the performance advantage of a deep RL version of VAPOR.

inference, optimization problem, vapor, (14 more...)

arXiv.org Artificial Intelligence

2311.13294

Country:

Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.81)

Add feedback

Learning Sequential Acquisition Policies for Robot-Assisted Feeding

Sundaresan, Priya, Wu, Jiajun, Sadigh, Dorsa

arXiv.org Artificial IntelligenceOct-16-2023

A robot providing mealtime assistance must perform specialized maneuvers with various utensils in order to pick up and feed a range of food items. Beyond these dexterous low-level skills, an assistive robot must also plan these strategies in sequence over a long horizon to clear a plate and complete a meal. Previous methods in robot-assisted feeding introduce highly specialized primitives for food handling without a means to compose them together. Meanwhile, existing approaches to long-horizon manipulation lack the flexibility to embed highly specialized primitives into their frameworks. We propose Visual Action Planning OveR Sequences (VAPORS), a framework for long-horizon food acquisition. VAPORS learns a policy for high-level action selection by leveraging learned latent plate dynamics in simulation. To carry out sequential plans in the real world, VAPORS delegates action execution to visually parameterized primitives. We validate our approach on complex real-world acquisition trials involving noodle acquisition and bimanual scooping of jelly beans. Across 38 plates, VAPORS acquires much more efficiently than baselines, generalizes across realistic plate variations such as toppings and sauces, and qualitatively appeals to user feeding preferences in a survey conducted across 49 individuals. Code, datasets, videos, and supplementary materials can be found on our website: https://sites.google.com/view/vaporsbot.

acquisition, arxiv preprint arxiv, vapor, (13 more...)

arXiv.org Artificial Intelligence

2309.05197

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > Florida > Hillsborough County > University (0.04)

Genre: Research Report > Experimental Study (0.68)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement Learning

Weerakoon, Kasun, Sathyamoorthy, Adarsh Jagan, Elnoor, Mohamed, Manocha, Dinesh

arXiv.org Artificial IntelligenceSep-19-2023

We present VAPOR, a novel method for autonomous legged robot navigation in unstructured, densely vegetated outdoor environments using offline Reinforcement Learning (RL). Our method trains a novel RL policy using an actor-critic network and arbitrary data collected in real outdoor vegetation. Our policy uses height and intensity-based cost maps derived from 3D LiDAR point clouds, a goal cost map, and processed proprioception data as state inputs, and learns the physical and geometric properties of the surrounding obstacles such as height, density, and solidity/stiffness. The fully-trained policy's critic network is then used to evaluate the quality of dynamically feasible velocities generated from a novel context-aware planner. Our planner adapts the robot's velocity space based on the presence of entrapment inducing vegetation, and narrow passages in dense environments. We demonstrate our method's capabilities on a Spot robot in complex real-world outdoor scenes, including dense vegetation. We observe that VAPOR's actions improve success rates by up to 40%, decrease the average current consumption by up to 2.9%, and decrease the normalized trajectory length by up to 11.2% compared to existing end-to-end offline RL and other outdoor navigation methods.

legged robot navigation, offline reinforcement learning, outdoor vegetation, (1 more...)

arXiv.org Artificial Intelligence

2309.07832

Genre: Research Report (0.69)

Technology:

Information Technology > Artificial Intelligence > Robots > Locomotion (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.60)

Add feedback

Performance of the Pre-Trained Large Language Model GPT-4 on Automated Short Answer Grading

Kortemeyer, Gerd

arXiv.org Artificial IntelligenceSep-17-2023

Automated Short Answer Grading (ASAG) has been an active area of machine-learning research for over a decade. It promises to let educators grade and give feedback on free-form responses in large-enrollment courses in spite of limited availability of human graders. Over the years, carefully trained models have achieved increasingly higher levels of performance. More recently, pre-trained Large Language Models (LLMs) emerged as a commodity, and an intriguing question is how a general-purpose tool without additional training compares to specialized models. We studied the performance of GPT-4 on the standard benchmark 2-way and 3-way datasets SciEntsBank and Beetle, where in addition to the standard task of grading the alignment of the student answer with a reference answer, we also investigated withholding the reference answer. We found that overall, the performance of the pre-trained general-purpose GPT-4 LLM is comparable to hand-engineered models, but worse than pre-trained LLMs that had specialized training.

gpt-4, reference answer, student answer, (11 more...)

arXiv.org Artificial Intelligence

2309.09338

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > District of Columbia > Washington (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Instructional Material (1.00)

Industry:

Education > Educational Setting (0.70)
Education > Educational Technology (0.47)
Information Technology > Security & Privacy (0.46)
Education > Assessment & Standards > Student Performance (0.38)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Machine learning methods for Schlieren imaging of a plasma channel in tenuous atomic vapor

Bíró, Gábor, Pocsai, Mihály, Barna, Imre Ferenc, Moody, Joshua T., Demeter, Gábor

arXiv.org Artificial IntelligenceMay-13-2022

We investigate the usage of a Schlieren imaging setup to measure the geometrical dimensions of a plasma channel in atomic vapor. Near resonant probe light is used to image the plasma channel in a tenuous vapor and machine learning techniques are tested for extracting quantitative information from the images. By building a database of simulated signals with a range of plasma parameters for training Deep Neural Networks, we demonstrate that they can extract from the Schlieren images reliably and with high accuracy the location, the radius and the maximum ionization fraction of the plasma channel as well as the width of the transition region between the core of the plasma channel and the unionized vapor. We test several different neural network architectures with supervised learning and show that the parameter estimations supplied by the networks are resilient with respect to slight changes of the experimental parameters that may occur in the course of a measurement.

artificial intelligence, machine learning, plasma channel, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.optlastec.2022.108948

2205.12731

Country:

Europe > Hungary > Budapest > Budapest (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Kansas City doctor uses 'vaping robot' in research

#artificialintelligenceSep-20-2019, 18:56:45 GMT

Dr. Matthias Salathe does the research in his lab at the University of Kansas Medical Center. Dr. Matthias Salathe does the research in his lab at the University of Kansas Medical Center. A Kansas City doctor is performing groundbreaking research on vaping, using a robot. Dr. Matthias Salathe spends a lot of time with e-cigarettes. "The notion was it's safe, and frankly we did not believe this," said Salathe.

kansas city doctor use, robot, salathé, (5 more...)

#artificialintelligence

Country:

North America > United States > Missouri > Jackson County > Kansas City (0.63)
North America > United States > Kansas (0.56)

Industry:

Health & Medicine > Public Health (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (1.00)
Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (0.37)

Technology: Information Technology > Artificial Intelligence > Robots (0.75)

Add feedback

Kansas City doctor uses 'vaping robot' in research

#artificialintelligenceSep-20-2019, 18:56:45 GMT

Dr. Matthias Salathe does the research in his lab at the University of Kansas Medical Center. Dr. Matthias Salathe does the research in his lab at the University of Kansas Medical Center. A Kansas City doctor is performing groundbreaking research on vaping, using a robot. Dr. Matthias Salathe spends a lot of time with e-cigarettes. "The notion was it's safe, and frankly we did not believe this," said Salathe.

kansas city doctor use, robot, salathé, (5 more...)

#artificialintelligence

Country:

North America > United States > Missouri > Jackson County > Kansas City (0.63)
North America > United States > Kansas (0.56)

Industry:

Health & Medicine > Public Health (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (1.00)
Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (0.37)

Technology: Information Technology > Artificial Intelligence > Robots (0.75)

Add feedback

Filters

Collaborating Authors

vapor

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

6a6e010edde1b8f2812f558b67a1974e-Paper-Conference.pdf

Probabilistic Inference in Reinforcement Learning Done Right Jean T arbouriech Google DeepMind

Probabilistic Inference in Reinforcement Learning Done Right

Learning Sequential Acquisition Policies for Robot-Assisted Feeding

VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement Learning

Performance of the Pre-Trained Large Language Model GPT-4 on Automated Short Answer Grading

Machine learning methods for Schlieren imaging of a plasma channel in tenuous atomic vapor

Kansas City doctor uses 'vaping robot' in research

Kansas City doctor uses 'vaping robot' in research