Goto

Collaborating Authors

 mathematical description


POMDPs in Continuous Time and Discrete Spaces

Neural Information Processing Systems

Many processes, such as discrete event systems in engineering or population dynamics in biology, evolve in discrete space and continuous time. We consider the problem of optimal decision making in such discrete state and action space systems under partial observability. This places our work at the intersection of optimal filtering and optimal control. At the current state of research, a mathematical description for simultaneous decision making and filtering in continuous time with finite state and action spaces is still missing. In this paper, we give a mathematical description of a continuous-time partial observable Markov decision process (POMDP). By leveraging optimal filtering theory we derive a Hamilton-Jacobi-Bellman (HJB) type equation that characterizes the optimal solution. Using techniques from deep learning we approximately solve the resulting partial integro-differential equation. We present (i) an approach solving the decision problem offline by learning an approximation of the value function and (ii) an online algorithm which provides a solution in belief space using deep reinforcement learning. We show the applicability on a set of toy examples which pave the way for future methods providing solutions for high dimensional problems.


POMDPs in Continuous Time and Discrete Spaces

Neural Information Processing Systems

Many processes, such as discrete event systems in engineering or population dynamics in biology, evolve in discrete space and continuous time. We consider the problem of optimal decision making in such discrete state and action space systems under partial observability. This places our work at the intersection of optimal filtering and optimal control. At the current state of research, a mathematical description for simultaneous decision making and filtering in continuous time with finite state and action spaces is still missing. In this paper, we give a mathematical description of a continuous-time partial observable Markov decision process (POMDP).


Reprogrammable materials selectively self-assemble

Robohub

While automated manufacturing is ubiquitous today, it was once a nascent field birthed by inventors such as Oliver Evans, who is credited with creating the first fully automated industrial process, in flour mill he built and gradually automated in the late 1700s. The processes for creating automated structures or machines are still very top-down, requiring humans, factories, or robots to do the assembling and making. However, the way nature does assembly is ubiquitously bottom-up; animals and plants are self-assembled at a cellular level, relying on proteins to self-fold into target geometries that encode all the different functions that keep us ticking. For a more bio-inspired, bottom-up approach to assembly, then, human-architected materials need to do better on their own. Making them scalable, selective, and reprogrammable in a way that could mimic nature's versatility means some teething problems, though.


Machine Learning for Programmers

#artificialintelligence

I have read a book or some posts on machine learning. I have watched some of the Coursera machine learning course. I still don't know how to get started… How do you get started in machine learning? The most common question I'm asked by developers on my newsletter is: How do I get started in machine learning? I honestly cannot remember how many times I have answered it. In this post, I lay out all of my very best thinking on this topic. You are a developer and you're interested in getting into machine learning. You read some blog posts.


How Artificial Intelligence Can Explain Unconscious Decision-Making

#artificialintelligence

If I drive through a stop sign, I might protest to the traffic officer that "I didn't notice it," or "it was blocked by a tree," or "I thought it was a red flag." And those claims might be perfectly true from what I remember of my experience: nothing feels more natural than explaining a mistake. But as plausible as these explanations may sound, the officer shouldn't believe them. The truth is that even I don't know the source of my error. This is because the real explanation had something to do with computations performed by my brain.