AITopics | davi

Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions

Neural Information Processing Systems, Feb-7-2026, 22:43:32 GMT

However, Asynchronous VI still requires a maximization over the entire action space, making it impractical for domains with large action space.

artificial intelligence, davi, probability, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)

Technology: Information Technology > Artificial Intelligence (0.95)

Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions

Neural Information Processing Systems, Dec-23-2025, 22:12:03 GMT

Value iteration (VI) is a foundational dynamic programming method, important for learning and planning in optimal control and reinforcement learning. VI proceeds in batches, where the update to the value of each state must be completed before the next batch of updates can begin. Completing a single batch is prohibitively expensive if the state space is large, rendering VI impractical for many applications. Asynchronous VI helps to address the large state space problem by updating one state at a time, in-place and in an arbitrary order. However, Asynchronous VI still requires a maximization over the entire action space, making it impractical for domains with large action space.

doubly-asynchronous value iteration, name change, value iteration asynchronous, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.82)

Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions

Neural Information Processing Systems, Oct-10-2024, 08:53:30 GMT

Value iteration (VI) is a foundational dynamic programming method, important for learning and planning in optimal control and reinforcement learning. VI proceeds in batches, where the update to the value of each state must be completed before the next batch of updates can begin. Completing a single batch is prohibitively expensive if the state space is large, rendering VI impractical for many applications. Asynchronous VI helps to address the large state space problem by updating one state at a time, in-place and in an arbitrary order. However, Asynchronous VI still requires a maximization over the entire action space, making it impractical for domains with large action space.

action space, doubly-asynchronous value iteration, value iteration asynchronous, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.85)

Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions

Tian, Tian; Young, Kenny; Sutton, Richard S.

arXiv.org Artificial Intelligence, Nov-27-2022

Value iteration (VI) is a foundational dynamic programming method, important for learning and planning in optimal control and reinforcement learning. VI proceeds in batches, where the update to the value of each state must be completed before the next batch of updates can begin. Completing a single batch is prohibitively expensive if the state space is large, rendering VI impractical for many applications. Asynchronous VI helps to address the large state space problem by updating one state at a time, in-place and in an arbitrary order. However, Asynchronous VI still requires a maximization over the entire action space, making it impractical for domains with large action space. To address this issue, we propose doubly-asynchronous value iteration (DAVI), a new algorithm that generalizes the idea of asynchrony from states to states and actions. More concretely, DAVI maximizes over a sampled subset of actions that can be of any user-defined size. This simple approach of using sampling to reduce computation maintains similarly appealing theoretical properties to VI without the need to wait for a full sweep through the entire action space in each update. In this paper, we show DAVI converges to the optimal value function with probability one, converges at a near-geometric rate with probability 1 - δ, and returns a near-optimal policy in computation time that nearly matches a previously established bound for VI. We also empirically demonstrate DAVI's effectiveness in several experiments.
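
The update described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `davi_sketch`, the tabular arrays `P` and `R`, uniform sampling of states and actions, and the choice to keep the current greedy action among the candidates are all assumptions made for this example. The sketch also assumes non-negative rewards and a zero-initialised value function.

```python
import numpy as np

def davi_sketch(P, R, gamma=0.9, m=2, n_updates=20000, seed=0):
    """Illustrative doubly-asynchronous update on a tabular MDP.

    P: array of shape (S, A, S) with transition probabilities (assumed input).
    R: array of shape (S, A) with expected rewards (assumed input).
    m: number of actions sampled per update (the user-defined subset size).
    """
    rng = np.random.default_rng(seed)
    n_states, n_actions = R.shape
    v = np.zeros(n_states)              # value estimate, updated in place
    pi = np.zeros(n_states, dtype=int)  # current greedy action per state

    for _ in range(n_updates):
        # Asynchronous in states: update a single state per step.
        s = rng.integers(n_states)
        # Asynchronous in actions: look at a sampled subset only.
        sampled = rng.choice(n_actions, size=min(m, n_actions), replace=False)
        # Assumption of this sketch: the incumbent action stays a candidate,
        # so a good action found earlier is not forgotten.
        candidates = np.union1d(sampled, [pi[s]])
        # One-step lookahead for the candidate actions only.
        q = R[s, candidates] + gamma * P[s, candidates] @ v
        best = int(np.argmax(q))
        v[s] = q[best]
        pi[s] = candidates[best]
    return v, pi
```

Each update costs on the order of `m + 1` lookaheads rather than one per action, which is the saving the abstract refers to. Setting `m` to the full number of actions reduces the sketch to ordinary asynchronous VI with uniformly sampled states.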

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2207.01613

Country:

North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)

Clinical trials are better, faster, cheaper with big data

MIT Technology Review, Jun-10-2021, 14:00:00 GMT

"One of the most difficult parts of my job is enrolling patients into studies," says Nicholas Borys, chief medical officer for Lawrenceville, N.J., biotechnology company Celsion, which develops next-generation chemotherapy and immunotherapy agents for liver and ovarian cancers and certain types of brain tumors. Borys estimates that fewer than 10% of cancer patients are enrolled in clinical trials. "If we could get that up to 20% or 30%, we probably could have had several cancers conquered by now." Clinical trials test new drugs, devices, and procedures to determine whether they're safe and effective before they're approved for general use. But the path from study design to approval is long, winding, and expensive.

clinical trial, control arm, medidata, (14 more...)

MIT Technology Review

Country: North America > United States > New York (0.05)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence (0.74)
Information Technology > Data Science > Data Mining > Big Data (0.41)
