Weakly Coupled Deep Q-Networks
–Neural Information Processing Systems
We propose weakly coupled deep Q-networks (WCDQN), a novel deep reinforcement learning algorithm that enhances performance in a class of structured problems called weakly coupled Markov decision processes (WCMDP). WCMDPs consist of multiple independent subproblems connected by an action space constraint, which is a structural property that frequently emerges in practice. Despite this appealing structure, WCMDPs quickly become intractable as the number of subproblems grows. WCDQN employs a single network to train multiple DQN "subagents," one for each subproblem, and then combine their solutions to establish an upper bound on the optimal action value.
Neural Information Processing Systems
Feb-11-2025, 03:05:12 GMT
- Country:
- North America > United States (0.73)
- Industry:
- Automobiles & Trucks (0.94)
- Transportation
- Electric Vehicle (0.94)
- Ground > Road (0.94)