Planning & Scheduling
The Art of Drafting: A Team-Oriented Hero Recommendation System for Multiplayer Online Battle Arena Games
Chen, Zhengxing, Nguyen, Truong-Huy D, Xu, Yuyu, Amato, Chris, Cooper, Seth, Sun, Yizhou, El-Nasr, Magy Seif
Multiplayer Online Battle Arena (MOBA) games have received increasing popularity recently. In a match of such games, players compete in two teams of five, each controlling an in-game avatars, known as heroes, selected from a roster of more than 100. The selection of heroes, also known as pick or draft, takes place before the match starts and alternates between the two teams until each player has selected one hero. Heroes are designed with different strengths and weaknesses to promote team cooperation in a game. Intuitively, heroes in a strong team should complement each other's strengths and suppressing those of opponents. Hero drafting is therefore a challenging problem due to the complex hero-to-hero relationships to consider. In this paper, we propose a novel hero recommendation system that suggests heroes to add to an existing team while maximizing the team's prospect for victory. To that end, we model the drafting between two teams as a combinatorial game and use Monte Carlo Tree Search (MCTS) for estimating the values of hero combinations. Our empirical evaluation shows that hero teams drafted by our recommendation algorithm have significantly higher win rate against teams constructed by other baseline and state-of-the-art strategies.
CASP Solutions for Planning in Hybrid Domains
Balduccini, Marcello, Magazzeni, Daniele, Maratea, Marco, LeBlanc, Emily
CASP is an extension of ASP that allows for numerical constraints to be added in the rules. PDDL+ is an extension of the PDDL standard language of automated planning for modeling mixed discrete-continuous dynamics. In this paper, we present CASP solutions for dealing with PDDL+ problems, i.e., encoding from PDDL+ to CASP, and extensions to the algorithm of the EZCSP CASP solver in order to solve CASP programs arising from PDDL+ domains. An experimental analysis, performed on well-known linear and non-linear variants of PDDL+ domains, involving various configurations of the EZCSP solver, other CASP solvers, and PDDL+ planners, shows the viability of our solution.
A New Approach for Resource Scheduling with Deep Reinforcement Learning
Ye, Yufei, Ren, Xiaoqin, Wang, Jin, Xu, Lingxiao, Guo, Wenxia, Huang, Wenqiang, Tian, Wenhong
With the rapid development of deep learning, deep reinforcement learning (DRL) began to appear in the field of resource scheduling in recent years. Based on the previous research on DRL in the literature, we introduce online resource scheduling algorithm DeepRM2 and the offline resource scheduling algorithm DeepRM_Off. Compared with the state-of-the-art DRL algorithm DeepRM and heuristic algorithms, our proposed algorithms have faster convergence speed and better scheduling efficiency with regarding to average slowdown time, job completion time and rewards.
American expects back to normal operations at regional unit
FORT WORTH, Texas โ American Airlines says a regional affiliate should run close to a normal operation Thursday after canceling 2,750 flights in the past week because of a computer problem. Spokeswoman Katie Cody said PSA Airlines stabilized its computer systems but faced delays getting planes and crews back in place. Based in Dayton, Ohio, PSA is owned by American and operates many American Eagle regional flights, especially in Charlotte, North Carolina. Cody said American has been rebooking stranded passengers on American and other airlines since disruptions started June 14. She said there was a hardware problem in computers used to run crew-scheduling applications.
American Airlines Suffers The Latest Airline IT Meltdown
Within the span of a month during the summer of 2016, two of the top four U.S. airlines suffered crippling IT failures. Delta Air Lines (NYSE:DAL) and Southwest Airlines (NYSE:LUV) were each forced to cancel thousands of flights during the peak season, leading to lost revenue and reputational damage. This article originally appeared in the Motley Fool. The summer 2018 peak season is just getting started, but there has already been a major airline IT failure. In the past week, flight cancellations have rapidly mounted at American Airlines' (NASDAQ:AAL) regional subsidiary PSA Airlines, due to problems with the carrier's crew scheduling system.
Validation of Hierarchical Plans via Parsing of Attribute Grammars
Bartak, Roman (Charles University) | Maillard, Adrien (Charles University) | Cardoso, Rafael Cauรช ( Pontifรญcia Universidade Catรณlica do Rio Grande do Sul )
An important problem of automated planning is validating if a plan complies with the domain model. Such validation is straightforward for classical sequential planning but until recently there was no plan validation approach for Hierarchical Task Networks (HTN). In this paper we propose a novel technique for validating HTN plans by parsing of attribute grammars with the timeline constraint.
RMPD โ A Recursive Mid-Point Displacement Algorithm for Path Planning
Li, Fangda (Purdue University) | Manerikar, Ankit V. (Purdue University) | Kak, Avinash C. (Purdue University)
Motivated by what is required for real-time path planning, the paper starts out by presenting RMPD, a new recursive ''local'' planner founded on the key notion that, unless made necessary by an obstacle, there must be no deviation from the shortest path between any two points, which would normally be a straight line path in the configuration space. Subsequently, we increase the power of RMPD by introducing the notion of cost-awareness into the algorithm to improve the path quality -- this is done by associating obstacle and smoothness costs with the currently selected path points and factoring those costs in choosing the best points for the next iteration. In this manner, the overall strategy in the cost-aware form of RMPD, cRMPD, combines the computational efficiency made possible by the recursive RMPD planner with the cost efficacy of a stochastic trajectory optimizer to rapidly produce high-quality local collision-free paths. Based on the test cases we have run, our experiments show that cRMPD can reduce planning time by up to two orders of magnitude as compared to RRT-Connect, while still maintaining a path length optimality equivalent to that of RRT*.
ASP-Based Time-Bounded Planning for Logistics Robots
Schรคpers, Bjรถrn (RWTH Aachen University) | Niemueller, Tim (RWTH Aachen University) | Lakemeyer, Gerhard (RWTH Aachen University) | Gebser, Martin (University of Potsdam) | Schaub, Torsten (University of Potsdam)
Manufacturing industries are undergoing a major paradigm shift towards more autonomy. Automated planning and scheduling then becomes a necessity. The Planning and Execution Competition for Logistics Robots in Simulation held at ICAPS is based on this scenario and provides an interesting testbed. However, the posed problem is challenging as also demonstrated by the somewhat weak results in 2017. The domain requires temporal reasoning and dealing with uncertainty. We propose a novel planning system based on Answer Set Programming and the Clingo solver to tackle these problems and incentivize robot cooperation. Our results show a significant performance improvement, both, in terms of lowering computational requirements and better game metrics.
Algorithms and Conditional Lower Bounds for Planning Problems
Chatterjee, Krishnendu (Institute of Science and Technology Austria) | Dvoลรกk, Wolfgang (Vienna University of Technology) | Henzinger, Monika (University of Vienna) | Svozil, Alexander (University of Vienna)
We consider planning problems for graphs, Markov decision processes (MDPs), and games on graphs. While graphs represent the most basic planning model, MDPs represent interaction with nature and games on graphs represent interaction with an adversarial environment.We consider two planning problems where there are k different target sets, and the problems are as follows: (a) the coverage problem asks whether there is a plan for each individual target set, and (b) the sequential target reachability problem asks whether the targets can be reached in sequence. For the coverage problem, we present a linear-time algorithm for graphs, and quadratic conditional lower bound for MDPs and games on graphs.For the sequential target problem, we present a linear-time algorithm for graphs, a sub-quadratic algorithm for MDPs, and a quadratic conditional lower bound for games on graphs.Our results with conditional lower bounds establish (i) model-separation results showing that for the coverage problem MDPs and games on graphs are harder than graphs and for the sequential reachability problem games on graphs are harder than MDPs and graphs;and (ii) objective-separation results showing that for MDPs the coverage problem is harder than the sequential target problem.
Training Deep Reactive Policies for Probabilistic Planning Problems
Issakkimuthu, Murugeswari (Oregon State University) | Fern, Alan (Oregon State University) | Tadepalli, Prasad (Oregon State University)
State-of-the-art probabilistic planners typically apply look- ahead search and reasoning at each step to make a decision. While this approach can enable high-quality decisions, it can be computationally expensive for problems that require fast decision making. In this paper, we investigate the potential for deep learning to replace search by fast reactive policies. We focus on supervised learning of deep reactive policies for probabilistic planning problems described in RDDL. A key challenge is to explore the large design space of network architectures and training methods, which was critical to prior deep learning successes. We investigate a number of choices in this space and conduct experiments across a set of benchmark problems. Our results show that effective deep reactive policies can be learned for many benchmark problems and that leveraging the planning problem description to define the network structure can be beneficial.