ppo-bihyb
b2f627fff19fda463cb386442eac2b3d-Supplemental.pdf
Erd os GNN [29] is a novel framework with unsupervised learning, however, its main limitation is that this framework is incapable of handling constraints beyond simple node constraints. Following the implementation from [39], the job nodes are scheduled in sequential order with PPO-Single. Herestateisthe current DAGGk with atimestamp, and some of the nodes are already scheduled by the current timestamp. Torepresent the current state of the problem, the finished nodes, running nodes and unscheduled nodes are marked by different node 15 attributes, so that the state information is fully encoded by the nodes and edges ofGk. After anode finishes, itwillfreesome resources, and sometimes add some available nodes to be scheduled.
- North America > Mexico > Gulf of Mexico (0.04)
- Asia > China > Shanghai > Shanghai (0.04)
- North America > United States (0.04)
- (2 more...)
- Information Technology (0.67)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.47)
- Health & Medicine > Therapeutic Area > Immunology (0.47)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
A Comparison with Other General MLCO Frameworks
We would also like to discuss the limitations of the approaches including ours. As shown in Tab. 4, the PPO-Single that serves as a baseline in our paper is designed following As shown in Tab. 4, NerRewritter is most general because it can be viewed as a learning-based local It is also worth noting that there are some problems that are beyond our knowledge to tackle, e.g. the expression simplify problem, and it may requires experts with specific domain We have discussed the model details of PPO-BiHyb in Sec. 4, and in this section, we discuss the DAG. Considering the structure of DAG, we design two GCNs: the first GCN processes the original DAG, and the second GCN processes the DAG with all edges reversed. The predicted doubly-stochastic matrix by SK is processed by considering the partial matching matrix. Graph-level features are obtained via attention pooling, which are fed to the critic net.
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.35)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.31)
- Asia > China > Shanghai > Shanghai (0.04)
- North America > United States (0.04)
- Europe > Italy (0.04)
- Asia > Middle East > Jordan (0.04)
- Information Technology (0.67)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.47)
- Health & Medicine > Therapeutic Area > Immunology (0.47)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- (2 more...)