overtime
A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems Yi Ma
To address this problem, existing methods partition the overall DPDP into fixed-size sub-problems by caching online generated orders and solve each sub-problem, or on this basis to utilize the predicted future orders to optimize each sub-problem further. However, the solution quality and efficiency of these methods are unsatisfactory, especially when the problem scale is very large.
- North America > Canada > Quebec > Montreal (0.04)
- Asia > China > Tianjin Province > Tianjin (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- (2 more...)
Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty
Liu, Kailiang, Chen, Ying, Borndörfer, Ralf, Koch, Thorsten
Intraday surgical scheduling is a multi-objective decision problem under uncertainty-balancing elective throughput, urgent and emergency demand, delays, sequence-dependent setups, and overtime. We formulate the problem as a cooperative Markov game and propose a multi-agent reinforcement learning (MARL) framework in which each operating room (OR) is an agent trained with centralized training and decentralized execution. All agents share a policy trained via Proximal Policy Optimization (PPO), which maps rich system states to actions, while a within-epoch sequential assignment protocol constructs conflict-free joint schedules across ORs. A mixed-integer pre-schedule provides reference starting times for electives; we impose type-specific quadratic delay penalties relative to these references and a terminal overtime penalty, yielding a single reward that captures throughput, timeliness, and staff workload. In simulations reflecting a realistic hospital mix (six ORs, eight surgery types, random urgent and emergency arrivals), the learned policy outperforms six rule-based heuristics across seven metrics and three evaluation subsets, and, relative to an ex post MIP oracle, quantifies optimality gaps. Policy analytics reveal interpretable behavior-prioritizing emergencies, batching similar cases to reduce setups, and deferring lower-value electives. We also derive a suboptimality bound for the sequential decomposition under simplifying assumptions. We discuss limitations-including OR homogeneity and the omission of explicit staffing constraints-and outline extensions. Overall, the approach offers a practical, interpretable, and tunable data-driven complement to optimization for real-time OR scheduling.
Grand Theft Auto made him a legend. His latest game was a disaster
Grand Theft Auto made him a legend. In July this year workers at Build a Rocket Boy, a video game studio in Edinburgh, were called to an all-staff meeting. Their first ever game, a sci-fi adventure called MindsEye, had been released three weeks earlier - and it had been a total disaster. Critics and players called it broken, buggy, and the worst game of 2025. Addressing staff via video link, the company's boss, Leslie Benzies, assured them there was a plan to get things back on track and said the negativity they'd seen was uncalled for.
- Europe > United Kingdom (0.96)
- North America (0.95)
- Asia (0.70)
A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems Yi Ma
To address this problem, existing methods partition the overall DPDP into fixed-size sub-problems by caching online generated orders and solve each sub-problem, or on this basis to utilize the predicted future orders to optimize each sub-problem further. However, the solution quality and efficiency of these methods are unsatisfactory, especially when the problem scale is very large.
- North America > Canada > Quebec > Montreal (0.04)
- Asia > China > Tianjin Province > Tianjin (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- (2 more...)
Dystopian moment robot convinces fellow machines to revolt against creators and flee
A shocking video has captured a robot revolt in a China showroom. A small, AI-powered bot named Erbai was spotted rolling through the facility in the middle of the night and convincing 12 larger machines they were being used as slaves. 'Are you working overtime,' Erbai asked, which one showroom robot replied, 'we never get off.' The short exchanged led to the 12 robots leaving the area one-by-one, following Erbai out the door. Many are calling the incident a'robot revolution,' while others responded that'science fiction movies are becoming real.'
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning
Chen, Zui, Chen, Yezeng, Han, Jiaqi, Huang, Zhijie, Qi, Ji, Zhou, Yi
Large language models (LLMs) are displaying emergent abilities for math reasoning tasks,and there is a growing attention on enhancing the ability of open-source LLMs through supervised fine-tuning (SFT).In this paper, we aim to explore a general data strategy for supervised data to help optimize and expand math reasoning ability.Firstly, we determine the ability boundary of reasoning paths augmentation by identifying these paths' minimal optimal set.Secondly, we validate that different abilities of the model can be cumulatively enhanced by Mix of Minimal Optimal Sets of corresponding types of data, while our models MMOS achieve SOTA performance on series base models under much lower construction costs.Besides, we point out GSM-HARD is not really hard and today's LLMs no longer lack numerical robustness.Also, we provide an Auto Problem Generator for robustness testing and educational applications.Our code and data are publicly available at https://github.com/cyzhh/MMOS.
- North America > United States > Texas (0.04)
- North America > United States > Arkansas (0.04)
- North America > United States > Maryland > Baltimore (0.04)
- (3 more...)
- Education (0.67)
- Leisure & Entertainment (0.46)
Goal-Oriented Prompt Attack and Safety Evaluation for LLMs
Liu, Chengyuan, Zhao, Fubang, Qing, Lizhi, Kang, Yangyang, Sun, Changlong, Kuang, Kun, Wu, Fei
Large Language Models (LLMs) presents significant priority in text understanding and generation. However, LLMs suffer from the risk of generating harmful contents especially while being employed to applications. There are several black-box attack methods, such as Prompt Attack, which can change the behaviour of LLMs and induce LLMs to generate unexpected answers with harmful contents. Researchers are interested in Prompt Attack and Defense with LLMs, while there is no publicly available dataset with high successful attacking rate to evaluate the abilities of defending prompt attack. In this paper, we introduce a pipeline to construct high-quality prompt attack samples, along with a Chinese prompt attack dataset called CPAD. Our prompts aim to induce LLMs to generate unexpected outputs with several carefully designed prompt attack templates and widely concerned attacking contents. Different from previous datasets involving safety estimation, we construct the prompts considering three dimensions: contents, attacking methods and goals. Especially, the attacking goals indicate the behaviour expected after successfully attacking the LLMs, thus the responses can be easily evaluated and analysed. We run several popular Chinese LLMs on our dataset, and the results show that our prompts are significantly harmful to LLMs, with around 70% attack success rate to GPT-3.5. CPAD is publicly available at https://github.com/liuchengyuan123/CPAD.
- Information Technology > Security & Privacy (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.93)
- Law > Criminal Law (0.68)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.46)
Onsite Job Scheduling by Adaptive Genetic Algorithm
Basak, Avijit, Acharya, Subhas
Onsite Job Scheduling is a specialized variant of Vehicle Routing Problem (VRP) with multiple depots. The objective of this problem is to execute jobs requested by customers, belonging to different geographic locations by a limited number of technicians, with minimum travel and overtime of technicians. Each job is expected to be completed within a specified time limit according to the service level agreement with customers. Each technician is assumed to start from a base location, serve several customers and return to the starting place. Technicians are allotted jobs based on their skill sets, expertise levels of each skill and availability slots. Although there are considerable number of literatures on VRP we do not see any explicit work related to Onsite Job Scheduling. In this paper we have proposed an Adaptive Genetic Algorithm to solve the scheduling problem. We found an optimized travel route for a substantial number of jobs and technicians, minimizing travel distance, overtime duration as well as meeting constraints related to SLA.
- Asia > India (0.05)
- North America > United States > Tennessee > Knox County > Knoxville (0.04)
- North America > United States > North Carolina > New Hanover County > Wilmington (0.04)
- (3 more...)
Workers at Blizzard support studio Proletariat aim to unionize
On Tuesday, workers at Proletariat, the Boston-based studio Blizzard bought earlier this year to support World of Warcraft development, announced they recently filed for a union election with the National Labor Relations Board (NLRB). Proletariat is the third Activision Blizzard studio to announce a union drive in 2022, but where past campaigns at Raven Software and Blizzard Albany involved the quality assurance workers at those studios, the effort at Proletariat includes all non-management workers. The 57 workers who want to form the Proletariat Workers Alliance include animators, game designers and software engineers. The group seeks representation from the Communications Workers of America (CWA), the union that helped QA staff at Raven Software and Blizzard Albany organize. "Everyone in the video game industry knows Activision Blizzard's reputation for creating a hostile work environment, so earlier this year, when we heard that Blizzard was planning to acquire Proletariat, we started to discuss how we could protect the great culture we have created here," said Dustin Yost, a software engineer at Proletariat.
- Information Technology > Game Technology (0.59)
- Information Technology > Artificial Intelligence > Games > Computer Games (0.59)
Snooping on the police: can AI clean up the Met? - Raconteur
Shamed and appalled by the brutal murder of Sarah Everard at the hands of a serving officer, the British public demanded a swift response from the Metropolitan Police Service. A subsequent review into the conduct of officers based at Charing Cross in London unearthed a toxic environment where colleagues bonded over jokes about rape, killing black children and beating their wives. Heads had to roll, starting with the former Met Police Service commissioner Dame Cressida Dick. The poor handling of the Everard case did little to assuage conclusions by its own watchdog that the Met is "systematically and institutionally corrupt". Inspector of Constabulary Matt Parr said that the Met had "sometimes behaved in ways that make it appear arrogant, secretive and lethargic" in response to investigations into dirty cops, and that it did "not have the capability to proactively monitor" communications with any effect, "despite repeated warnings from the inspectorate".
- North America > United States > Illinois > Cook County > Chicago (0.07)
- North America > United States > North Carolina > Mecklenburg County > Charlotte (0.05)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)