seeder


Learning Collaborative Policies to Solve NP-hard Routing Problems

Kim, Minsu, Park, Jinkyoo, Kim, Joungho

Neural Information Processing Systems

Recently, deep reinforcement learning (DRL) frameworks have shown potential for solving NP-hard routing problems such as the traveling salesman problem (TSP) without problem-specific expert knowledge. Although DRL can be used to solve complex problems, DRL frameworks still struggle to compete with state-of-the-art heuristics, leaving a substantial performance gap. This paper proposes a novel hierarchical problem-solving strategy, termed learning collaborative policies (LCP), which can effectively find near-optimum solutions using two iterative DRL policies: the seeder and the reviser. The seeder generates candidate solutions (seeds) that are as diverse as possible, dedicating itself to exploring the full combinatorial action space (i.e., the sequence of assignment actions). To this end, we train the seeder's policy using a simple yet effective entropy-regularization reward that encourages it to find diverse solutions. The reviser, in turn, modifies each candidate solution generated by the seeder: it partitions the full trajectory into sub-tours and simultaneously revises each sub-tour to minimize its traveling distance. The reviser is thus trained to improve the quality of the candidate solution while focusing on a reduced solution space, which is beneficial for exploitation. Extensive experiments demonstrate that the proposed two-policy collaboration scheme improves over single-policy DRL frameworks on various NP-hard routing problems, including TSP, the prize-collecting TSP (PCTSP), and the capacitated vehicle routing problem (CVRP).
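The seeder-reviser collaboration described in the abstract can be made concrete with a short sketch. The code below is a minimal illustration and not the authors' implementation: sample_seed_tour stands in for the learned seeder (here just a random permutation), revise_subtour stands in for the learned reviser (here a plain 2-opt pass with fixed segment endpoints), and the segment size and seed count are illustrative parameters.

```python
import math
import random

def tour_length(tour, coords):
    """Total Euclidean length of a closed tour over 2-D coordinates."""
    return sum(
        math.dist(coords[tour[i]], coords[tour[(i + 1) % len(tour)]])
        for i in range(len(tour))
    )

def sample_seed_tour(coords):
    """Stand-in for the seeder policy: a random permutation of nodes.
    The real seeder is a DRL policy trained with an entropy-regularized
    reward so that sampled tours stay diverse."""
    tour = list(range(len(coords)))
    random.shuffle(tour)
    return tour

def revise_subtour(subtour, coords):
    """Stand-in for the reviser policy: a 2-opt pass over one segment.
    The first and last nodes are kept fixed, so revised segments can be
    concatenated back into a valid full tour."""
    sub = list(subtour)
    improved = True
    while improved:
        improved = False
        for i in range(1, len(sub) - 2):
            for j in range(i + 1, len(sub) - 1):
                before = (math.dist(coords[sub[i - 1]], coords[sub[i]])
                          + math.dist(coords[sub[j]], coords[sub[j + 1]]))
                after = (math.dist(coords[sub[i - 1]], coords[sub[j]])
                         + math.dist(coords[sub[i]], coords[sub[j + 1]]))
                if after < before - 1e-12:
                    sub[i:j + 1] = reversed(sub[i:j + 1])
                    improved = True
    return sub

def lcp_solve(coords, num_seeds=10, segment=20):
    """Seeder-reviser collaboration: sample diverse seeds, partition each
    into sub-tours, revise every sub-tour, and keep the best full tour."""
    best = None
    for _ in range(num_seeds):
        tour = sample_seed_tour(coords)
        revised = []
        for start in range(0, len(tour), segment):
            revised.extend(revise_subtour(tour[start:start + segment], coords))
        if best is None or tour_length(revised, coords) < tour_length(best, coords):
            best = revised
    return best
```

The division of labor is the point of the scheme: the seeder only has to cover the solution space broadly (diversity), while the reviser only has to polish short fixed-endpoint segments (exploitation over a much smaller space), which is far easier than optimizing the full tour at once.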




A Details of Experiments

Neural Information Processing Systems

R is the prize of the visited node. The MDP formulation, including the training scheme, is mostly the same as for TSP. This section provides implementation details of the seeder for the experiments. The details of setting T in the inference phase (i.e., in the experiments) are described in Appendix A.5.

A.3 Detailed Implementation of Reviser

This section describes the detailed implementation of the reviser for each target problem.
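The excerpt mentions a per-problem reviser and an inference-time iteration count T. As a rough illustration of how such an iterated revision loop might look (reusing the hypothetical revise_subtour helper from the sketch above; the rotate-then-repartition scheme is an assumption for illustration, not the paper's exact procedure):

```python
def iterative_revise(tour, coords, T=5, segment=20):
    """Hypothetical inference loop: apply the reviser T times, rotating
    the partition offset each pass so that segment boundaries from one
    pass fall inside segments of the next and can still be improved.
    Rotating a closed tour leaves its length unchanged."""
    for t in range(T):
        offset = (t * segment // 2) % len(tour)   # shift the cut points
        rotated = tour[offset:] + tour[:offset]
        revised = []
        for start in range(0, len(rotated), segment):
            revised.extend(revise_subtour(rotated[start:start + segment], coords))
        tour = revised
    return tour
```

Larger T trades inference time for solution quality, which is presumably why the paper reports its choice of T per experiment in Appendix A.5.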






The Dot Power Platform Could Transform Farming Technology

WIRED

The Dot Power Platform is a prime example of an explosion in advanced agricultural technology, which Goldman Sachs predicts will raise crop yields 70 percent by 2050. But Dot isn't just a tractor that can drive without a human for backup. It's the Transformer of ag-bots, capable of performing 100-plus jobs, from hay baler and seeder to rock picker and manure spreader, via an arsenal of tool modules. And though the hulking machine can carry 40,000 pounds, it navigates fields with balletic precision. Farmers map their land using an aerial drone or GPS receiver, upload that data to the Dot controller--a Microsoft Surface Pro--then unleash the beast into the field.


Should we anthropomorphize an AI who wants to kill us all?

#artificialintelligence

Musk, Gates, and Hawking are worrying about AI. But would a sentient AI really want to kill us all? And if it does, should we anthropomorphize the AI to give us humans some measure of advantage? One way to consider these questions is to peer into our human nature or even science fiction for clues. The answers may be a matter of perspective: Are we looking into the window or out?