Reinforcement learning for combinatorial optimization: A survey
Many traditional algorithms for solving combinatorial optimization problems involve using
hand-crafted heuristics that sequentially construct a solution. Such heuristics are designed …
hand-crafted heuristics that sequentially construct a solution. Such heuristics are designed …
An overview and experimental study of learning-based optimization algorithms for the vehicle routing problem
The vehicle routing problem (VRP) is a typical discrete combinatorial optimization problem,
and many models and algorithms have been proposed to solve the VRP and its variants …
and many models and algorithms have been proposed to solve the VRP and its variants …
Illuminating generalization in deep reinforcement learning through procedural level generation
Deep reinforcement learning (RL) has shown impressive results in a variety of domains,
learning directly from high-dimensional sensory streams. However, when neural networks …
learning directly from high-dimensional sensory streams. However, when neural networks …
A reinforcement learning approach for optimizing multiple traveling salesman problems over graphs
This paper proposes a learning-based approach to optimize the multiple traveling salesman
problem (MTSP), which is one classic representative of cooperative combinatorial …
problem (MTSP), which is one classic representative of cooperative combinatorial …
Planning with learned object importance in large problem instances using graph neural networks
Real-world planning problems often involve hundreds or even thousands of objects,
straining the limits of modern planners. In this work, we address this challenge by learning to …
straining the limits of modern planners. In this work, we address this challenge by learning to …
Learning general optimal policies with graph neural networks: Expressive power, transparency, and limits
It has been recently shown that general policies for many classical planning domains can be
expressed and learned in terms of a pool of features defined from the domain predicates …
expressed and learned in terms of a pool of features defined from the domain predicates …
Learning domain-independent planning heuristics with hypergraph networks
We present the first approach capable of learning domain-independent planning heuristics
entirely from scratch. The heuristics we learn map the hypergraph representation of the …
entirely from scratch. The heuristics we learn map the hypergraph representation of the …
Learning general policies with policy gradient methods
While reinforcement learning methods have delivered remarkable results in a number of
settings, generalization, ie, the ability to produce policies that generalize in a reliable and …
settings, generalization, ie, the ability to produce policies that generalize in a reliable and …
Learning generalized policies without supervision using gnns
We consider the problem of learning generalized policies for classical planning domains
using graph neural networks from small instances represented in lifted STRIPS. The …
using graph neural networks from small instances represented in lifted STRIPS. The …
Deep learning for big data applications in CAD and PLM–Research review, opportunities and case study
With the increasing amount of available data, computing power and network speed for a
decreasing cost, the manufacturing industry is facing an unprecedented amount of data to …
decreasing cost, the manufacturing industry is facing an unprecedented amount of data to …