Reinforcement learning for combinatorial optimization: A survey

N Mazyavkina, S Sviridov, S Ivanov… - Computers & Operations …, 2021 - Elsevier
Many traditional algorithms for solving combinatorial optimization problems involve using
hand-crafted heuristics that sequentially construct a solution. Such heuristics are designed …

An overview and experimental study of learning-based optimization algorithms for the vehicle routing problem

B Li, G Wu, Y He, M Fan… - IEEE/CAA Journal of …, 2022 - ieeexplore.ieee.org
The vehicle routing problem (VRP) is a typical discrete combinatorial optimization problem,
and many models and algorithms have been proposed to solve the VRP and its variants …

Illuminating generalization in deep reinforcement learning through procedural level generation

N Justesen, RR Torrado, P Bontrager, A Khalifa… - arxiv preprint arxiv …, 2018 - arxiv.org
Deep reinforcement learning (RL) has shown impressive results in a variety of domains,
learning directly from high-dimensional sensory streams. However, when neural networks …

A reinforcement learning approach for optimizing multiple traveling salesman problems over graphs

Y Hu, Y Yao, WS Lee - Knowledge-Based Systems, 2020 - Elsevier
This paper proposes a learning-based approach to optimize the multiple traveling salesman
problem (MTSP), which is one classic representative of cooperative combinatorial …

Planning with learned object importance in large problem instances using graph neural networks

T Silver, R Chitnis, A Curtis, JB Tenenbaum… - Proceedings of the …, 2021 - ojs.aaai.org
Real-world planning problems often involve hundreds or even thousands of objects,
straining the limits of modern planners. In this work, we address this challenge by learning to …

Learning general optimal policies with graph neural networks: Expressive power, transparency, and limits

S Ståhlberg, B Bonet, H Geffner - Proceedings of the International …, 2022 - ojs.aaai.org
It has been recently shown that general policies for many classical planning domains can be
expressed and learned in terms of a pool of features defined from the domain predicates …

Learning domain-independent planning heuristics with hypergraph networks

W Shen, F Trevizan, S Thiébaux - Proceedings of the International …, 2020 - aaai.org
We present the first approach capable of learning domain-independent planning heuristics
entirely from scratch. The heuristics we learn map the hypergraph representation of the …

Learning general policies with policy gradient methods

S Ståhlberg, B Bonet, H Geffner - Proceedings of the …, 2023 - proceedings.kr.org
While reinforcement learning methods have delivered remarkable results in a number of
settings, generalization, ie, the ability to produce policies that generalize in a reliable and …

Learning generalized policies without supervision using gnns

S Ståhlberg, B Bonet, H Geffner - arxiv preprint arxiv:2205.06002, 2022 - arxiv.org
We consider the problem of learning generalized policies for classical planning domains
using graph neural networks from small instances represented in lifted STRIPS. The …

Deep learning for big data applications in CAD and PLM–Research review, opportunities and case study

J Dekhtiar, A Durupt, M Bricogne, B Eynard… - Computers in …, 2018 - Elsevier
With the increasing amount of available data, computing power and network speed for a
decreasing cost, the manufacturing industry is facing an unprecedented amount of data to …