Rl4co: an extensive reinforcement learning for combinatorial optimization benchmark

F Berto, C Hua, J Park, L Luttmann, Y Ma, F Bu… - arxiv preprint arxiv …, 2023 - arxiv.org
We introduce RL4CO, an extensive reinforcement learning (RL) for combinatorial
optimization (CO) benchmark. RL4CO employs state-of-the-art software libraries as well as …

UDC: A unified neural divide-and-conquer framework for large-scale combinatorial optimization problems

Z Zheng, C Zhou, T **aliang, M Yuan… - arxiv preprint arxiv …, 2024 - arxiv.org
Single-stage neural combinatorial optimization solvers have achieved near-optimal results
on various small-scale combinatorial optimization (CO) problems without requiring expert …

Parco: Learning parallel autoregressive policies for efficient multi-agent combinatorial optimization

F Berto, C Hua, L Luttmann, J Son, J Park… - arxiv preprint arxiv …, 2024 - arxiv.org
Multi-agent combinatorial optimization problems such as routing and scheduling have great
practical relevance but present challenges due to their NP-hard combinatorial nature, hard …

Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design

Z Zheng, Z **e, Z Wang, B Hooi - arxiv preprint arxiv:2501.08603, 2025 - arxiv.org
Handcrafting heuristics for solving complex planning tasks (eg, NP-hard combinatorial
optimization (CO) problems) is a common practice but requires extensive domain …

CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems

C Hua, F Berto, J Son, S Kang, C Kwon… - arxiv preprint arxiv …, 2025 - arxiv.org
The profiled vehicle routing problem (PVRP) is a generalization of the heterogeneous
capacitated vehicle routing problem (HCVRP) in which the objective is to optimize the routes …