Dynamic automaton-guided reward shaping for monte carlo tree search A Velasquez, B Bissey, L Barak, A Beckus, I Alkhouri, D Melcer, G Atia Proceedings of the AAAI Conference on Artificial Intelligence 35 (13), 12015 …, 2021 | 20 | 2021 |
Shield decentralization for safe multi-agent reinforcement learning D Melcer, C Amato, S Tripakis Advances in Neural Information Processing Systems 35, 13367-13379, 2022 | 16 | 2022 |
Constrained Decoding for Code Language Models via Efficient Left and Right Quotienting of Context-Sensitive Grammars D Melcer, N Fulton, SK Gouda, H Qian arXiv preprint arXiv:2402.17988, 2024 | 3 | 2024 |
Multi-agent tree search with dynamic reward shaping A Velasquez, B Bissey, L Barak, D Melcer, A Beckus, I Alkhouri, G Atia Proceedings of the International Conference on Automated Planning and …, 2022 | 2 | 2022 |
Verification-guided tree search A Velasquez, D Melcer Proceedings of the 19th International Conference on Autonomous Agents and …, 2020 | 2 | 2020 |
Shield Decentralization for Safe Reinforcement Learning in General Partially Observable Multi-Agent Environments D Melcer, C Amato, S Tripakis Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024 | 1 | 2024 |
Approximately Aligned Decoding D Melcer, S Gonugondla, P Perera, H Qian, WH Chiang, Y Wang, N Jain, ... arXiv preprint arXiv:2410.01103, 2024 | | 2024 |
Shield Decomposition for Safe Reinforcement Learning in General Partially Observable Multi-Agent Environments D Melcer, C Amato, S Tripakis Reinforcement Learning Conference, 2024 | | 2024 |
ProofViz: An Interactive Visual Proof Explorer D Melcer, S Chang Trends in Functional Programming: 22nd International Symposium, TFP 2021 …, 2021 | | 2021 |