AgentReview: Exploring Peer Review Dynamics with LLM Agents

Y Jin, Q Zhao, Y Wang, H Chen, K Zhu, Y Xiao… - arxiv preprint arxiv …, 2024 - arxiv.org
Peer review is fundamental to the integrity and advancement of scientific publication.
Traditional methods of peer review analyses often rely on exploration and statistics of …

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

S Zhang, Z Xu, P Liu, X Yu, Y Li, Q Gao, Z Fei… - arxiv preprint arxiv …, 2024 - arxiv.org
General-purposed embodied agents are designed to understand the users' natural
instructions or intentions and act precisely to complete universal tasks. Recently, methods …

LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation

B Li, Z Li, Q Du, J Luo, W Wang, Y Xie… - Advances in …, 2025 - proceedings.neurips.cc
Recent years have witnessed the rapid development of Neuro-Symbolic (NeSy) AI systems,
which integrate symbolic reasoning into deep neural networks. However, most of the …

A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities

Y Liu, X Cao, T Chen, Y Jiang, J You, M Wu… - arxiv preprint arxiv …, 2025 - arxiv.org
Healthcare systems worldwide face persistent challenges in efficiency, accessibility, and
personalization. Powered by modern AI technologies such as multimodal large language …

Don't Let Your Robot be Harmful: Responsible Robotic Manipulation

M Ni, L Zhang, Z Chen, W Zuo - arxiv preprint arxiv:2411.18289, 2024 - arxiv.org
Unthinking execution of human instructions in robotic manipulation can lead to severe safety
risks, such as poisonings, fires, and even explosions. In this paper, we present responsible …

HARBOR: Exploring Persona Dynamics in Multi-Agent Competition

K Jiang, L Xiong, F Liu - arxiv preprint arxiv:2502.12149, 2025 - arxiv.org
We investigate factors contributing to LLM agents' success in competitive multi-agent
environments, using auctions as a testbed where agents bid to maximize profit. The agents …

PlanGenLLMs: A Modern Survey of LLM Planning Capabilities

H Wei, Z Zhang, S He, T Xia, S Pan, F Liu - arxiv preprint arxiv …, 2025 - arxiv.org
LLMs have immense potential for generating plans, transforming an initial world state into a
desired goal state. A large body of research has explored the use of LLMs for various …

Collab-Overcooked: Benchmarking and Evaluating Large Language Models as Collaborative Agents

H Sun, S Zhang, L Ren, H Xu, H Fu, C Yuan… - arxiv preprint arxiv …, 2025 - arxiv.org
Large language models (LLMs) based agent systems have made great strides in real-world
applications beyond traditional NLP tasks. This paper proposes a new LLM-powered Multi …

From Screens to Scenes: A Survey of Embodied AI in Healthcare

Y Liu, X Cao, T Chen, Y Jiang, J You… - arxiv preprint arxiv …, 2025 - researchgate.net
Healthcare systems worldwide face persistent challenges in efficiency, accessibility, and
personalization. Modern artificial intelligence (AI) has shown promise in addressing these …

On the Limit of Language Models as Planning Formalizers

C Huang, L Zhang - arxiv preprint arxiv:2412.09879, 2024 - arxiv.org
Large Language Models have been shown to fail to create executable and verifiable plans
in grounded environments. An emerging line of work shows success in using LLM as a …