Agent lumos: Unified and modular training for open-source language agents

D Yin, F Brahman, A Ravichander… - Proceedings of the …, 2024 - aclanthology.org
Closed-source agents suffer from several issues such as a lack of affordability, transparency,
and reproducibility, particularly on complex interactive tasks. This motivates the …

Lumos: Learning agents with unified data, modular design, and open-source llms

D Yin, F Brahman, A Ravichander… - ICLR 2024 Workshop …, 2023 - openreview.net
We introduce Lumos, a novel framework for training language agents that employs a unified
data format and a modular architecture based on open-source large language models …

Put your money where your mouth is: Evaluating strategic planning and execution of llm agents in an auction arena

J Chen, S Yuan, R Ye, BP Majumder… - arxiv preprint arxiv …, 2023 - arxiv.org
Recent advancements in Large Language Models (LLMs) showcase advanced reasoning,
yet NLP evaluations often depend on static benchmarks. Evaluating this necessitates …

Deep learning for trajectory data management and mining: A survey and beyond

W Chen, Y Liang, Y Zhu, Y Chang, K Luo… - arxiv preprint arxiv …, 2024 - arxiv.org
Trajectory computing is a pivotal domain encompassing trajectory data management and
mining, garnering widespread attention due to its crucial role in various practical …

Q*: Improving multi-step reasoning for llms with deliberative planning

C Wang, Y Deng, Z Lyu, L Zeng, J He, S Yan… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) have demonstrated impressive capability in many natural
language tasks. However, the auto-regressive generation process makes LLMs prone to …

Urban foundation models: A survey

W Zhang, J Han, Z Xu, H Ni, H Liu… - Proceedings of the 30th …, 2024 - dl.acm.org
Machine learning techniques are now integral to the advancement of intelligent urban
services, playing a crucial role in elevating the efficiency, sustainability, and livability of …

LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench

K Valmeekam, K Stechly, S Kambhampati - arxiv preprint arxiv …, 2024 - arxiv.org
The ability to plan a course of action that achieves a desired state of affairs has long been
considered a core competence of intelligent agents and has been an integral part of AI …

Exploring automated energy optimization with unstructured building data: A multi-agent based framework leveraging large language models

T **ao, P Xu - Energy and Buildings, 2024 - Elsevier
The building sector is a significant energy consumer, making building energy optimization
crucial for reducing energy demand. Automating energy optimization tasks eases the …

Autoflow: Automated workflow generation for large language model agents

Z Li, S Xu, K Mei, W Hua, B Rama, O Raheja… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advancements in Large Language Models (LLMs) have shown significant progress
in understanding complex natural language. One important application of LLM is LLM-based …

Recommendation with generative models

Y Deldjoo, Z He, J McAuley, A Korikov… - arxiv preprint arxiv …, 2024 - arxiv.org
Generative models are a class of AI models capable of creating new instances of data by
learning and sampling from their statistical distributions. In recent years, these models have …