[PDF][PDF] A survey of large language models

WX Zhao, K Zhou, J Li, T Tang… - arxiv preprint arxiv …, 2023 - paper-notes.zhjwpku.com
Ever since the Turing Test was proposed in the 1950s, humans have explored the mastering
of language intelligence by machine. Language is essentially a complex, intricate system of …

Drivelm: Driving with graph visual question answering

C Sima, K Renz, K Chitta, L Chen, H Zhang… - … on Computer Vision, 2024 - Springer
We study how vision-language models (VLMs) trained on web-scale data can be integrated
into end-to-end driving systems to boost generalization and enable interactivity with human …

Efficient large language models: A survey

Z Wan, X Wang, C Liu, S Alam, Y Zheng, J Liu… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) have demonstrated remarkable capabilities in important
tasks such as natural language understanding and language generation, and thus have the …

Toward self-improvement of llms via imagination, searching, and criticizing

Y Tian, B Peng, L Song, L **, D Yu… - Advances in Neural …, 2025 - proceedings.neurips.cc
Despite the impressive capabilities of Large Language Models (LLMs) on various tasks, they
still struggle with scenarios that involves complex reasoning and planning. Self-correction …

Task me anything

J Zhang, W Huang, Z Ma, O Michel, D He… - arxiv preprint arxiv …, 2024 - arxiv.org
Benchmarks for large multimodal language models (MLMs) now serve to simultaneously
assess the general capabilities of models instead of evaluating for a specific capability. As a …

Self-playing adversarial language game enhances llm reasoning

P Cheng, T Hu, H Xu, Z Zhang, Y Dai… - Advances in Neural …, 2025 - proceedings.neurips.cc
We explore the potential of self-play training for large language models (LLMs) in a two-
player adversarial language game called Adversarial Taboo. In this game, an attacker and a …

Chain of preference optimization: Improving chain-of-thought reasoning in llms

X Zhang, C Du, T Pang, Q Liu… - Advances in Neural …, 2025 - proceedings.neurips.cc
The recent development of chain-of-thought (CoT) decoding has enabled large language
models (LLMs) to generate explicit logical reasoning paths for complex problem-solving …

Ufo: A ui-focused agent for windows os interaction

C Zhang, L Li, S He, X Zhang, B Qiao, S Qin… - arxiv preprint arxiv …, 2024 - arxiv.org
We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to
applications on Windows OS, harnessing the capabilities of GPT-Vision. UFO employs a …

Xpert: Empowering incident management with query recommendations via large language models

Y Jiang, C Zhang, S He, Z Yang, M Ma, S Qin… - Proceedings of the …, 2024 - dl.acm.org
Large-scale cloud systems play a pivotal role in modern IT infrastructure. However, incidents
occurring within these systems can lead to service disruptions and adversely affect user …

Llama-berry: Pairwise optimization for o1-like olympiad-level mathematical reasoning

D Zhang, J Wu, J Lei, T Che, J Li, T **e… - arxiv preprint arxiv …, 2024 - arxiv.org
This paper presents an advanced mathematical problem-solving framework, LLaMA-Berry,
for enhancing the mathematical reasoning ability of Large Language Models (LLMs). The …