A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

Datasets for large language models: A comprehensive survey

Y Liu, J Cao, C Liu, K Ding, L ** - arxiv preprint arxiv:2402.18041, 2024 - arxiv.org
This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …

Text-to-sql empowered by large language models: A benchmark evaluation

D Gao, H Wang, Y Li, X Sun, Y Qian, B Ding… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) have emerged as a new paradigm for Text-to-SQL task.
However, the absence of a systematical benchmark inhibits the development of designing …

Qwen2. 5-coder technical report

B Hui, J Yang, Z Cui, J Yang, D Liu, L Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
In this report, we introduce the Qwen2. 5-Coder series, a significant upgrade from its
predecessor, CodeQwen1. 5. This series includes six models: Qwen2. 5-Coder-(0.5 B/1.5 …

GPTEval: A survey on assessments of ChatGPT and GPT-4

R Mao, G Chen, X Zhang, F Guerin… - arxiv preprint arxiv …, 2023 - arxiv.org
The emergence of ChatGPT has generated much speculation in the press about its potential
to disrupt social and economic systems. Its astonishing language ability has aroused strong …

A survey of large language models for code: Evolution, benchmarking, and future trends

Z Zheng, K Ning, Y Wang, J Zhang, D Zheng… - arxiv preprint arxiv …, 2023 - arxiv.org
General large language models (LLMs), represented by ChatGPT, have demonstrated
significant potential in tasks such as code generation in software engineering. This has led …

Domain specialization as the key to make large language models disruptive: A comprehensive survey

C Ling, X Zhao, J Lu, C Deng, C Zheng, J Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) have significantly advanced the field of natural language
processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of …

Sql-palm: Improved large language model adaptation for text-to-sql (extended)

R Sun, SÖ Arik, A Muzio, L Miculicich… - arxiv preprint arxiv …, 2023 - arxiv.org
Text-to-SQL, the process of translating natural language into Structured Query Language
(SQL), represents a transformative application of large language models (LLMs), potentially …

Universal self-consistency for large language model generation

X Chen, R Aksitov, U Alon, J Ren, K **ao, P Yin… - arxiv preprint arxiv …, 2023 - arxiv.org
Self-consistency with chain-of-thought prompting (CoT) has demonstrated remarkable
performance gains on various challenging tasks, by utilizing multiple reasoning paths …

Metatool benchmark for large language models: Deciding whether to use tools and which to use

Y Huang, J Shi, Y Li, C Fan, S Wu, Q Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) have garnered significant attention due to their impressive
natural language processing (NLP) capabilities. Recently, many studies have focused on …