A comprehensive survey of scientific large language models and their applications in scientific discovery

Y Zhang, X Chen, B **, S Wang, S Ji, W Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
In many scientific fields, large language models (LLMs) have revolutionized the way text and
other modalities of data (eg, molecules and proteins) are handled, achieving superior …

Din-sql: Decomposed in-context learning of text-to-sql with self-correction

M Pourreza, D Rafiei - Advances in Neural Information …, 2023 - proceedings.neurips.cc
There is currently a significant gap between the performance of fine-tuned models and
prompting approaches using Large Language Models (LLMs) on the challenging task of text …

A survey of data augmentation approaches for NLP

SY Feng, V Gangal, J Wei, S Chandar… - arxiv preprint arxiv …, 2021 - arxiv.org
Data augmentation has recently seen increased interest in NLP due to more work in low-
resource domains, new tasks, and the popularity of large-scale neural networks that require …

Large language models are few (1)-shot table reasoners

W Chen - arxiv preprint arxiv:2210.06710, 2022 - arxiv.org
Recent literature has shown that large language models (LLMs) are generally excellent few-
shot reasoners to solve text reasoning tasks. However, the capability of LLMs on table …

Codes: Towards building open-source language models for text-to-sql

H Li, J Zhang, H Liu, J Fan, X Zhang, J Zhu… - Proceedings of the …, 2024 - dl.acm.org
Language models have shown promising performance on the task of translating natural
language questions into SQL queries (Text-to-SQL). However, most of the state-of-the-art …

Sql-palm: Improved large language model adaptation for text-to-sql (extended)

R Sun, SÖ Arik, A Muzio, L Miculicich… - arxiv preprint arxiv …, 2023 - arxiv.org
Text-to-SQL, the process of translating natural language into Structured Query Language
(SQL), represents a transformative application of large language models (LLMs), potentially …

Unifying the perspectives of nlp and software engineering: A survey on language models for code

Z Zhang, C Chen, B Liu, C Liao, Z Gong, H Yu… - arxiv preprint arxiv …, 2023 - arxiv.org
In this work we systematically review the recent advancements in software engineering with
language models, covering 70+ models, 40+ evaluation tasks, 180+ datasets, and 900 …

LGESQL: line graph enhanced text-to-SQL model with mixed local and non-local relations

R Cao, L Chen, Z Chen, Y Zhao, S Zhu, K Yu - arxiv preprint arxiv …, 2021 - arxiv.org
This work aims to tackle the challenging heterogeneous graph encoding problem in the text-
to-SQL task. Previous methods are typically node-centric and merely utilize different weight …

SmBoP: Semi-autoregressive bottom-up semantic parsing

O Rubin, J Berant - arxiv preprint arxiv:2010.12412, 2020 - arxiv.org
The de-facto standard decoding method for semantic parsing in recent years has been to
autoregressively decode the abstract syntax tree of the target program using a top-down …

How to prompt llms for text-to-sql: A study in zero-shot, single-domain, and cross-domain settings

S Chang, E Fosler-Lussier - arxiv preprint arxiv:2305.11853, 2023 - arxiv.org
Large language models (LLMs) with in-context learning have demonstrated remarkable
capability in the text-to-SQL task. Previous research has prompted LLMs with various …