[HTML][HTML] A survey of GPT-3 family large language models including ChatGPT and GPT-4

KS Kalyan - Natural Language Processing Journal, 2024 - Elsevier
Large language models (LLMs) are a special class of pretrained language models (PLMs)
obtained by scaling model size, pretraining corpus and computation. LLMs, because of their …

Large language models for generative information extraction: A survey

D Xu, W Chen, W Peng, C Zhang, T Xu, X Zhao… - Frontiers of Computer …, 2024 - Springer
Abstract Information Extraction (IE) aims to extract structural knowledge from plain natural
language texts. Recently, generative Large Language Models (LLMs) have demonstrated …

Text classification via large language models

X Sun, X Li, J Li, F Wu, S Guo, T Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Despite the remarkable success of large-scale Language Models (LLMs) such as GPT-3,
their performances still significantly underperform fine-tuned models in the task of text …

Tcra-llm: Token compression retrieval augmented large language model for inference cost reduction

J Liu, L Li, T **ang, B Wang, Y Qian - arxiv preprint arxiv:2310.15556, 2023 - arxiv.org
Since ChatGPT released its API for public use, the number of applications built on top of
commercial large language models (LLMs) increase exponentially. One popular usage of …

Having beer after prayer? measuring cultural bias in large language models

T Naous, MJ Ryan, A Ritter, W Xu - arxiv preprint arxiv:2305.14456, 2023 - arxiv.org
As the reach of large language models (LMs) expands globally, their ability to cater to
diverse cultural contexts becomes crucial. Despite advancements in multilingual …

Llmaaa: Making large language models as active annotators

R Zhang, Y Li, Y Ma, M Zhou, L Zou - arxiv preprint arxiv:2310.19596, 2023 - arxiv.org
Prevalent supervised learning methods in natural language processing (NLP) are
notoriously data-hungry, which demand large amounts of high-quality annotated data. In …

Retrieval-style in-context learning for few-shot hierarchical text classification

H Chen, Y Zhao, Z Chen, M Wang, L Li… - Transactions of the …, 2024 - direct.mit.edu
Hierarchical text classification (HTC) is an important task with broad applications, and few-
shot HTC has gained increasing interest recently. While in-context learning (ICL) with large …

A simple llm framework for long-range video question-answering

C Zhang, T Lu, MM Islam, Z Wang, S Yu… - arxiv preprint arxiv …, 2023 - arxiv.org
We present LLoVi, a language-based framework for long-range video question-answering
(LVQA). Unlike prior long-range video understanding methods, which are often costly and …

Empirical study of zero-shot ner with chatgpt

T **e, Q Li, J Zhang, Y Zhang, Z Liu, H Wang - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) exhibited powerful capability in various natural language
processing tasks. This work focuses on exploring LLM performance on zero-shot information …

Beyond factuality: A comprehensive evaluation of large language models as knowledge generators

L Chen, Y Deng, Y Bian, Z Qin, B Wu, TS Chua… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) outperform information retrieval techniques for downstream
knowledge-intensive tasks when being prompted to generate world knowledge. However …