[HTML][HTML] A survey of GPT-3 family large language models including ChatGPT and GPT-4
KS Kalyan - Natural Language Processing Journal, 2024 - Elsevier
Large language models (LLMs) are a special class of pretrained language models (PLMs)
obtained by scaling model size, pretraining corpus and computation. LLMs, because of their …
obtained by scaling model size, pretraining corpus and computation. LLMs, because of their …
Large language models for generative information extraction: A survey
Abstract Information Extraction (IE) aims to extract structural knowledge from plain natural
language texts. Recently, generative Large Language Models (LLMs) have demonstrated …
language texts. Recently, generative Large Language Models (LLMs) have demonstrated …
Text classification via large language models
Despite the remarkable success of large-scale Language Models (LLMs) such as GPT-3,
their performances still significantly underperform fine-tuned models in the task of text …
their performances still significantly underperform fine-tuned models in the task of text …
Tcra-llm: Token compression retrieval augmented large language model for inference cost reduction
Since ChatGPT released its API for public use, the number of applications built on top of
commercial large language models (LLMs) increase exponentially. One popular usage of …
commercial large language models (LLMs) increase exponentially. One popular usage of …
Having beer after prayer? measuring cultural bias in large language models
As the reach of large language models (LMs) expands globally, their ability to cater to
diverse cultural contexts becomes crucial. Despite advancements in multilingual …
diverse cultural contexts becomes crucial. Despite advancements in multilingual …
Llmaaa: Making large language models as active annotators
Prevalent supervised learning methods in natural language processing (NLP) are
notoriously data-hungry, which demand large amounts of high-quality annotated data. In …
notoriously data-hungry, which demand large amounts of high-quality annotated data. In …
Retrieval-style in-context learning for few-shot hierarchical text classification
Hierarchical text classification (HTC) is an important task with broad applications, and few-
shot HTC has gained increasing interest recently. While in-context learning (ICL) with large …
shot HTC has gained increasing interest recently. While in-context learning (ICL) with large …
A simple llm framework for long-range video question-answering
We present LLoVi, a language-based framework for long-range video question-answering
(LVQA). Unlike prior long-range video understanding methods, which are often costly and …
(LVQA). Unlike prior long-range video understanding methods, which are often costly and …
Empirical study of zero-shot ner with chatgpt
Large language models (LLMs) exhibited powerful capability in various natural language
processing tasks. This work focuses on exploring LLM performance on zero-shot information …
processing tasks. This work focuses on exploring LLM performance on zero-shot information …
Beyond factuality: A comprehensive evaluation of large language models as knowledge generators
Large language models (LLMs) outperform information retrieval techniques for downstream
knowledge-intensive tasks when being prompted to generate world knowledge. However …
knowledge-intensive tasks when being prompted to generate world knowledge. However …