Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

Large language models for generative information extraction: A survey

D Xu, W Chen, W Peng, C Zhang, T Xu, X Zhao… - Frontiers of Computer …, 2024 - Springer
Information Extraction (IE) aims to extract structural knowledge from plain natural
language texts. Recently, generative Large Language Models (LLMs) have demonstrated …

Holistic evaluation of language models

P Liang, R Bommasani, T Lee, D Tsipras… - arXiv preprint arXiv …, 2022 - arxiv.org
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

GLM-130B: An open bilingual pre-trained model

A Zeng, X Liu, Z Du, Z Wang, H Lai, M Ding… - arXiv preprint arXiv …, 2022 - arxiv.org
We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model
with 130 billion parameters. It is an attempt to open-source a 100B-scale model at least as …

GPT-RE: In-context learning for relation extraction using large language models

Z Wan, F Cheng, Z Mao, Q Liu, H Song, J Li… - arXiv preprint arXiv …, 2023 - arxiv.org
In spite of the potential for ground-breaking achievements offered by large language models
(LLMs) (e.g., GPT-3), they still lag significantly behind fully-supervised baselines (e.g., fine …

PromptNER: Prompting for named entity recognition

D Ashok, ZC Lipton - arXiv preprint arXiv:2305.15444, 2023 - arxiv.org
In a surprising turn, Large Language Models (LLMs) together with a growing arsenal of
prompt-based heuristics now offer powerful off-the-shelf approaches providing few-shot …

Generative knowledge graph construction: A review

H Ye, N Zhang, H Chen, H Chen - arXiv preprint arXiv:2210.12714, 2022 - arxiv.org
Generative Knowledge Graph Construction (KGC) refers to those methods that leverage the
sequence-to-sequence framework for building knowledge graphs, which is flexible and can …

Universal information extraction as unified semantic matching

J Lou, Y Lu, D Dai, W Jia, H Lin, X Han… - Proceedings of the AAAI …, 2023 - ojs.aaai.org
The challenge of information extraction (IE) lies in the diversity of label schemas and the
heterogeneity of structures. Traditional methods require task-specific model design and rely …

Revisiting large language models as zero-shot relation extractors

G Li, P Wang, W Ke - arXiv preprint arXiv:2310.05028, 2023 - arxiv.org
Relation extraction (RE) consistently involves a certain degree of labeled or unlabeled data
even if under zero-shot setting. Recent studies have shown that large language models …

Knowledge graphs meet multi-modal learning: A comprehensive survey

Z Chen, Y Zhang, Y Fang, Y Geng, L Guo… - arXiv preprint arXiv …, 2024 - arxiv.org
Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the
semantic web community's exploration into multi-modal dimensions unlocking new avenues …