A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

Named entity recognition and relation extraction: State-of-the-art

Z Nasar, SW Jaffry, MK Malik - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
With the advent of Web 2.0, there exist many online platforms that result in massive textual-
data production. With ever-increasing textual data at hand, it is of immense importance to …

Unified named entity recognition as word-word relation classification

J Li, H Fei, J Liu, S Wu, M Zhang, C Teng… - proceedings of the AAAI …, 2022 - ojs.aaai.org
So far, named entity recognition (NER) has been involved with three major types, including
flat, overlapped (aka. nested), and discontinuous NER, which have mostly been studied …

Ernie 3.0: Large-scale knowledge enhanced pre-training for language understanding and generation

Y Sun, S Wang, S Feng, S Ding, C Pang… - arxiv preprint arxiv …, 2021 - arxiv.org
Pre-trained models have achieved state-of-the-art results in various Natural Language
Processing (NLP) tasks. Recent works such as T5 and GPT-3 have shown that scaling up …

FLAT: Chinese NER using flat-lattice transformer

X Li, H Yan, X Qiu, X Huang - arxiv preprint arxiv:2004.11795, 2020 - arxiv.org
Recently, the character-word lattice structure has been proved to be effective for Chinese
named entity recognition (NER) by incorporating the word information. However, since the …

Chinese NER using lattice LSTM

Y Zhang, J Yang - arxiv preprint arxiv:1805.02023, 2018 - arxiv.org
We investigate a lattice-structured LSTM model for Chinese NER, which encodes a
sequence of input characters as well as all potential words that match a lexicon. Compared …

Locate and label: A two-stage identifier for nested named entity recognition

Y Shen, X Ma, Z Tan, S Zhang, W Wang… - arxiv preprint arxiv …, 2021 - arxiv.org
Named entity recognition (NER) is a well-studied task in natural language processing.
Traditional NER research only deals with flat entities and ignores nested entities. The span …

Chinesebert: Chinese pretraining enhanced by glyph and pinyin information

Z Sun, X Li, X Sun, Y Meng, X Ao, Q He, F Wu… - arxiv preprint arxiv …, 2021 - arxiv.org
Recent pretraining models in Chinese neglect two important aspects specific to the Chinese
language: glyph and pinyin, which carry significant syntax and semantic information for …

TENER: adapting transformer encoder for named entity recognition

H Yan, B Deng, X Li, X Qiu - arxiv preprint arxiv:1911.04474, 2019 - arxiv.org
The Bidirectional long short-term memory networks (BiLSTM) have been widely used as an
encoder in models solving the named entity recognition (NER) task. Recently, the …

A general framework for information extraction using dynamic span graphs

Y Luan, D Wadden, L He, A Shah, M Ostendorf… - arxiv preprint arxiv …, 2019 - arxiv.org
We introduce a general framework for several information extraction tasks that share span
representations using dynamically constructed span graphs. The graphs are constructed by …