Knowledge editing for large language models: A survey

S Wang, Y Zhu, H Liu, Z Zheng, C Chen, J Li - ACM Computing Surveys, 2024 - dl.acm.org
Large Language Models (LLMs) have recently transformed both the academic and industrial
landscapes due to their remarkable capacity to understand, analyze, and generate texts …

A review on language models as knowledge bases

B AlKhamissi, M Li, A Celikyilmaz, M Diab… - arXiv preprint arXiv …, 2022 - arxiv.org
Recently, there has been a surge of interest in the NLP community in the use of pretrained
Language Models (LMs) as Knowledge Bases (KBs). Researchers have shown that LMs …

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arXiv preprint arXiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Locating and editing factual associations in GPT

K Meng, D Bau, A Andonian… - Advances in Neural …, 2022 - proceedings.neurips.cc
We analyze the storage and recall of factual associations in autoregressive transformer
language models, finding evidence that these associations correspond to localized, directly …

GLM-130B: An open bilingual pre-trained model

A Zeng, X Liu, Z Du, Z Wang, H Lai, M Ding… - arXiv preprint arXiv …, 2022 - arxiv.org
We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model
with 130 billion parameters. It is an attempt to open-source a 100B-scale model at least as …

Editing large language models: Problems, methods, and opportunities

Y Yao, P Wang, B Tian, S Cheng, Z Li, S Deng… - arXiv preprint arXiv …, 2023 - arxiv.org
Despite the ability to train capable LLMs, the methodology for maintaining their relevancy
and rectifying errors remains elusive. To this end, the past few years have witnessed a surge …

Does localization inform editing? Surprising differences in causality-based localization vs. knowledge editing in language models

P Hase, M Bansal, B Kim… - Advances in Neural …, 2024 - proceedings.neurips.cc
Language models learn a great quantity of factual information during pretraining,
and recent work localizes this information to specific model weights like mid-layer MLP …

Editing factual knowledge in language models

N De Cao, W Aziz, I Titov - arXiv preprint arXiv:2104.08164, 2021 - arxiv.org
The factual knowledge acquired during pre-training and stored in the parameters of
Language Models (LMs) can be useful in downstream tasks (e.g., question answering or …

MQuAKE: Assessing knowledge editing in language models via multi-hop questions

Z Zhong, Z Wu, CD Manning, C Potts… - arXiv preprint arXiv …, 2023 - arxiv.org
The information stored in large language models (LLMs) falls out of date quickly, and
retraining from scratch is often not an option. This has recently given rise to a range of …

Time-aware language models as temporal knowledge bases

B Dhingra, JR Cole, JM Eisenschlos… - Transactions of the …, 2022 - direct.mit.edu
Many facts come with an expiration date, from the name of the President to the basketball
team LeBron James plays for. However, most language models (LMs) are trained on …