- Academic Search

C Barrett, B Boyd, E Bursztein, N Carlini… - … and Trends® in …, 2023 - nowpublishers.com

Every major technical invention resurfaces the dual-use dilemma—the new technology has
the potential to be used for good as well as for harm. Generative AI (GenAI) techniques, such …

Save Cite Cited by 86 Related articles All 7 versions Free GPT-4 Library Search Cached

[Free GPT-4]

[PDF] arxiv.org

Effective long-context scaling of foundation models

W ** Large
Multimodal Models (LMMs). The framework comprises meticulously curated datasets, a …

Save Cite Cited by 40 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Longbench: A bilingual, multitask benchmark for long context understanding

Y Bai, X Lv, J Zhang, H Lyu, J Tang, Z Huang… - arxiv preprint arxiv …, 2023 - arxiv.org

Although large language models (LLMs) demonstrate impressive performance for many
language tasks, most of them can only handle texts a few thousand tokens long, limiting their …

Save Cite Cited by 78 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Embrace divergence for richer insights: A multi-document summarization benchmark and a case study on summarizing diverse information from news articles

KH Huang, P Laban, AR Fabbri, PK Choubey… - arxiv preprint arxiv …, 2023 - arxiv.org

Previous research in multi-document news summarization has typically concentrated on
collating information that all sources agree upon. However, to our knowledge, the …

Save Cite Cited by 24 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

L-eval: Instituting standardized evaluation for long context language models

C An, S Gong, M Zhong, X Zhao, M Li, J Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org

Recently, there has been growing interest in extending the context length of large language
models (LLMs), aiming to effectively process long inputs of one turn or conversations with …

Save Cite Cited by 23 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] aclanthology.org

Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training

J He, K Pan, X Dong, Z Song, LYB LiuYiBo… - Proceedings of the …, 2024 - aclanthology.org

While large language models (LLMs) are equipped with longer text input capabilities than
before, they are struggling to seek correct information in long contexts. The “lost in the …

Save Cite Cited by 5 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] aaai.org

A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators

C Zhang, LF D'Haro, Y Chen, M Zhang… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

Automatic evaluation is an integral aspect of dialogue system research. The traditional
reference-based NLG metrics are generally found to be unsuitable for dialogue assessment …

Save Cite Cited by 16 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Never lost in the middle: Improving large language models via attention strengthening question answering

H Junqing, P Kunhao, D **aoqun, S Zhuoyang… - arxiv preprint arxiv …, 2023 - arxiv.org

While large language models (LLMs) are equipped with longer text input capabilities than
before, they are struggling to seek correct information in long contexts. The" lost in the …

Save Cite Cited by 14 Related articles View as HTML

Create alert

Cite

Advanced search

Saved to My library

Long sequence modeling with xgen: A 7b llm trained on 8k input sequence length

Identifying and mitigating the security risks of generative ai

Effective long-context scaling of foundation models

Longbench: A bilingual, multitask benchmark for long context understanding

Embrace divergence for richer insights: A multi-document summarization benchmark and a case study on summarizing diverse information from news articles

L-eval: Instituting standardized evaluation for long context language models

Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training

A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators

Never lost in the middle: Improving large language models via attention strengthening question answering