- Academic Search

BC Das, MH Amini, Y Wu - ACM Computing Surveys, 2025 - dl.acm.org

Large language models (LLMs) have demonstrated extraordinary capabilities and
contributed to multiple fields, such as generating and summarizing text, language …

Save Cite Cited by 98 Related articles All 3 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] A survey on large language model (llm) security and privacy: The good, the bad, and the ugly

Y Yao, J Duan, K Xu, Y Cai, Z Sun, Y Zhang - High-Confidence Computing, 2024 - Elsevier

Abstract Large Language Models (LLMs), such as ChatGPT and Bard, have revolutionized
natural language understanding and generation. They possess deep language …

Save Cite Cited by 552 Related articles All 11 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Large language models are not fair evaluators

P Wang, L Li, L Chen, Z Cai, D Zhu, B Lin… - arxiv preprint arxiv …, 2023 - arxiv.org

In this paper, we uncover a systematic bias in the evaluation paradigm of adopting large
language models~(LLMs), eg, GPT-4, as a referee to score and compare the quality of …

Save Cite Cited by 379 Related articles All 2 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arxiv preprint arxiv …, 2024 - arxiv.org

This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

Save Cite Cited by 124 Related articles All 3 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Trustworthy LLMs: A survey and guideline for evaluating large language models' alignment

Y Liu, Y Yao, JF Ton, X Zhang, RGH Cheng… - arxiv preprint arxiv …, 2023 - arxiv.org

Ensuring alignment, which refers to making models behave in accordance with human
intentions [1, 2], has become a critical task before deploying large language models (LLMs) …

Save Cite Cited by 285 Related articles All 3 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Frontier AI regulation: Managing emerging risks to public safety

M Anderljung, J Barnhart, A Korinek, J Leung… - arxiv preprint arxiv …, 2023 - arxiv.org

Advanced AI models hold the promise of tremendous benefits for humanity, but society
needs to proactively manage the accompanying risks. In this paper, we focus on what we …

Save Cite Cited by 123 Related articles All 5 versions Free GPT-4 DeepSeek Library Search View as HTML

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Impact of AI assistance on student agency

A Darvishi, H Khosravi, S Sadiq, D Gašević… - Computers & …, 2024 - Elsevier

AI-powered learning technologies are increasingly being used to automate and scaffold
learning activities (eg, personalised reminders for completing tasks, automated real-time …

Save Cite Cited by 154 Related articles All 7 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] pubpub.org

[PDF][PDF] Ai transparency in the age of llms: A human-centered research roadmap

QV Liao, JW Vaughan - arxiv preprint arxiv:2306.01941, 2023 - assets.pubpub.org

The rise of powerful large language models (LLMs) brings about tremendous opportunities
for innovation but also looming risks for individuals and society at large. We have reached a …

Save Cite Cited by 139 Related articles All 6 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Who validates the validators? aligning llm-assisted evaluation of llm outputs with human preferences

S Shankar, JD Zamfirescu-Pereira… - Proceedings of the 37th …, 2024 - dl.acm.org

Due to the cumbersome nature of human evaluation and limitations of code-based
evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in …

Save Cite Cited by 53 Related articles All 2 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

On protecting the data privacy of large language models (llms): A survey

B Yan, K Li, M Xu, Y Dong, Y Zhang, Z Ren… - arxiv preprint arxiv …, 2024 - arxiv.org

Large language models (LLMs) are complex artificial intelligence systems capable of
understanding, generating and translating human language. They learn language patterns …

Save Cite Cited by 76 Related articles All 4 versions Free GPT-4 DeepSeek View as HTML

Cite

Advanced search

Saved to My library

Security and privacy challenges of large language models: A survey

[HTML][HTML] A survey on large language model (llm) security and privacy: The good, the bad, and the ugly

Large language models are not fair evaluators

Foundational challenges in assuring alignment and safety of large language models

Trustworthy LLMs: A survey and guideline for evaluating large language models' alignment

Frontier AI regulation: Managing emerging risks to public safety

[HTML][HTML] Impact of AI assistance on student agency

[PDF][PDF] Ai transparency in the age of llms: A human-centered research roadmap

Who validates the validators? aligning llm-assisted evaluation of llm outputs with human preferences

On protecting the data privacy of large language models (llms): A survey