Академия Google

BC Das, MH Amini, Y Wu - ACM Computing Surveys, 2025 - dl.acm.org

Large language models (LLMs) have demonstrated extraordinary capabilities and
contributed to multiple fields, such as generating and summarizing text, language …

Сохранить Цитировать Цитируется: 97 Похожие статьи Все версии статьи (3)

[Free GPT-4]
[DeepSeek]

[PDF] wiley.com Full View

Combating misinformation in the age of llms: Opportunities and challenges

C Chen, K Shu - AI Magazine, 2024 - Wiley Online Library

Misinformation such as fake news and rumors is a serious threat for information ecosystems
and public trust. The emergence of large language models (LLMs) has great potential to …

Сохранить Цитировать Цитируется: 129 Похожие статьи Все версии статьи (4)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey of large language models

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arxiv preprint arxiv …, 2023 - arxiv.org

Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …

Сохранить Цитировать Цитируется: 3629 Похожие статьи Все версии статьи (4) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Trustllm: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu, Q Zhang, Y Li… - arxiv preprint arxiv …, 2024 - arxiv.org

Large language models (LLMs), exemplified by ChatGPT, have gained considerable
attention for their excellent natural language processing capabilities. Nonetheless, these …

Сохранить Цитировать Цитируется: 251 Похожие статьи Все версии статьи (4) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Jailbreak and guard aligned language models with only few in-context demonstrations

Z Wei, Y Wang, A Li, Y Mo, Y Wang - arxiv preprint arxiv:2310.06387, 2023 - arxiv.org

Large Language Models (LLMs) have shown remarkable success in various tasks, yet their
safety and the risk of generating harmful content remain pressing concerns. In this paper, we …

Сохранить Цитировать Цитируется: 183 Похожие статьи Все версии статьи (2) В виде HTML

[Free GPT-4]
[DeepSeek]

[HTML] mlr.press

[HTML][HTML] Position: TrustLLM: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu… - International …, 2024 - proceedings.mlr.press

Large language models (LLMs) have gained considerable attention for their excellent
natural language processing capabilities. Nonetheless, these LLMs present many …

Сохранить Цитировать Цитируется: 46 Похожие статьи Сохраненная копия

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Jailbreaking black box large language models in twenty queries

P Chao, A Robey, E Dobriban, H Hassani… - arxiv preprint arxiv …, 2023 - arxiv.org

There is growing interest in ensuring that large language models (LLMs) align with human
values. However, the alignment of such models is vulnerable to adversarial jailbreaks, which …

Сохранить Цитировать Цитируется: 448 Похожие статьи Все версии статьи (4) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arxiv preprint arxiv …, 2024 - arxiv.org

This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

Сохранить Цитировать Цитируется: 124 Похожие статьи Все версии статьи (3) В виде HTML

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] From cobit to iso 42001: Evaluating cybersecurity frameworks for opportunities, risks, and regulatory compliance in commercializing large language models

TR McIntosh, T Susnjak, T Liu, P Watters, D Xu… - Computers & …, 2024 - Elsevier

This study investigated the integration readiness of four predominant cybersecurity
Governance, Risk and Compliance (GRC) frameworks–NIST CSF 2.0, COBIT 2019, ISO …

Сохранить Цитировать Цитируется: 69 Похожие статьи Все версии статьи (3)

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

On the adversarial robustness of multi-modal foundation models

C Schlarmann, M Hein - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Multi-modal foundation models combining vision and language models such as Flamingo or
GPT-4 have recently gained enormous interest. Alignment of foundation models is used to …

Сохранить Цитировать Цитируется: 91 Похожие статьи Все версии статьи (5) В виде HTML

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Security and privacy challenges of large language models: A survey

Combating misinformation in the age of llms: Opportunities and challenges

A survey of large language models

Trustllm: Trustworthiness in large language models

Jailbreak and guard aligned language models with only few in-context demonstrations

[HTML][HTML] Position: TrustLLM: Trustworthiness in large language models

Jailbreaking black box large language models in twenty queries

Foundational challenges in assuring alignment and safety of large language models

[HTML][HTML] From cobit to iso 42001: Evaluating cybersecurity frameworks for opportunities, risks, and regulatory compliance in commercializing large language models

On the adversarial robustness of multi-modal foundation models