Security and privacy challenges of large language models: A survey

BC Das, MH Amini, Y Wu - ACM Computing Surveys, 2024 - dl.acm.org
Large language models (LLMs) have demonstrated extraordinary capabilities and
contributed to multiple fields, such as generating and summarizing text, language …

Tool learning with foundation models

Y Qin, S Hu, Y Lin, W Chen, N Ding, G Cui… - ACM Computing …, 2024 - dl.acm.org
Humans possess an extraordinary ability to create and utilize tools. With the advent of
foundation models, artificial intelligence systems have the potential to be equally adept in …

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

B Wang, W Chen, H Pei, C **e, M Kang, C Zhang, C Xu… - NeurIPS, 2023 - blogs.qub.ac.uk
Generative Pre-trained Transformer (GPT) models have exhibited exciting progress
in their capabilities, capturing the interest of practitioners and the public alike. Yet, while the …

Revisiting out-of-distribution robustness in NLP: Benchmarks, analysis, and LLMs evaluations

L Yuan, Y Chen, G Cui, H Gao, F Zou… - Advances in …, 2023 - proceedings.neurips.cc
This paper reexamines the research on out-of-distribution (OOD) robustness in the field of
NLP. We find that the distribution shift settings in previous studies commonly lack adequate …

Privacy in large language models: Attacks, defenses and future directions

H Li, Y Chen, J Luo, J Wang, H Peng, Y Kang… - arXiv preprint arXiv …, 2023 - arxiv.org
The advancement of large language models (LLMs) has significantly enhanced the ability to
effectively tackle various downstream NLP tasks and unify these tasks into generative …

Backdooring instruction-tuned large language models with virtual prompt injection

J Yan, V Yadav, S Li, L Chen, Z Tang… - Proceedings of the …, 2024 - aclanthology.org
Instruction-tuned Large Language Models (LLMs) have become a ubiquitous
platform for open-ended applications due to their ability to modulate responses based on …

PLMmark: A secure and robust black-box watermarking framework for pre-trained language models

P Li, P Cheng, F Li, W Du, H Zhao, G Liu - Proceedings of the AAAI …, 2023 - ojs.aaai.org
The huge training overhead, considerable commercial value, and various potential security
risks make it urgent to protect the intellectual property (IP) of Deep Neural Networks (DNNs) …

Attention-enhancing backdoor attacks against BERT-based models

W Lyu, S Zheng, L Pang, H Ling, C Chen - arXiv preprint arXiv:2310.14480, 2023 - arxiv.org
Recent studies have revealed that Backdoor Attacks can threaten the safety of natural
language processing (NLP) models. Investigating the strategies of backdoor attacks will help …

Representation in AI evaluations

AS Bergman, LA Hendricks, M Rauh, B Wu… - Proceedings of the …, 2023 - dl.acm.org
Calls for representation in artificial intelligence (AI) and machine learning (ML) are
widespread, with" representation" or" representativeness" generally understood to be both …

Setting the trap: Capturing and defeating backdoors in pretrained language models through honeypots

R Tang, J Yuan, Y Li, Z Liu… - Advances in Neural …, 2023 - proceedings.neurips.cc
In the field of natural language processing, the prevalent approach involves fine-tuning
pretrained language models (PLMs) using local samples. Recent research has exposed the …