Securing large language models: Addressing bias, misinformation, and prompt attacks

B Peng, K Chen, M Li, P Feng, Z Bi, J Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) demonstrate impressive capabilities across various fields,
yet their increasing use raises critical security concerns. This article reviews recent literature …

The landscape of emerging ai agent architectures for reasoning, planning, and tool calling: A survey

T Masterman, S Besen, M Sawtell, A Chao - arxiv preprint arxiv …, 2024 - arxiv.org
This survey paper examines the recent advancements in AI agent implementations, with a
focus on their ability to achieve complex goals that require enhanced reasoning, planning …

Rethinking open source generative AI: open-washing and the EU AI Act

A Liesenfeld, M Dingemanse - … of the 2024 ACM Conference on …, 2024 - dl.acm.org
The past year has seen a steep rise in generative AI systems that claim to be open. But how
open are they really? The question of what counts as open source in generative AI is poised …

Multi-layer transformers gradient can be approximated in almost linear time

Y Liang, Z Sha, Z Shi, Z Song, Y Zhou - arxiv preprint arxiv:2408.13233, 2024 - arxiv.org
The computational complexity of the self-attention mechanism in popular transformer
architectures poses significant challenges for training and inference, and becomes the …

When search engine services meet large language models: visions and challenges

H **ong, J Bian, Y Li, X Li, M Du… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Combining Large Language Models (LLMs) with search engine services marks a significant
shift in the field of services computing, opening up new possibilities to enhance how we …

Shieldgemma: Generative ai content moderation based on gemma

W Zeng, Y Liu, R Mullins, L Peran, J Fernandez… - arxiv preprint arxiv …, 2024 - arxiv.org
We present ShieldGemma, a comprehensive suite of LLM-based safety content moderation
models built upon Gemma2. These models provide robust, state-of-the-art predictions of …

Learning from failure: Integrating negative examples when fine-tuning large language models as agents

R Wang, H Li, X Han, Y Zhang, T Baldwin - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) have achieved success in acting as agents, which interact
with environments through tools such as search engines. However, LLMs are optimized for …

Position: Evolving AI collectives enhance human diversity and enable self-regulation

S Lai, Y Potter, J Kim, R Zhuang, D Song… - Forty-first International …, 2024 - openreview.net
Large language model behavior is shaped by the language of those with whom they
interact. This capacity and their increasing prevalence online portend that they will …

Towards accurate and efficient document analytics with large language models

Y Lin, M Hulsebos, R Ma, S Shankar… - arxiv preprint arxiv …, 2024 - arxiv.org
Unstructured data formats account for over 80% of the data currently stored, and extracting
value from such formats remains a considerable challenge. In particular, current approaches …

Alcm: Autonomous llm-augmented causal discovery framework

E Khatibi, M Abbasian, Z Yang, I Azimi… - arxiv preprint arxiv …, 2024 - arxiv.org
To perform effective causal inference in high-dimensional datasets, initiating the process
with causal discovery is imperative, wherein a causal graph is generated based on …