Fundamental limitations of alignment in large language models

Y Wolf, N Wies, O Avnery, Y Levine… - ar** language models that interact with humans is aligning
their behavior to be useful and unharmful for their human users. This is usually achieved by …

Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?

P Mayilvahanan, T Wiedemer, E Rusak… - arxiv preprint arxiv …, 2023 - arxiv.org
Foundation models like CLIP are trained on hundreds of millions of samples and effortlessly
generalize to new tasks and inputs. Out of the box, CLIP shows stellar zero-shot and few …

Bard, ChatGPT and 3DGPT: a scientometric analysis of generative AI tools and assessment of implications for mechanical engineering education

KB Mustapha, EH Yap, YA Abakr - Interactive Technology and Smart …, 2024 - emerald.com
Purpose Following the recent rise in generative artificial intelligence (GenAI) tools,
fundamental questions about their wider impacts have started to reverberate around various …

In search of forgotten domain generalization

P Mayilvahanan, RS Zimmermann, T Wiedemer… - arxiv preprint arxiv …, 2024 - arxiv.org
Out-of-Domain (OOD) generalization is the ability of a model trained on one or more
domains to generalize to unseen domains. In the ImageNet era of computer vision …

Robust ai-generated text detection by restricted embeddings

K Kuznetsov, E Tulchinskii, L Kushnareva… - arxiv preprint arxiv …, 2024 - arxiv.org
Growing amount and quality of AI-generated texts makes detecting such content more
difficult. In most real-world scenarios, the domain (style and topic) of generated data and the …

Unknown Claims: Generation of Fact-Checking Training Examples from Unstructured and Structured Data

JF Bussotti, L Ragazzi, G Frisoni, G Moro… - Proceedings of the …, 2024 - aclanthology.org
Computational fact-checking (FC) relies on supervised models to verify claims based on
given evidence, requiring a resource-intensive process to annotate large volumes of training …

Modeling real-time interactive conversations as timed diarized transcripts

G Tanzer, G Ahdritz, L Melas-Kyriazi - arxiv preprint arxiv:2405.13203, 2024 - arxiv.org
Chatbots built upon language models have exploded in popularity, but they have largely
been limited to synchronous, turn-by-turn dialogues. In this paper we present a simple yet …

The economic advantage of computer vision over human labor, and its market implications

MS Svanberg - 2023 - dspace.mit.edu
With the emergence of Artificial Intelligence (AI), our lives and economy are under-going a
profound transformation. While there are huge benefits to be realized by the technology, we …

Tradeoffs Between Alignment and Helpfulness in Language Models with Representation Engineering

Y Wolf, N Wies, D Shteyman, B Rothberg… - arxiv preprint arxiv …, 2024 - arxiv.org
Language model alignment has become an important component of AI safety, allowing safe
interactions between humans and language models, by enhancing desired behaviors and …

Aligning Diffusion-Based Text-to-Image Models using Reinforcement Learning from Human Feedback

M Kristiansen, MM Vågen - 2023 - ntnuopen.ntnu.no
Denne avhandlingen utforsker krysningen mellom dype generative modeller og
forsterkningslæring, med søkelys på tilpasningen av diffusjonsbaserte tekst-til-bilde …