Language model behavior: A comprehensive survey

TA Chang, BK Bergen - Computational Linguistics, 2024 - direct.mit.edu
Transformer language models have received widespread public attention, yet their
generated text is often surprising even to NLP researchers. In this survey, we discuss over …

Large language models meet text-centric multimodal sentiment analysis: A survey

H Yang, Y Zhao, Y Wu, S Wang, T Zheng… - arxiv preprint arxiv …, 2024 - arxiv.org
Compared to traditional sentiment analysis, which only considers text, multimodal sentiment
analysis needs to consider emotional signals from multimodal sources simultaneously and …

The Llama 3 herd of models

A Dubey, A Jauhri, A Pandey, A Kadian… - arxiv preprint arxiv …, 2024 - arxiv.org
Modern artificial intelligence (AI) systems are powered by foundation models. This paper
presents a new set of foundation models, called Llama 3. It is a herd of language models …

WizardCoder: Empowering code large language models with Evol-Instruct

Z Luo, C Xu, P Zhao, Q Sun, X Geng, W Hu… - arxiv preprint arxiv …, 2023 - arxiv.org
Code Large Language Models (Code LLMs), such as StarCoder, have demonstrated
exceptional performance in code-related tasks. However, most existing models are solely …

Prompting GPT-3 to be reliable

C Si, Z Gan, Z Yang, S Wang, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
Large language models (LLMs) show impressive abilities via few-shot prompting.
Commercialized APIs such as OpenAI GPT-3 further increase their use in real-world …

Is ChatGPT a good sentiment analyzer? A preliminary study

Z Wang, Q Xie, Y Feng, Z Ding, Z Yang… - arxiv preprint arxiv …, 2023 - arxiv.org
Recently, ChatGPT has drawn great attention from both the research community and the
public. We are particularly interested in whether it can serve as a universal sentiment …

Self-evaluation guided beam search for reasoning

Y Xie, K Kawaguchi, Y Zhao, JX Zhao… - Advances in …, 2023 - proceedings.neurips.cc
Breaking down a problem into intermediate steps has demonstrated impressive
performance in Large Language Model (LLM) reasoning. However, the growth of the …

DreamLLM: Synergistic multimodal comprehension and creation

R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang… - arxiv preprint arxiv …, 2023 - arxiv.org
This paper presents DreamLLM, a learning framework that first achieves versatile
Multimodal Large Language Models (MLLMs) empowered with frequently overlooked …

Long range language modeling via gated state spaces

H Mehta, A Gupta, A Cutkosky, B Neyshabur - arxiv preprint arxiv …, 2022 - arxiv.org
State space models have shown to be effective at modeling long range dependencies,
specially on sequence classification tasks. In this work we focus on autoregressive …

SatLM: Satisfiability-aided language models using declarative prompting

X Ye, Q Chen, I Dillig, G Durrett - Advances in Neural …, 2023 - proceedings.neurips.cc
Prior work has combined chain-of-thought prompting in large language models (LLMs) with
programmatic representations to perform effective and transparent reasoning. While such an …