A literature survey of recent advances in chatbots

G Caldarini, S Jaf, K McGarry - Information, 2022 - mdpi.com
Chatbots are intelligent conversational computer systems designed to mimic human
conversation to enable automated online guidance and support. The increased benefits of …

Conversational agents: Goals, technologies, vision and challenges

M Allouch, A Azaria, R Azoulay - Sensors, 2021 - mdpi.com
In recent years, conversational agents (CAs) have become ubiquitous and are a presence in
our daily routines. It seems that the technology has finally ripened to advance the use of CAs …

Gpt-neox-20b: An open-source autoregressive language model

S Black, S Biderman, E Hallahan, Q Anthony… - arxiv preprint arxiv …, 2022 - arxiv.org
We introduce GPT-NeoX-20B, a 20 billion parameter autoregressive language model
trained on the Pile, whose weights will be made freely and openly available to the public …

Call for Papers--The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

A Warstadt, L Choshen, A Mueller, A Williams… - arxiv preprint arxiv …, 2023 - arxiv.org
We present the call for papers for the BabyLM Challenge: Sample-efficient pretraining on a
developmentally plausible corpus. This shared task is intended for participants with an …

The bigscience roots corpus: A 1.6 tb composite multilingual dataset

H Laurençon, L Saulnier, T Wang… - Advances in …, 2022 - proceedings.neurips.cc
As language models grow ever larger, the need for large-scale high-quality text datasets has
never been more pressing, especially in multilingual settings. The BigScience workshop, a 1 …

An introduction to deep learning in natural language processing: Models, techniques, and tools

I Lauriola, A Lavelli, F Aiolli - Neurocomputing, 2022 - Elsevier
Abstract Natural Language Processing (NLP) is a branch of artificial intelligence that
involves the design and implementation of systems and algorithms able to interact through …

Branch-train-merge: Embarrassingly parallel training of expert language models

M Li, S Gururangan, T Dettmers, M Lewis… - arxiv preprint arxiv …, 2022 - arxiv.org
We present Branch-Train-Merge (BTM), a communication-efficient algorithm for
embarrassingly parallel training of large language models (LLMs). We show it is possible to …

mgpt: Few-shot learners go multilingual

O Shliazhko, A Fenogenova, M Tikhonova… - Transactions of the …, 2024 - direct.mit.edu
This paper introduces mGPT, a multilingual variant of GPT-3, pretrained on 61 languages
from 25 linguistically diverse language families using Wikipedia and the C4 Corpus. We …

Making monolingual sentence embeddings multilingual using knowledge distillation

N Reimers, I Gurevych - arxiv preprint arxiv:2004.09813, 2020 - arxiv.org
We present an easy and efficient method to extend existing sentence embedding models to
new languages. This allows to create multilingual versions from previously monolingual …

What artificial neural networks can tell us about human language acquisition

A Warstadt, SR Bowman - Algebraic structures in natural …, 2022 - taylorfrancis.com
Rapid progress in machine learning for natural language processing has the potential to
transform debates about how humans learn language. However, the learning environments …