Grounded language learning: Where robotics and nlp meet (invited talk)

C Matuszek - Proceedings of the International Joint Conference on …, 2018 - par.nsf.gov
Grounded language acquisition is concerned with learning the meaning of language as it
applies to the physical world. As robots become more capable and ubiquitous, there is an …

Meetup! a corpus of joint activity dialogues in a visual environment

N Ilinykh, S Zarrieß, D Schlangen - arxiv preprint arxiv:1907.05084, 2019 - arxiv.org
Building computer systems that can converse about their visual environment is one of the
oldest concerns of research in Artificial Intelligence and Computational Linguistics (see, for …

What are the goals of distributional semantics?

G Emerson - arxiv preprint arxiv:2005.02982, 2020 - arxiv.org
Distributional semantic models have become a mainstay in NLP, providing useful features
for downstream tasks. However, assessing long-term progress requires explicit long-term …

Gated multi-task network for text classification

L **ao, H Zhang, W Chen - … of the 2018 Conference of the North …, 2018 - aclanthology.org
Multi-task learning with Convolutional Neural Network (CNN) has shown great success in
many Natural Language Processing (NLP) tasks. This success can be largely attributed to …

Enriching language models with visually-grounded word vectors and the Lancaster sensorimotor norms

C Kennington - Proceedings of the 25th conference on …, 2021 - aclanthology.org
Abstract Language models are trained only on text despite the fact that humans learn their
first language in a highly interactive and multimodal environment where the first set of …

Resolving References in Visually-Grounded Dialogue via Text Generation

B Willemsen, L Qian, G Skantze - arxiv preprint arxiv:2309.13430, 2023 - arxiv.org
Vision-language models (VLMs) have shown to be effective at image retrieval based on
simple text queries, but text-image retrieval based on conversational input remains a …

Crossmodal Language Comprehension—Psycholinguistic Insights and Computational Approaches

Ö Alaçam, X Li, W Menzel, T Staron - Frontiers in neurorobotics, 2020 - frontiersin.org
Crossmodal interaction in situated language comprehension is important for effective and
efficient communication. The relationship between linguistic and visual stimuli provides …

Decoding strategies for neural referring expression generation

S Zarrieß, D Schlangen - … of the 11th International Conference on …, 2018 - aclanthology.org
RNN-based sequence generation is now widely used in NLP and NLG (natural language
generation). Most work focusses on how to train RNNs, even though also decoding is not …

Affordance-based robot object retrieval

T Nguyen, N Gopalan, R Patel, M Corsaro, E Pavlick… - Autonomous …, 2022 - Springer
Natural language object retrieval is a highly useful yet challenging task for robots in human-
centric environments. Previous work has primarily focused on commands specifying the …

Grounding as a side‐effect of grounding

S Larsson - Topics in cognitive science, 2018 - Wiley Online Library
In relation to semantics,“grounding” has (at least) two relevant meanings.“Symbol
grounding” is the process of connecting symbols (eg, words) to perception and the …