- Academic Search

R Suzuki, H Yanaka, M Yoshikawa… - arxiv preprint arxiv …, 2019 - arxiv.org

A large amount of research about multimodal inference across text and vision has been
recently developed to obtain visually grounded word and sentence representations. In this …

Save Cite Cited by 19 Related articles All 11 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] cam.ac.uk

Functional distributional semantics: Learning linguistically informed representations from a precisely annotated corpus

G Emerson - 2018 - repository.cam.ac.uk

The aim of distributional semantics is to design computational techniques that can
automatically learn the meanings of words from a body of text. The twin challenges are: how …

Save Cite Cited by 15 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] aclanthology.org

Points, paths, and playscapes: Large-scale spatial language understanding tasks set in the real world

J Baldridge, T Bedrax-Weiss, D Luong… - Proceedings of the …, 2018 - aclanthology.org

Spatial language understanding is important for practical applications and as a building
block for better abstract language understanding. Much progress has been made through …

Save Cite Cited by 11 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] brighton.ac.uk

Learning to generate descriptions of visual data anchored in spatial relations

A Muscat, A Belz - IEEE Computational Intelligence Magazine, 2017 - ieeexplore.ieee.org

The explosive growth of visual data both online and offline in private and public repositories
has led to urgent requirements for better ways to index, search, retrieve, process and …

Save Cite Cited by 11 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] aclanthology.org

Transfer of isospace into a 3d environment for annotations and applications

A Henlein, G Abrami, A Kett… - … of the 16th Joint ACL-ISO …, 2020 - aclanthology.org

People's visual perception is very pronounced and therefore it is usually no problem for
them to describe the space around them in words. Conversely, people also have no …

Save Cite Cited by 6 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Natural language semantics with pictures: Some language & vision datasets and potential uses for computational semantics

D Schlangen - arxiv preprint arxiv:1904.07318, 2019 - arxiv.org

Propelling, and propelled by, the" deep learning revolution", recent years have seen the
introduction of ever larger corpora of images annotated with natural language expressions …

Save Cite Cited by 6 Related articles All 7 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] ieee.org

Natural language generation with computational intelligence [guest editorial]

JM Alonso, A Bugarin, E Reiter - IEEE Computational …, 2017 - ieeexplore.ieee.org

The articles in this special section focus on using natural language generation techniques
(NLG) and natural language processing (NLP) to build computational systems that generate …

Save Cite Cited by 8 Related articles All 7 versions Free GPT-4

[Free GPT-4]

[PDF] sagepub.com

The clarity and correctness of visualized thrust actions: a description and insights from users and experts

I Van der sluis, G Matoušková… - Visual …, 2022 - journals.sagepub.com

This article presents three studies that evaluate the effectiveness of instructional pictures that
visualize Heimlich maneuver thrusts. Firstly, a corpus study is used to describe a collection …

Save Cite Cited by 1 Related articles

[Free GPT-4]

[PDF] aclanthology.org

Visual-Textual Entailment with Quantities Using Model Checking and Knowledge Injection

N Iokawa, H Yanaka - Proceedings of the 2024 Joint International …, 2024 - aclanthology.org

In recent years, there has been great interest in multimodal inference. We concentrate on
visual-textual entailment (VTE), a critical task in multimodal inference. VTE is the task of …

Save Cite Related articles View as HTML

[Free GPT-4]

[PDF] aclanthology.org

What did this castle look like before? exploring referential relations in naturally occurring multimodal texts

R Utescher, S Zarrieß - Proceedings of the Third Workshop on …, 2021 - aclanthology.org

Multi-modal texts are abundant and diverse in structure, yet Language & Vision research of
these naturally occurring texts has mostly focused on genres that are comparatively light on …

Save Cite Cited by 2 Related articles All 2 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Combining lexical and spatial knowledge to predict spatial relations between objects in images

Multimodal logical inference system for visual-textual entailment

Functional distributional semantics: Learning linguistically informed representations from a precisely annotated corpus

Points, paths, and playscapes: Large-scale spatial language understanding tasks set in the real world

Learning to generate descriptions of visual data anchored in spatial relations

Transfer of isospace into a 3d environment for annotations and applications

Natural language semantics with pictures: Some language & vision datasets and potential uses for computational semantics

Natural language generation with computational intelligence [guest editorial]

The clarity and correctness of visualized thrust actions: a description and insights from users and experts

Visual-Textual Entailment with Quantities Using Model Checking and Knowledge Injection

What did this castle look like before? exploring referential relations in naturally occurring multimodal texts