Physics-informed computer vision: A review and perspectives

C Banerjee, K Nguyen, C Fookes, K George - ACM Computing Surveys, 2024 - dl.acm.org
The incorporation of physical information in machine learning frameworks is opening and
transforming many application domains. Here the learning process is augmented through …

Neuro-symbolic artificial intelligence: a survey

BP Bhuyan, A Ramdane-Cherif, R Tomar… - Neural Computing and …, 2024 - Springer
The goal of the growing discipline of neuro-symbolic artificial intelligence (AI) is to develop
AI systems with more human-like reasoning capabilities by combining symbolic reasoning …

Conditional object-centric learning from video

T Kipf, GF Elsayed, A Mahendran, A Stone… - arxiv preprint arxiv …, 2021 - arxiv.org
Object-centric representations are a promising path toward more systematic generalization
by providing flexible abstractions upon which compositional world models can be built …

Star: A benchmark for situated reasoning in real-world videos

B Wu, S Yu, Z Chen, JB Tenenbaum, C Gan - arxiv preprint arxiv …, 2024 - arxiv.org
Reasoning in the real world is not divorced from situations. How to capture the present
knowledge from surrounding situations and perform reasoning accordingly is crucial and …

Slotformer: Unsupervised visual dynamics simulation with object-centric models

Z Wu, N Dvornik, K Greff, T Kipf, A Garg - arxiv preprint arxiv:2210.05861, 2022 - arxiv.org
Understanding dynamics from visual observations is a challenging problem that requires
disentangling individual objects from the scene and learning their interactions. While recent …

A survey of reasoning with foundation models

J Sun, C Zheng, E **e, Z Liu, R Chu, J Qiu, J Xu… - arxiv preprint arxiv …, 2023 - arxiv.org
Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various real-
world settings such as negotiation, medical diagnosis, and criminal investigation. It serves …

Intentqa: Context-aware video intent reasoning

J Li, P Wei, W Han, L Fan - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
In this paper, we propose a novel task IntentQA, a special VideoQA task focusing on video
intent reasoning, which has become increasingly important for AI with its advantages in …

Video question answering: Datasets, algorithms and challenges

Y Zhong, J **ao, W Ji, Y Li, W Deng… - arxiv preprint arxiv …, 2022 - arxiv.org
Video Question Answering (VideoQA) aims to answer natural language questions according
to the given videos. It has earned increasing attention with recent research trends in joint …

A framework for the general design and computation of hybrid neural networks

R Zhao, Z Yang, H Zheng, Y Wu, F Liu, Z Wu… - Nature …, 2022 - nature.com
There is a growing trend to design hybrid neural networks (HNNs) by combining spiking
neural networks and artificial neural networks to leverage the strengths of both. Here, we …

Coupling large language models with logic programming for robust and general reasoning from text

Z Yang, A Ishay, J Lee - arxiv preprint arxiv:2307.07696, 2023 - arxiv.org
While large language models (LLMs), such as GPT-3, appear to be robust and general, their
reasoning ability is not at a level to compete with the best models trained for specific natural …