Capsule networks–a survey

M Kwabena Patrick, A Felix Adekoya… - Journal of King Saud …, 2019 - Elsevier
Modern-day computer vision tasks require efficient solutions to problems such as image
recognition, natural language processing, object detection, object segmentation and …

Capsule networks for image classification: A review

SJ Pawan, J Rajan - Neurocomputing, 2022 - Elsevier
Over the past few years, the computer vision domain has evolved and made a revolutionary
transition from human-engineered features to automated features to address challenging …

Video action transformer network

R Girdhar, J Carreira, C Doersch… - Proceedings of the …, 2019 - openaccess.thecvf.com
We introduce the Action Transformer model for recognizing and localizing human
actions in video clips. We repurpose a Transformer-style architecture to aggregate features …

Efficient-CapsNet: Capsule network with self-attention routing

V Mazzia, F Salvetti, M Chiaberge - Scientific reports, 2021 - nature.com
Deep convolutional neural networks, assisted by architectural design strategies, make
extensive use of data augmentation techniques and layers with a high number of feature …

3D point capsule networks

Y Zhao, T Birdal, H Deng… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
In this paper, we propose 3D point-capsule networks, an auto-encoder designed to process
sparse 3D point clouds while preserving spatial arrangements of the input data. 3D capsule …

A survey on the new generation of deep learning in image processing

L Jiao, J Zhao - IEEE Access, 2019 - ieeexplore.ieee.org
During the past decade, deep learning has been one of the essential breakthroughs made in
artificial intelligence. In particular, it has achieved great success in image processing …

Part-object relational visual saliency

Y Liu, D Zhang, Q Zhang, J Han - IEEE transactions on pattern …, 2021 - ieeexplore.ieee.org
Recent years have witnessed a big leap in automatic visual saliency detection attributed to
advances in deep learning, especially Convolutional Neural Networks (CNNs). However …

You only watch once: A unified CNN architecture for real-time spatiotemporal action localization

O Köpüklü, X Wei, G Rigoll - arXiv preprint arXiv:1911.06644, 2019 - arxiv.org
Spatiotemporal action localization requires the incorporation of two sources of information
into the designed architecture: (1) temporal information from the previous frames and (2) …

Cross-media structured common space for multimedia event extraction

M Li, A Zareian, Q Zeng, S Whitehead, D Lu… - arXiv preprint arXiv …, 2020 - arxiv.org
We introduce a new task, MultiMedia Event Extraction (M2E2), which aims to extract events
and their arguments from multimedia documents. We develop the first benchmark and …

Capsules for biomedical image segmentation

R LaLonde, Z Xu, I Irmakci, S Jain, U Bagci - Medical image analysis, 2021 - Elsevier
Our work expands the use of capsule networks to the task of object segmentation for the first
time in the literature. This is made possible via the introduction of locally-constrained routing …