A review of convolutional neural networks in computer vision

X Zhao, L Wang, Y Zhang, X Han, M Deveci… - Artificial Intelligence …, 2024 - Springer
In computer vision, a series of exemplary advances have been made in several areas
involving image classification, semantic segmentation, object detection, and image super …

A review of multimodal human activity recognition with special emphasis on classification, applications, challenges and future directions

SK Yadav, K Tiwari, HM Pandey, SA Akbar - Knowledge-Based Systems, 2021 - Elsevier
Human activity recognition (HAR) is one of the most important and challenging problems in
the computer vision. It has critical application in wide variety of tasks including gaming …

Simvp: Simpler yet better video prediction

Z Gao, C Tan, L Wu, SZ Li - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Abstract From CNN, RNN, to ViT, we have witnessed remarkable advancements in video
prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated …

Earthformer: Exploring space-time transformers for earth system forecasting

Z Gao, X Shi, H Wang, Y Zhu… - Advances in …, 2022 - proceedings.neurips.cc
Conventionally, Earth system (eg, weather and climate) forecasting relies on numerical
simulation with complex physical models and hence is both expensive in computation and …

Predrnn: A recurrent neural network for spatiotemporal predictive learning

Y Wang, H Wu, J Zhang, Z Gao, J Wang… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
The predictive learning of spatiotemporal sequences aims to generate future images by
learning from the historical context, where the visual dynamics are believed to have modular …

Openstl: A comprehensive benchmark of spatio-temporal predictive learning

C Tan, S Li, Z Gao, W Guan, Z Wang… - Advances in …, 2023 - proceedings.neurips.cc
Spatio-temporal predictive learning is a learning paradigm that enables models to learn
spatial and temporal patterns by predicting future frames from given past frames in an …

Temporal attention unit: Towards efficient spatiotemporal predictive learning

C Tan, Z Gao, L Wu, Y Xu, J **a… - Proceedings of the …, 2023 - openaccess.thecvf.com
Spatiotemporal predictive learning aims to generate future frames by learning from historical
frames. In this paper, we investigate existing methods and present a general framework of …

Swinlstm: Improving spatiotemporal prediction accuracy using swin transformer and lstm

S Tang, C Li, P Zhang, RN Tang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Integrating CNNs and RNNs to capture spatiotemporal dependencies is a prevalent
strategy for spatiotemporal prediction tasks. However, the property of CNNs to learn local …

Attention, please! A survey of neural attention models in deep learning

A de Santana Correia, EL Colombini - Artificial Intelligence Review, 2022 - Springer
In humans, Attention is a core property of all perceptual and cognitive operations. Given our
limited ability to process competing sources, attention mechanisms select, modulate, and …

Vmrnn: Integrating vision mamba and lstm for efficient and accurate spatiotemporal forecasting

Y Tang, P Dong, Z Tang, X Chu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Combining Convolutional Neural Networks (CNNs) or Vision Transformers (ViTs)
with Recurrent Neural Networks (RNNs) for spatiotemporal forecasting has yielded …