Human action recognition: A taxonomy-based survey, updates, and opportunities
Human action recognition systems use data collected from a wide range of sensors to
accurately identify and interpret human actions. One of the most challenging issues for …
accurately identify and interpret human actions. One of the most challenging issues for …
A survey of human action recognition and posture prediction
Human action recognition and posture prediction aim to recognize and predict respectively
the action and postures of persons in videos. They are both active research topics in …
the action and postures of persons in videos. They are both active research topics in …
Skilful nowcasting of extreme precipitation with NowcastNet
Extreme precipitation is a considerable contributor to meteorological disasters and there is a
great need to mitigate its socioeconomic effects through skilful nowcasting that has high …
great need to mitigate its socioeconomic effects through skilful nowcasting that has high …
Simvp: Simpler yet better video prediction
Abstract From CNN, RNN, to ViT, we have witnessed remarkable advancements in video
prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated …
prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated …
Earthformer: Exploring space-time transformers for earth system forecasting
Conventionally, Earth system (eg, weather and climate) forecasting relies on numerical
simulation with complex physical models and hence is both expensive in computation and …
simulation with complex physical models and hence is both expensive in computation and …
Temporal attention unit: Towards efficient spatiotemporal predictive learning
Spatiotemporal predictive learning aims to generate future frames by learning from historical
frames. In this paper, we investigate existing methods and present a general framework of …
frames. In this paper, we investigate existing methods and present a general framework of …
Openstl: A comprehensive benchmark of spatio-temporal predictive learning
Spatio-temporal predictive learning is a learning paradigm that enables models to learn
spatial and temporal patterns by predicting future frames from given past frames in an …
spatial and temporal patterns by predicting future frames from given past frames in an …
Extdm: Distribution extrapolation diffusion model for video prediction
Video prediction is a challenging task due to its nature of uncertainty especially for
forecasting a long period. To model the temporal dynamics advanced methods benefit from …
forecasting a long period. To model the temporal dynamics advanced methods benefit from …
Simvp: Towards simple yet powerful spatiotemporal predictive learning
Recent years have witnessed remarkable advances in spatiotemporal predictive learning,
incorporating auxiliary inputs, elaborate neural architectures, and sophisticated training …
incorporating auxiliary inputs, elaborate neural architectures, and sophisticated training …
Mmvp: Motion-matrix-based video prediction
A central challenge of video prediction lies where the system has to reason the object's
future motion from image frames while simultaneously maintaining the consistency of its …
future motion from image frames while simultaneously maintaining the consistency of its …