AiATrack: Attention in attention for transformer visual tracking

S Gao, C Zhou, C Ma, X Wang, J Yuan - European Conference on …, 2022 - Springer
Transformer trackers have achieved impressive advancements recently, where the attention
mechanism plays an important role. However, the independent correlation computation in …

Advances of machine learning in materials science: Ideas and techniques

SS Chong, YS Ng, HQ Wang, JC Zheng - Frontiers of Physics, 2024 - Springer
In this big data era, the use of large datasets in conjunction with machine learning (ML) has
been increasingly popular in both industry and academia. In recent times, the field of …

TubeDETR: Spatio-temporal video grounding with transformers

A Yang, A Miech, J Sivic, I Laptev… - Proceedings of the …, 2022 - openaccess.thecvf.com
We consider the problem of localizing a spatio-temporal tube in a video corresponding to a
given text query. This is a challenging task that requires the joint and efficient modeling of …

TransVOD: end-to-end video object detection with spatial-temporal transformers

Q Zhou, X Li, L He, Y Yang, G Cheng… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the
need for many hand-designed components in object detection while demonstrating good …

ISTVT: interpretable spatial-temporal video transformer for deepfake detection

C Zhao, C Wang, G Hu, H Chen, C Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the rapid development of Deepfake synthesis technology, our information security and
personal privacy have been severely threatened in recent years. To achieve a robust …

Object detection in traffic videos: A survey

H Ghahremannezhad, H Shi… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Traffic video analytics has become one of the core components in the evolution of
transportation systems. Artificially intelligent traffic management systems apply computer …

SODFormer: Streaming object detection with transformer using events and frames

D Li, Y Tian, J Li - IEEE Transactions on Pattern Analysis and …, 2023 - ieeexplore.ieee.org
DAVIS camera, streaming two complementary sensing modalities of asynchronous events
and frames, has gradually been used to address major object detection challenges (e.g., fast …

READ: Large-scale neural scene rendering for autonomous driving

Z Li, L Li, J Zhu - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
With the development of advanced driver assistance systems (ADAS) and autonomous
vehicles, conducting experiments in various scenarios becomes an urgent need. Although …

YOLOV: Making still image object detectors great at video object detection

Y Shi, N Wang, X Guo - Proceedings of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Video object detection (VID) is challenging because of the high variation of object
appearance as well as the diverse deterioration in some frames. On the positive side, the …

Learning complementary spatial–temporal transformer for video salient object detection

N Liu, K Nan, W Zhao, X Yao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Besides combining appearance and motion information, another crucial factor for video
salient object detection (VSOD) is to mine spatial–temporal (ST) knowledge, including …