Object detection in traffic videos: A survey

H Ghahremannezhad, H Shi… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Traffic video analytics has become one of the core components in the evolution of
transportation systems. Artificially intelligent traffic management systems apply computer …

Advances of machine learning in materials science: Ideas and techniques

SS Chong, YS Ng, HQ Wang, JC Zheng - Frontiers of Physics, 2024 - Springer
In this big data era, the use of large dataset in conjunction with machine learning (ML) has
been increasingly popular in both industry and academia. In recent times, the field of …

Aiatrack: Attention in attention for transformer visual tracking

S Gao, C Zhou, C Ma, X Wang, J Yuan - European conference on …, 2022 - Springer
Transformer trackers have achieved impressive advancements recently, where the attention
mechanism plays an important role. However, the independent correlation computation in …

ISTVT: interpretable spatial-temporal video transformer for deepfake detection

C Zhao, C Wang, G Hu, H Chen, C Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the rapid development of Deepfake synthesis technology, our information security and
personal privacy have been severely threatened in recent years. To achieve a robust …

TransVOD: End-to-end video object detection with spatial-temporal transformers

Q Zhou, X Li, L He, Y Yang, G Cheng… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the
need for many hand-designed components in object detection while demonstrating good …

Tubedetr: Spatio-temporal video grounding with transformers

A Yang, A Miech, J Sivic, I Laptev… - Proceedings of the …, 2022 - openaccess.thecvf.com
We consider the problem of localizing a spatio-temporal tube in a video corresponding to a
given text query. This is a challenging task that requires the joint and efficient modeling of …

Ba-sam: Scalable bias-mode attention mask for segment anything model

Y Song, Q Zhou, X Li, DP Fan… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this paper we address the challenge of image resolution variation for the Segment
Anything Model (SAM). SAM known for its zero-shot generalizability exhibits a performance …

YOLOV: Making still image object detectors great at video object detection

Y Shi, N Wang, X Guo - Proceedings of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Video object detection (VID) is challenging because of the high variation of object
appearance as well as the diverse deterioration in some frames. On the positive side, the …

Read: Large-scale neural scene rendering for autonomous driving

Z Li, L Li, J Zhu - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
With the development of advanced driver assistance systems~(ADAS) and autonomous
vehicles, conducting experiments in various scenarios becomes an urgent need. Although …

Sodformer: Streaming object detection with transformer using events and frames

D Li, Y Tian, J Li - IEEE Transactions on Pattern Analysis and …, 2023 - ieeexplore.ieee.org
DAVIS camera, streaming two complementary sensing modalities of asynchronous events
and frames, has gradually been used to address major object detection challenges (eg, fast …