Aiatrack: Attention in attention for transformer visual tracking
Transformer trackers have achieved impressive advancements recently, where the attention
mechanism plays an important role. However, the independent correlation computation in …
mechanism plays an important role. However, the independent correlation computation in …
Advances of machine learning in materials science: Ideas and techniques
In this big data era, the use of large dataset in conjunction with machine learning (ML) has
been increasingly popular in both industry and academia. In recent times, the field of …
been increasingly popular in both industry and academia. In recent times, the field of …
Tubedetr: Spatio-temporal video grounding with transformers
We consider the problem of localizing a spatio-temporal tube in a video corresponding to a
given text query. This is a challenging task that requires the joint and efficient modeling of …
given text query. This is a challenging task that requires the joint and efficient modeling of …
TransVOD: end-to-end video object detection with spatial-temporal transformers
Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the
need for many hand-designed components in object detection while demonstrating good …
need for many hand-designed components in object detection while demonstrating good …
ISTVT: interpretable spatial-temporal video transformer for deepfake detection
With the rapid development of Deepfake synthesis technology, our information security and
personal privacy have been severely threatened in recent years. To achieve a robust …
personal privacy have been severely threatened in recent years. To achieve a robust …
Object detection in traffic videos: A survey
Traffic video analytics has become one of the core components in the evolution of
transportation systems. Artificially intelligent traffic management systems apply computer …
transportation systems. Artificially intelligent traffic management systems apply computer …
Sodformer: Streaming object detection with transformer using events and frames
DAVIS camera, streaming two complementary sensing modalities of asynchronous events
and frames, has gradually been used to address major object detection challenges (eg, fast …
and frames, has gradually been used to address major object detection challenges (eg, fast …
Read: Large-scale neural scene rendering for autonomous driving
With the development of advanced driver assistance systems~(ADAS) and autonomous
vehicles, conducting experiments in various scenarios becomes an urgent need. Although …
vehicles, conducting experiments in various scenarios becomes an urgent need. Although …
Yolov: Making still image object detectors great at video object detection
Video object detection (VID) is challenging because of the high variation of object
appearance as well as the diverse deterioration in some frames. On the positive side, the …
appearance as well as the diverse deterioration in some frames. On the positive side, the …
Learning complementary spatial–temporal transformer for video salient object detection
Besides combining appearance and motion information, another crucial factor for video
salient object detection (VSOD) is to mine spatial–temporal (ST) knowledge, including …
salient object detection (VSOD) is to mine spatial–temporal (ST) knowledge, including …