Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey on vision transformer
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …
network mainly based on the self-attention mechanism. Thanks to its strong representation …
End-to-end human pose and mesh reconstruction with transformers
We present a new method, called MEsh TRansfOrmer (METRO), to reconstruct 3D human
pose and mesh vertices from a single image. Our method uses a transformer encoder to …
pose and mesh vertices from a single image. Our method uses a transformer encoder to …
Transvg: End-to-end visual grounding with transformers
In this paper, we present a neat yet effective transformer-based framework for visual
grounding, namely TransVG, to address the task of grounding a language query to the …
grounding, namely TransVG, to address the task of grounding a language query to the …
HiFT: Hierarchical feature transformer for aerial tracking
Most existing Siamese-based tracking methods execute the classification and regression of
the target object based on the similarity maps. However, they either employ a single map …
the target object based on the similarity maps. However, they either employ a single map …
Survey on depth and RGB image-based 3D hand shape and pose estimation
The field of vision-based human hand three-dimensional (3D) shape and pose estimation
has attracted significant attention recently owing to its key role in various applications, such …
has attracted significant attention recently owing to its key role in various applications, such …
Transpose: Keypoint localization via transformer
While CNN-based models have made remarkable progress on human pose estimation,
what spatial dependencies they capture to localize keypoints remains unclear. In this work …
what spatial dependencies they capture to localize keypoints remains unclear. In this work …
A survey on visual transformer
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …
network mainly based on the self-attention mechanism. Thanks to its strong representation …
E^ 2vpt: An effective and efficient approach for visual prompt tuning
As the size of transformer-based models continues to grow, fine-tuning these large-scale
pretrained vision models for new tasks has become increasingly parameter-intensive …
pretrained vision models for new tasks has become increasingly parameter-intensive …
Handoccnet: Occlusion-robust 3d hand mesh estimation network
Hands are often severely occluded by objects, which makes 3D hand mesh estimation
challenging. Previous works often have disregarded information at occluded regions …
challenging. Previous works often have disregarded information at occluded regions …
Ptq4vit: Post-training quantization for vision transformers with twin uniform quantization
Quantization is one of the most effective methods to compress neural networks, which has
achieved great success on convolutional neural networks (CNNs). Recently, vision …
achieved great success on convolutional neural networks (CNNs). Recently, vision …