Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
CNN architectures for geometric transformation-invariant feature representation in computer vision: a review
A Mumuni, F Mumuni - SN Computer Science, 2021 - Springer
One of the main challenges in machine vision relates to the problem of obtaining robust
representation of visual features that remain unaffected by geometric transformations. This …
representation of visual features that remain unaffected by geometric transformations. This …
Deep high-resolution representation learning for visual recognition
High-resolution representations are essential for position-sensitive vision problems, such as
human pose estimation, semantic segmentation, and object detection. Existing state-of-the …
human pose estimation, semantic segmentation, and object detection. Existing state-of-the …
Deep high-resolution representation learning for human pose estimation
In this paper, we are interested in the human pose estimation problem with a focus on
learning reliable high-resolution representations. Most existing methods recover high …
learning reliable high-resolution representations. Most existing methods recover high …
On translation invariance in cnns: Convolutional layers can exploit absolute spatial location
In this paper we challenge the common assumption that convolutional layers in modern
CNNs are translation invariant. We show that CNNs can and will exploit the absolute spatial …
CNNs are translation invariant. We show that CNNs can and will exploit the absolute spatial …
Why do deep convolutional networks generalize so poorly to small image transformations?
Abstract Convolutional Neural Networks (CNNs) are commonly assumed to be invariant to
small image transformations: either because of the convolutional architecture or because …
small image transformations: either because of the convolutional architecture or because …
Text2light: Zero-shot text-driven hdr panorama generation
High-quality HDRIs (High Dynamic Range Images), typically HDR panoramas, are one of
the most popular ways to create photorealistic lighting and 360-degree reflections of 3D …
the most popular ways to create photorealistic lighting and 360-degree reflections of 3D …
Chaos is a ladder: A new theoretical understanding of contrastive learning via augmentation overlap
Recently, contrastive learning has risen to be a promising approach for large-scale self-
supervised learning. However, theoretical understanding of how it works is still unclear. In …
supervised learning. However, theoretical understanding of how it works is still unclear. In …
Deviant: Depth equivariant network for monocular 3d object detection
Modern neural networks use building blocks such as convolutions that are equivariant to
arbitrary 2 D translations. However, these vanilla blocks are not equivariant to arbitrary 3 D …
arbitrary 2 D translations. However, these vanilla blocks are not equivariant to arbitrary 3 D …
Scale-aware fast R-CNN for pedestrian detection
In this paper, we consider the problem of pedestrian detection in natural scenes. Intuitively,
instances of pedestrians with different spatial scales may exhibit dramatically different …
instances of pedestrians with different spatial scales may exhibit dramatically different …
Why self-attention? a targeted evaluation of neural machine translation architectures
Recently, non-recurrent architectures (convolutional, self-attentional) have outperformed
RNNs in neural machine translation. CNNs and self-attentional networks can connect distant …
RNNs in neural machine translation. CNNs and self-attentional networks can connect distant …