Transformers as support vector machines

DA Tarzanagh, Y Li, C Thrampoulidis… - arxiv preprint arxiv …, 2023 - arxiv.org
Since its inception in" Attention Is All You Need", transformer architecture has led to
revolutionary advancements in NLP. The attention layer within the transformer admits a …

Primal-attention: Self-attention through asymmetric kernel svd in primal representation

Y Chen, Q Tao, F Tonin… - Advances in Neural …, 2023 - proceedings.neurips.cc
Recently, a new line of works has emerged to understand and improve self-attention in
Transformers by treating it as a kernel machine. However, existing works apply the methods …

SURE: SUrvey REcipes for building reliable and robust deep networks

Y Li, Y Chen, X Yu, D Chen… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
In this paper we revisit techniques for uncertainty estimation within deep neural networks
and consolidate a suite of techniques to enhance their reliability. Our investigation reveals …

Decoding class dynamics in learning with noisy labels

A Tatjer, B Nagarajan, R Marques, P Radeva - Pattern Recognition Letters, 2024 - Elsevier
The creation of large-scale datasets annotated by humans inevitably introduces noisy
labels, leading to reduced generalization in deep-learning models. Sample selection-based …

GANzzle++: Generative approaches for jigsaw puzzle solving as local to global assignment in latent spatial representations

D Talon, A Del Bue, S James - Pattern Recognition Letters, 2025 - Elsevier
Jigsaw puzzles are a popular and enjoyable pastime that humans can easily solve, even
with many pieces. However, solving a jigsaw is a combinatorial problem, and the space of …

MS-DINO: Masked self-supervised distributed learning using vision transformer

S Park, IJ Lee, JW Kim, JC Ye - IEEE Journal of Biomedical and …, 2024 - ieeexplore.ieee.org
Despite promising advancements in deep learning in medical domains, challenges still
remain owing to data scarcity, compounded by privacy concerns and data ownership …

Computational techniques for virtual reconstruction of fragmented archaeological textiles

D Gigilashvili, H Lukesova, CF Gulbrandsen, A Harijan… - 2023 - nature.com
Archaeological artifacts play important role in understanding the past developments of the
humanity. However, the artifacts are often highly fragmented and degraded, with many …

Coarse is better? a new pipeline towards self-supervised learning with uncurated images

K Zhu, YY He, J Wu - Pattern Recognition, 2025 - Elsevier
Most self-supervised learning (SSL) methods often work on curated datasets where the
object-centric assumption holds. This assumption breaks down in uncurated images …

ViTs as backbones: Leveraging vision transformers for feature extraction

O Elharrouss, Y Himeur, Y Mahmood, S Alrabaee… - Information …, 2025 - Elsevier
Abstract The emergence of Vision Transformers (ViTs) has marked a significant shift in the
field of computer vision, presenting new methodologies that challenge traditional …

Robust semi-supervised learning with multi-consistency and data augmentation

JM Guo, CC Sun, KY Chan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In this paper, we address the problem of noisy datasets by proposing a dual screening
scheme to improve the performance of models trained on two public noisy datasets …