Advances in medical image analysis with vision transformers: a comprehensive review

R Azad, A Kazerouni, M Heidari, EK Aghdam… - Medical Image …, 2024 - Elsevier
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …

A comprehensive survey on deep graph representation learning

W Ju, Z Fang, Y Gu, Z Liu, Q Long, Z Qiao, Y Qin… - Neural Networks, 2024 - Elsevier
Graph representation learning aims to effectively encode high-dimensional sparse graph-
structured data into low-dimensional dense vectors, which is a fundamental task that has …

Semi-supervised semantic segmentation using unreliable pseudo-labels

Y Wang, H Wang, Y Shen, J Fei, W Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
The crux of semi-supervised semantic segmentation is to assign pseudo-labels to the pixels
of unlabeled images. A common practice is to select the highly confident predictions as the …

Large language models can self-improve

J Huang, SS Gu, L Hou, Y Wu, X Wang, H Yu… - arxiv preprint arxiv …, 2022 - arxiv.org
Large Language Models (LLMs) have achieved excellent performances in various tasks.
However, fine-tuning an LLM requires extensive supervision. Human, on the other hand …

Extract free dense labels from clip

C Zhou, CC Loy, B Dai - European Conference on Computer Vision, 2022 - Springer
Abstract Contrastive Language-Image Pre-training (CLIP) has made a remarkable
breakthrough in open-vocabulary zero-shot image recognition. Many recent studies …

Revisiting weak-to-strong consistency in semi-supervised semantic segmentation

L Yang, L Qi, L Feng, W Zhang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In this work, we revisit the weak-to-strong consistency framework, popularized by FixMatch
from semi-supervised classification, where the prediction of a weakly perturbed image …

Zegclip: Towards adapting clip for zero-shot semantic segmentation

Z Zhou, Y Lei, B Zhang, L Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Recently, CLIP has been applied to pixel-level zero-shot learning tasks via a wo-stage
scheme. The general idea is to first generate class-agnostic region proposals and then feed …

St++: Make self-training work better for semi-supervised semantic segmentation

L Yang, W Zhuo, L Qi, Y Shi… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Self-training via pseudo labeling is a conventional, simple, and popular pipeline to leverage
unlabeled data. In this work, we first construct a strong baseline of self-training (namely ST) …

Perturbed and strict mean teachers for semi-supervised semantic segmentation

Y Liu, Y Tian, Y Chen, F Liu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Consistency learning using input image, feature, or network perturbations has shown
remarkable results in semi-supervised semantic segmentation, but this approach can be …

Dense distinct query for end-to-end object detection

S Zhang, X Wang, J Wang, J Pang… - Proceedings of the …, 2023 - openaccess.thecvf.com
One-to-one label assignment in object detection has successfully obviated the need of non-
maximum suppression (NMS) as a postprocessing and makes the pipeline end-to-end …