[HTML][HTML] Deep learning for urban land use category classification: A review and experimental assessment

Z Li, B Chen, S Wu, M Su, JM Chen, B Xu - Remote Sensing of …, 2024 - Elsevier
Map** the distribution, pattern, and composition of urban land use categories plays a
valuable role in understanding urban environmental dynamics and facilitating sustainable …

[HTML][HTML] Generative AI for visualization: State of the art and future directions

Y Ye, J Hao, Y Hou, Z Wang, S **ao, Y Luo, W Zeng - Visual Informatics, 2024 - Elsevier
Generative AI (GenAI) has witnessed remarkable progress in recent years and
demonstrated impressive performance in various generation tasks in different domains such …

Metaformer baselines for vision

W Yu, C Si, P Zhou, M Luo, Y Zhou… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
MetaFormer, the abstracted architecture of Transformer, has been found to play a significant
role in achieving competitive performance. In this paper, we further explore the capacity of …

Dynamic graph learning with content-guided spatial-frequency relation reasoning for deepfake detection

Y Wang, K Yu, C Chen, X Hu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
With the springing up of face synthesis techniques, it is prominent in need to develop
powerful face forgery detection methods due to security concerns. Some existing methods …

Clusterfomer: clustering as a universal visual learner

J Liang, Y Cui, Q Wang, T Geng… - Advances in neural …, 2023 - proceedings.neurips.cc
This paper presents ClusterFormer, a universal vision model that is based on the Clustering
paradigm with TransFormer. It comprises two novel designs: 1) recurrent cross-attention …

A generalization of vit/mlp-mixer to graphs

X He, B Hooi, T Laurent, A Perold… - International …, 2023 - proceedings.mlr.press
Abstract Graph Neural Networks (GNNs) have shown great potential in the field of graph
representation learning. Standard GNNs define a local message-passing mechanism which …

G-cascade: Efficient cascaded graph convolutional decoding for 2d medical image segmentation

MM Rahman, R Marculescu - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
In this paper, we are the first to propose a new graph convolution-based decoder namely,
Cascaded Graph Convolutional Attention Decoder (G-CASCADE), for 2D medical image …

Image processing gnn: Breaking rigidity in super-resolution

Y Tian, H Chen, C Xu, Y Wang - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Super-Resolution (SR) reconstructs high-resolution images from low-resolution ones. CNNs
and window-attention methods are two major categories of canonical SR models. However …

A survey on graph neural networks and graph transformers in computer vision: A task-oriented perspective

C Chen, Y Wu, Q Dai, HY Zhou, M Xu… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Graph Neural Networks (GNNs) have gained momentum in graph representation learning
and boosted the state of the art in a variety of areas, such as data mining (eg, social network …

Vision hgnn: An image is more than a graph of nodes

Y Han, P Wang, S Kundu, Y Ding… - Proceedings of the …, 2023 - openaccess.thecvf.com
The realm of graph-based modeling has proven its adaptability across diverse real-world
data types. However, its applicability to general computer vision tasks had been limited until …