Domain generalization: A survey

K Zhou, Z Liu, Y Qiao, T **ang… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Generalization to out-of-distribution (OOD) data is a capability natural to humans yet
challenging for machines to reproduce. This is because most learning algorithms strongly …

Graph convolutional networks: a comprehensive review

S Zhang, H Tong, J Xu, R Maciejewski - Computational Social Networks, 2019 - Springer
Graphs naturally appear in numerous application domains, ranging from social analysis,
bioinformatics to computer vision. The unique capability of graphs enables capturing the …

Scaling up gans for text-to-image synthesis

M Kang, JY Zhu, R Zhang, J Park… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent success of text-to-image synthesis has taken the world by storm and captured the
general public's imagination. From a technical standpoint, it also marked a drastic change in …

Learning to upsample by learning to sample

W Liu, H Lu, H Fu, Z Cao - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
We present DySample, an ultra-lightweight and effective dynamic upsampler. While
impressive performance gains have been witnessed from recent kernel-based dynamic …

Hornet: Efficient high-order spatial interactions with recursive gated convolutions

Y Rao, W Zhao, Y Tang, J Zhou… - Advances in Neural …, 2022 - proceedings.neurips.cc
Recent progress in vision Transformers exhibits great success in various tasks driven by the
new spatial modeling mechanism based on dot-product self-attention. In this paper, we …

Sparsebev: High-performance sparse 3d object detection from multi-camera videos

H Liu, Y Teng, T Lu, H Wang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Camera-based 3D object detection in BEV (Bird's Eye View) space has drawn great
attention over the past few years. Dense detectors typically follow a two-stage pipeline by …

Simvp: Simpler yet better video prediction

Z Gao, C Tan, L Wu, SZ Li - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Abstract From CNN, RNN, to ViT, we have witnessed remarkable advancements in video
prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated …

Mask3d: Mask transformer for 3d semantic instance segmentation

J Schult, F Engelmann, A Hermans… - … on Robotics and …, 2023 - ieeexplore.ieee.org
Modern 3D semantic instance segmentation approaches predominantly rely on specialized
voting mechanisms followed by carefully designed geometric clustering techniques. Building …

Adaptive rotated convolution for rotated object detection

Y Pu, Y Wang, Z **a, Y Han, Y Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Rotated object detection aims to identify and locate objects in images with arbitrary
orientation. In this scenario, the oriented directions of objects vary considerably across …

Graph neural networks: foundation, frontiers and applications

L Wu, P Cui, J Pei, L Zhao, X Guo - … of the 28th ACM SIGKDD conference …, 2022 - dl.acm.org
The field of graph neural networks (GNNs) has seen rapid and incredible strides over the
recent years. Graph neural networks, also known as deep learning on graphs, graph …