Knowledge distillation and student-teacher learning for visual intelligence: A review and new outlooks

L Wang, KJ Yoon - IEEE transactions on pattern analysis and …, 2021 - ieeexplore.ieee.org
Deep neural models, in recent years, have been successful in almost every field, even
solving the most complex problem statements. However, these models are huge in size with …

2dpass: 2d priors assisted semantic segmentation on lidar point clouds

X Yan, J Gao, C Zheng, C Zheng, R Zhang… - European conference on …, 2022 - Springer
As camera and LiDAR sensors capture complementary information in autonomous driving,
great efforts have been made to conduct semantic segmentation through multi-modality data …

Knowledge distillation: A survey

J Gou, B Yu, SJ Maybank, D Tao - International Journal of Computer Vision, 2021 - Springer
In recent years, deep neural networks have been successful in both industry and academia,
especially for computer vision tasks. The great success of deep learning is mainly due to its …

Fsdr: Frequency space domain randomization for domain generalization

J Huang, D Guan, A **ao, S Lu - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Abstract Domain generalization aims to learn a generalizable model from aknown'source
domain for variousunknown'target domains. It has been studied widely by domain …

Learning to learn single domain generalization

F Qiao, L Zhao, X Peng - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
We are concerned with a worst-case scenario in model generalization, in the sense that a
model aims to perform well on many unseen domains while there is only one single domain …

Parser-free virtual try-on via distilling appearance flows

Y Ge, Y Song, R Zhang, C Ge… - Proceedings of the …, 2021 - openaccess.thecvf.com
Image virtual try-on aims to fit a garment image (target clothes) to a person image. Prior
methods are heavily based on human parsing. However, slightly-wrong segmentation …

Smil: Multimodal learning with severely missing modality

M Ma, J Ren, L Zhao, S Tulyakov, C Wu… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
A common assumption in multimodal learning is the completeness of training data, ie, full
modalities are available in all training examples. Although there exists research endeavor in …

Sfd2: Semantic-guided feature detection and description

F Xue, I Budvytis, R Cipolla - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Visual localization is a fundamental task for various applications including autonomous
driving and robotics. Prior methods focus on extracting large amounts of often redundant …

Maximum-entropy adversarial data augmentation for improved generalization and robustness

L Zhao, T Liu, X Peng… - Advances in Neural …, 2020 - proceedings.neurips.cc
Adversarial data augmentation has shown promise for training robust deep neural networks
against unforeseen data shifts or corruptions. However, it is difficult to define heuristics to …

Liga-stereo: Learning lidar geometry aware representations for stereo-based 3d detector

X Guo, S Shi, X Wang, H Li - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Stereo-based 3D detection aims at detecting 3D object bounding boxes from stereo images
using intermediate depth maps or implicit 3D geometry representations, which provides a …