Cafe: Learning to condense dataset by aligning features

K Wang, B Zhao, X Peng, Z Zhu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Dataset condensation aims at reducing the network training effort through condensing a
cumbersome training set into a compact synthetic one. State-of-the-art approaches largely …

Dataset pruning: Reducing training data by examining generalization influence

S Yang, Z **e, H Peng, M Xu, M Sun, P Li - arxiv preprint arxiv …, 2022 - arxiv.org
The great success of deep learning heavily relies on increasingly larger training data, which
comes at a price of huge computational and infrastructural costs. This poses crucial …

Gsd: View-guided gaussian splatting diffusion for 3d reconstruction

Y Mu, X Zuo, C Guo, Y Wang, J Lu, X Wu, S Xu… - … on Computer Vision, 2024 - Springer
We present GSD, a diffusion model approach based on Gaussian Splatting (GS)
representation for 3D object reconstruction from a single view. Prior works suffer from …

Tmvnet: Using transformers for multi-view voxel-based 3d reconstruction

K Peng, R Islam, J Quarles… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Previous research in multi-view 3D reconstruction had used different convolution neural
network (CNN) architectures to obtain a 3D voxel representation. Even though CNN works …

Objects in semantic topology

S Yang, P Sun, Y Jiang, X **a, R Zhang, Z Yuan… - arxiv preprint arxiv …, 2021 - arxiv.org
A more realistic object detection paradigm, Open-World Object Detection, has arisen
increasing research interests in the community recently. A qualified open-world object …

Bridging the gap between few-shot and many-shot learning via distribution calibration

S Yang, S Wu, T Liu, M Xu - IEEE Transactions on Pattern …, 2021 - ieeexplore.ieee.org
A major gap between few-shot and many-shot learning is the data distribution empirically
oserved by the model during training. In few-shot learning, the learned model can easily …

Semi-supervised single-view 3d reconstruction via prototype shape priors

Z **ng, H Li, Z Wu, YG Jiang - European Conference on Computer Vision, 2022 - Springer
The performance of existing single-view 3D reconstruction methods heavily relies on large-
scale 3D annotations. However, such annotations are tedious and expensive to collect …

Umiformer: Mining the correlations between similar tokens for multi-view 3d reconstruction

Z Zhu, L Yang, N Li, C Jiang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In recent years, many video tasks have achieved breakthroughs by utilizing the vision
transformer and establishing spatial-temporal decoupling for feature extraction. Although …

Garnet: Global-aware multi-view 3d reconstruction network and the cost-performance tradeoff

Z Zhu, L Yang, X Lin, L Yang, Y Liang - Pattern Recognition, 2023 - Elsevier
Deep learning technology has made great progress in multi-view 3D reconstruction tasks. At
present, the mainstream solutions adopt different ways to fusion the features from several …

CPG3D: Cross-modal priors guided 3D object reconstruction

W Nie, C Jiao, R Chang, L Qu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Three-dimensional reconstruction is a multimedia technology widely used in computer-
aided modeling and 3D animation. Nevertheless, it is still hard for reconstruction methods to …