Review of multi-view 3D object recognition methods based on deep learning

S Qi, X Ning, G Yang, L Zhang, P Long, W Cai, W Li - Displays, 2021 - Elsevier
Abstract Three-dimensional (3D) object recognition is widely used in automated driving,
medical image analysis, virtual/augmented reality, artificial intelligence robots, and other …

[HTML][HTML] DILF: Differentiable rendering-based multi-view Image–Language Fusion for zero-shot 3D shape understanding

X Ning, Z Yu, L Li, W Li, P Tiwari - Information Fusion, 2024 - Elsevier
Zero-shot 3D shape understanding aims to recognize “unseen” 3D categories that are not
present in training data. Recently, Contrastive Language–Image Pre-training (CLIP) has …

Deep learning for 3D object recognition: A survey

AAM Muzahid, H Han, Y Zhang, D Li, Y Zhang… - Neurocomputing, 2024 - Elsevier
With the growing availability of extensive 3D datasets and the rapid progress in
computational power, deep learning (DL) has emerged as a highly promising approach for …

Mv-reid: 3d multi-view transformation network for occluded person re-identification

Z Yu, P Tiwari, L Hou, L Li, W Li, L Jiang… - Knowledge-Based Systems, 2024 - Elsevier
Re-identification (ReID) of occluded persons is a challenging task due to the loss of
information in scenes with occlusions. Most existing methods for occluded ReID use 2D …

[PDF][PDF] Intelligent deep learning and improved whale optimization algorithm based framework for object recognition

N Hussain, MA Khan, S Kadry, U Tariq… - Hum. Cent. Comput …, 2021 - researchgate.net
In pattern recognition, object recognition is an important research domain due to major
applications such as autonomous driving, robotics, and visual surveillance. Many computer …

Cross-lingual cross-modal retrieval with noise-robust learning

Y Wang, J Dong, T Liang, M Zhang, R Cai… - Proceedings of the 30th …, 2022 - dl.acm.org
Despite the recent developments in the field of cross-modal retrieval, there has been less
research focusing on low-resource languages due to the lack of manually annotated …

Fine-grained fashion similarity prediction by attribute-specific embedding learning

J Dong, Z Ma, X Mao, X Yang, Y He… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This paper strives to predict fine-grained fashion similarity. In this similarity paradigm, one
should pay more attention to the similarity in terms of a specific design/attribute between …

RGAM: A novel network architecture for 3D point cloud semantic segmentation in indoor scenes

XT Chen, Y Li, JH Fan, R Wang - Information Sciences, 2021 - Elsevier
Abstract Three-dimensional (3D) point cloud semantic segmentation is an essential part of
computer vision for scene comprehension. Nevertheless, due to their loss of detail, existing …

Multi-view 3D object retrieval leveraging the aggregation of view and instance attentive features

D Lin, Y Li, Y Cheng, S Prasad, TL Nwe, S Dong… - Knowledge-Based …, 2022 - Elsevier
In multi-view 3D object retrieval tasks, it is pivotal to aggregate visual features extracted from
multiple view images to generate a discriminative representation for a 3D object. The …

A clustering-guided contrastive fusion for multi-view representation learning

G Ke, G Chao, X Wang, C Xu, Y Zhu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Multi-view representation learning aims to extract comprehensive information from multiple
sources. It has achieved significant success in applications such as video understanding …